-
llama3
Meta Llama 3: The most capable openly available LLM to date
1.3M Pulls 67 Tags Updated 8 days ago
-
phi3
Phi-3 Mini is a lightweight, state-of-the-art open model by Microsoft with 3.8B parameters.
172.5K Pulls 6 Tags Updated 3 weeks ago
-
wizardlm2
State-of-the-art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning, and agent use cases.
50.2K Pulls 22 Tags Updated 4 weeks ago
-
mistral
The 7B model released by Mistral AI, updated to version 0.2.
765.9K Pulls 68 Tags Updated 7 weeks ago
-
gemma
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1.
1.4M Pulls 102 Tags Updated 5 weeks ago
-
mixtral
A set of Mixture of Experts (MoE) models with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.
235.5K Pulls 69 Tags Updated 12 days ago
-
llama2
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
1.5M Pulls 102 Tags Updated 3 months ago
-
codegemma
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
72.4K Pulls 85 Tags Updated 12 days ago
-
command-r
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
35.7K Pulls 17 Tags Updated 6 weeks ago
-
command-r-plus
Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.
28.8K Pulls 6 Tags Updated 4 weeks ago
-
llava
🌋 LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
215.6K Pulls 98 Tags Updated 3 months ago
-
dbrx
DBRX is an open, general-purpose LLM created by Databricks.
6,175 Pulls 7 Tags Updated 4 weeks ago
-
falcon2
Falcon2 is an 11B-parameter causal decoder-only model built by TII and trained over 5T tokens.
2,106 Pulls 17 Tags Updated 2 days ago
-
llama3-chatqa
A model from NVIDIA based on Llama 3 that excels at conversational question answering (QA) and retrieval-augmented generation (RAG).
7,884 Pulls 35 Tags Updated 5 days ago
-
llava-phi3
A new small LLaVA model fine-tuned from Phi 3 Mini.
3,714 Pulls 4 Tags Updated 8 days ago
-
llava-llama3
A LLaVA model fine-tuned from Llama 3 Instruct with better scores in several benchmarks.
6,720 Pulls 4 Tags Updated 8 days ago
-
llama3-gradient
This model extends Llama 3 8B's context length from 8K to over 1M tokens.
16.6K Pulls 35 Tags Updated 11 days ago
-
moondream
moondream2 is a small vision language model designed to run efficiently on edge devices.
6,910 Pulls 19 Tags Updated 3 days ago
-
dolphin-llama3
Dolphin 2.9 is a new model by Eric Hartford, based on Llama 3 and available in 8B and 70B sizes, with a variety of instruction, conversational, and coding skills.
43.1K Pulls 54 Tags Updated 5 days ago
-
codeqwen
CodeQwen1.5 is a large language model pretrained on a large amount of code data.
18.2K Pulls 21 Tags Updated 4 weeks ago
-
snowflake-arctic-embed
A suite of text embedding models by Snowflake, optimized for performance.
7,493 Pulls 16 Tags Updated 4 weeks ago
-
mxbai-embed-large
State-of-the-art large embedding model from mixedbread.ai.
37.3K Pulls 4 Tags Updated 9 days ago
-
dolphincoder
Uncensored 7B and 15B variants of the Dolphin model family that excel at coding, based on StarCoder2.
25.1K Pulls 35 Tags Updated 5 weeks ago
-
starcoder2
StarCoder2 is the next generation of transparently trained open code LLMs that come in three sizes: 3B, 7B, and 15B parameters.
57.2K Pulls 67 Tags Updated 2 weeks ago
-
all-minilm
Embedding models trained on very large sentence-level datasets.
17.3K Pulls 10 Tags Updated 9 days ago