-
llama3
Meta Llama 3: The most capable openly available LLM to date
8B 70B | 1.3M Pulls | 67 Tags | Updated 12 days ago
-
phi3
Phi-3 Mini is a lightweight, state-of-the-art open model by Microsoft with 3.8B parameters.
4B | 205.1K Pulls | 6 Tags | Updated 3 weeks ago
-
wizardlm2
State-of-the-art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning, and agent use cases.
7B 141B | 53K Pulls | 22 Tags | Updated 4 weeks ago
-
mistral
The 7B model released by Mistral AI, updated to version 0.2.
7B | 781.5K Pulls | 68 Tags | Updated 8 weeks ago
-
gemma
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1.
2B 7B | 1.5M Pulls | 102 Tags | Updated 5 weeks ago
-
mixtral
A set of Mixture of Experts (MoE) models with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.
47B 141B | 244.2K Pulls | 69 Tags | Updated 3 days ago
-
llama2
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
7B 13B 70B | 1.6M Pulls | 102 Tags | Updated 3 months ago
-
codegemma
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
2B 7B | 78.6K Pulls | 85 Tags | Updated 2 weeks ago
-
command-r
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
35B | 37.1K Pulls | 17 Tags | Updated 7 weeks ago
-
command-r-plus
Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.
104B | 29.8K Pulls | 6 Tags | Updated 4 weeks ago
-
llava
🌋 LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
7B 13B 34B | 225.7K Pulls | 98 Tags | Updated 3 months ago
-
dbrx
DBRX is an open, general-purpose LLM created by Databricks.
132B | 6,559 Pulls | 7 Tags | Updated 4 weeks ago
-
falcon2
Falcon2 is an 11B-parameter causal decoder-only model built by TII and trained on over 5T tokens.
11B | 3,378 Pulls | 17 Tags | Updated 6 days ago
-
llama3-chatqa
A model from NVIDIA based on Llama 3 that excels at conversational question answering (QA) and retrieval-augmented generation (RAG).
8B 70B | 10.9K Pulls | 35 Tags | Updated 9 days ago
-
llava-phi3
A new small LLaVA model fine-tuned from Phi 3 Mini.
4B | 4,780 Pulls | 4 Tags | Updated 12 days ago
-
llava-llama3
A LLaVA model fine-tuned from Llama 3 Instruct with better scores in several benchmarks.
8B | 8,531 Pulls | 4 Tags | Updated 12 days ago
-
llama3-gradient
This model extends Llama 3 8B's context length from 8K to over 1M tokens.
8B 70B | 18.7K Pulls | 35 Tags | Updated 2 weeks ago
-
moondream
moondream2 is a small vision language model designed to run efficiently on edge devices.
1B | 7,752 Pulls | 18 Tags | Updated 7 days ago
-
dolphin-llama3
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
8B 70B | 47.4K Pulls | 54 Tags | Updated 9 days ago
-
codeqwen
CodeQwen1.5 is a large language model pretrained on a large amount of code data.
7B | 19.9K Pulls | 21 Tags | Updated 4 weeks ago
-
snowflake-arctic-embed
A suite of text embedding models by Snowflake, optimized for performance.
23M 33M 109M 137M 334M | 8,230 Pulls | 16 Tags | Updated 4 weeks ago
-
mxbai-embed-large
State-of-the-art large embedding model from mixedbread.ai.
334M | 42.7K Pulls | 4 Tags | Updated 13 days ago
-
dolphincoder
Uncensored 7B and 15B variants of the Dolphin model family that excel at coding, based on StarCoder2.
7B 16B | 26.3K Pulls | 35 Tags | Updated 5 weeks ago
-
starcoder2
StarCoder2 is the next generation of transparently trained open code LLMs, available in three sizes: 3B, 7B, and 15B parameters.
3B 7B 16B | 60.9K Pulls | 67 Tags | Updated 2 weeks ago
-
all-minilm
Embedding models trained on very large sentence-level datasets.
23M 33M | 18.7K Pulls | 10 Tags | Updated 13 days ago