
Foundation Text-Generation Models Below 360M Parameters
Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters.
Text Generation • Updated • 19.9k • • 39Note License: apache-2.0 Context Length: 8k
OuteAI/Lite-Oute-1-300M
Text Generation • Updated • 366 • 7Note License: apache-2.0 Context Length: 4k
keeeeenw/MicroLlama
Text Generation • Updated • 1.99k • 43Note License: apache-2.0 Context Length: 2k
cerebras/Cerebras-GPT-256M
Text Generation • Updated • 2k • 25Note License: apache-2.0 Context Length: 2k
upstage/TinySolar-248m-4k
Text Generation • Updated • 802 • 5Note License: apache-2.0 Context Length: 4k
M4-ai/TinyMistral-248M-v3
Text Generation • Updated • 238 • 6Note License: apache-2.0 Context Length: 2k
MiniLLM/MiniPLM-llama3.1-212M
Text Generation • Updated • 847 • 2Note License: apache-2.0 Context Length: ?
MiniLLM/MiniPLM-Qwen-200M
Text Generation • Updated • 143 • 2Note License: apache-2.0 Context Length: ?
princeton-nlp/Sheared-Pythia-160m
Text Generation • Updated • 162 • 4Note License: apache-2.0 Context Length: 2k
JackFram/llama-160m
Text Generation • Updated • 267k • 33Note License: apache-2.0 Context Length: 2k
SmallDoge/Doge-160M
Text Generation • Updated • 7.09k • 2Note License: apache-2.0 Context Length: 2k
EleutherAI/pythia-160m
Text Generation • Updated • 150k • • 29Note License: apache-2.0 Context Length: 2k
HuggingFaceTB/SmolLM2-135M
Text Generation • Updated • 256k • • 65Note License: apache-2.0 Context Length: 8k
amd/AMD-Llama-135m
Text Generation • Updated • 17.4k • • 111Note License: apache-2.0 Context Length: 2k
MiniLLM/MiniPLM-Mamba-130M
Text Generation • Updated • 24 • 2Note License: apache-2.0 Context Length: ?
EleutherAI/gpt-neo-125m
Text Generation • Updated • 135k • • 196Note License: mit Context Length: 2k
cerebras/Cerebras-GPT-111M
Text Generation • Updated • 4.46k • • 76Note License: apache-2.0 Context Length: 2k
BEE-spoke-data/smol_llama-101M-GQA
Text Generation • Updated • 5k • • 28Note License: apache-2.0 Context Length: 1k
openai-community/gpt2
Text Generation • Updated • 16.6M • • 2.6kNote License: mit Context Length: 1k
distilbert/distilgpt2
Text Generation • Updated • 1.87M • • 489Note License: apache-2.0 Context Length: 1k
weiser/82M-0.4
Text Generation • Updated • 52Note License: apache-2.0 Context Length: 1k
BEE-spoke-data/smol_llama-81M-tied
Text Generation • Updated • 2.96k • 6Note License: apache-2.0 Context Length: 1k
EleutherAI/pythia-70m
Updated • 71.9k • 64Note License: apache-2.0 Context Length: 2k
JackFram/llama-68m
Text Generation • Updated • 530k • 26Note License: apache-2.0 Context Length: 2k
OuteAI/Lite-Oute-1-65M
Text Generation • Updated • 222 • 9Note License: apache-2.0 Context Length: 2k
SmallDoge/Doge-60M
Text Generation • Updated • 23.8k • 4Note License: apache-2.0 Context Length: 2k
Felladrin/Minueza-32M-Base
Text Generation • Updated • 410 • 16Note License: apache-2.0 Context Length: 2k
GerbilLab/Gerbil-A-32m
Text Generation • Updated • 76 • 2Note License: apache-2.0 Context Length: 2k
EleutherAI/pythia-31m
Text Generation • Updated • 13.1k • 5Note License: apache-2.0 Context Length: 2k
SmallDoge/Doge-20M
Text Generation • Updated • 20.1k • 7Note License: apache-2.0 Context Length: 2k
EleutherAI/pythia-14m
Text Generation • Updated • 119k • 20Note License: apache-2.0 Context Length: 2k