Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published 6 days ago • 25
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published 14 days ago • 25
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation Paper • 2410.08371 • Published Oct 10 • 1
GGUF Llama-3.2-Instruct-OQ8_0-F32.EF32.IQ4_K-Q8_0 IQuants Collection Custom GGUF quants of Meta’s Llama-3.2-Instruct's finetunes, where the Output Tensors are quantized to Q8_0 or F32 and the Embeddings are kept @F32 • 3 items • Updated 17 days ago • 2
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 23 days ago • 548
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28 • 258
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12 • 65
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28 • 446
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation Paper • 2402.16880 • Published Feb 18 • 2
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 23 days ago • 636
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 146
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • Apr 28 • 37
Honorable mentions Collection Some models I've made and I liked but isn't part of a serie. • 10 items • Updated Feb 4 • 6