LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated 4 days ago • 121
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published 14 days ago • 39
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published 6 days ago • 70
4M Models Collection Multimodal models from https://4m.epfl.ch/ • 14 items • Updated 17 days ago • 29
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 17 days ago • 146
Show, Don't Tell: Aligning Language Models with Demonstrated Feedback Paper • 2406.00888 • Published 29 days ago • 29
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts Paper • 2405.11273 • Published May 18 • 17
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 22 items • Updated May 31 • 348
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 4 days ago • 118
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 20 items • Updated 3 days ago • 145
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 17 days ago • 37
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30 • 65
Llama 3 - Smashed Collection Many variations of Llama 3 with many compression techniques. • 81 items • Updated May 2 • 5
RePLan: Robotic Replanning with Perception and Language Models Paper • 2401.04157 • Published Jan 8 • 3
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15 • 56
Learning to Learn Faster from Human Feedback with Language Model Predictive Control Paper • 2402.11450 • Published Feb 18 • 20