Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 3 days ago • 277
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 13 items • Updated 4 days ago • 41
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 84
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 60
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 11 days ago • 339
📀 Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12 • 27
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 100
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 81