Article Releasing the largest multilingual open pretraining dataset By Pclanglais and 2 others • Nov 13, 2024 • 98
Gemma 2 2B Release Collection The 2.6B parameter version of Gemma 2. • 6 items • Updated Dec 13, 2024 • 78
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 88
Post 1972 cooking up something... anyone interested in a daily activity tracker for HF? 12 replies · ❤️ 43 ➕ 14 🔥 14 👀 1
Running on CPU Upgrade 4.75k MTEB Leaderboard 🥇 Select and filter benchmarks for text embedding tasks
Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation Apr 29, 2024 • 76