RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 47
Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining Paper • 2410.08102 • Published Oct 10, 2024 • 20
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7, 2024 • 55
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30, 2024 • 41
DMLR: Data-centric Machine Learning Research -- Past, Present and Future Paper • 2311.13028 • Published Nov 21, 2023 • 1
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU Paper • 2403.06504 • Published Mar 11, 2024 • 53
DeltaZip: Multi-Tenant Language Model Serving via Delta Compression Paper • 2312.05215 • Published Dec 8, 2023 • 1
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding Paper • 2309.08168 • Published Sep 15, 2023
Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models Paper • 2307.14430 • Published Jul 26, 2023 • 3
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time Paper • 2310.17157 • Published Oct 26, 2023 • 12