The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper ā¢ 2406.17557 ā¢ Published Jun 25 ā¢ 87
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Paper ā¢ 2405.18392 ā¢ Published May 28 ā¢ 12
Power Hungry Processing: Watts Driving the Cost of AI Deployment? Paper ā¢ 2311.16863 ā¢ Published Nov 28, 2023 ā¢ 6
What's in the Box? A Preliminary Analysis of Undesirable Content in the Common Crawl Corpus Paper ā¢ 2105.02732 ā¢ Published May 6, 2021
OctoPack: Instruction Tuning Code Large Language Models Paper ā¢ 2308.07124 ā¢ Published Aug 14, 2023 ā¢ 28
Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness Paper ā¢ 2302.10893 ā¢ Published Feb 7, 2023 ā¢ 6
Evaluating the Social Impact of Generative AI Systems in Systems and Society Paper ā¢ 2306.05949 ā¢ Published Jun 9, 2023 ā¢ 9
Quantifying the Carbon Emissions of Machine Learning Paper ā¢ 1910.09700 ā¢ Published Oct 21, 2019 ā¢ 11
ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods Paper ā¢ 2110.02871 ā¢ Published Oct 6, 2021
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements Paper ā¢ 2210.01970 ā¢ Published Sep 30, 2022 ā¢ 11
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset Paper ā¢ 2303.03915 ā¢ Published Mar 7, 2023 ā¢ 6
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper ā¢ 2211.05100 ā¢ Published Nov 9, 2022 ā¢ 27