SelfCodeAlign: Self-Alignment for Code Generation Paper • 2410.24198 • Published Oct 31, 2024 • 23
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 90
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Paper • 2405.18392 • Published May 28, 2024 • 12
StarCoder 2 and The Stack v2: The Next Generation Paper • 2402.19173 • Published Feb 29, 2024 • 136