admarcosai's Collections
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Paper • 2312.06585 • Published • 26
TinyGSM: achieving >80% on GSM8k with small language models
Paper • 2312.09241 • Published • 34
Paper • 2309.17425 • Published • 6
jondurbin/gutenberg-dpo-v0.1
Viewer • Updated • 918 • 127 • 64
garage-bAInd/Open-Platypus
Viewer • Updated • 24.9k • 11.9k • 347
open-web-math/open-web-math
Viewer • Updated • 6.32M • 1.89k • 238
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer • Updated • 60.9k • 3.86k • 103
WhiteRabbitNeo/Code-Functions-Level-Cyber
Viewer • Updated • 8.44k • 3 • 15
WhiteRabbitNeo/Code-Functions-Level-General
Viewer • Updated • 8.69k • 17 • 8
selfrag/selfrag_train_data
Viewer • Updated • 146k • 337 • 60
Locutusque/UltraTextbooks
Viewer • Updated • 5.52M • 390 • 184
Undi95/ConversationChronicles-sharegpt-SHARDED
Viewer • Updated • 787k • 5 • 6
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Paper • 2402.10176 • Published • 33
togethercomputer/RedPajama-Data-1T
Viewer • Updated • 1.73M • 2.03k • 1.02k