Nickyang commited on
Commit
d18c25e
·
verified ·
1 Parent(s): bb2ce63

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -14,7 +14,7 @@ library_name: transformers
14
 
15
  ## FastCuRL Overview
16
 
17
- We release **FastCuRL-1.5B-Preview**, a slow-thinking reasoning model that achieves 43.1% accuracy on the AIME 2024 benchmark! We adapt a novel curriculum-guided iterative lengthening reinforcement learning to the distilled 1.5B model and observe continuous performance improvement as training steps increase. To better reproduce our work and advance research progress, we open-source our code, model, and data.
18
 
19
  Code: https://github.com/nick7nlp/FastCuRL
20
 
 
14
 
15
  ## FastCuRL Overview
16
 
17
+ We release **FastCuRL-1.5B-Preview**, a slow-thinking reasoning model that **outperforms** the previous SoTA *DeepScaleR-1.5B-Preview* with **50% training steps**! We adapt a novel curriculum-guided iterative lengthening reinforcement learning to the *DeepSeek-R1-Distill-Qwen-1.5B* and observe continuous performance improvement as training steps increase. To better reproduce our work and advance research progress, we open-source our code, model, and data.
18
 
19
  Code: https://github.com/nick7nlp/FastCuRL
20