qingy2024/UwU-14B-Math-v0.2

qingy2024 committed 13 days ago

Commit e71fd11 • 1 Parent(s): 44c844c

Update README.md

Files changed (1)

README.md +2 -0

@@ -22,6 +22,8 @@ This model is a fine-tuned version of **Qwen 2.5-14B**, trained on QwQ 32B Previ
 **Note:** This model uses the standard ChatML template.
+At 500 steps, the loss was plateauing so I decided to stop training to prevent excessive overfitting.
 ---
 #### Training Details
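
The added README line describes stopping at 500 steps because the loss was plateauing. The commit does not say how the plateau was judged, so the following is only a minimal sketch of one common heuristic: compare the mean loss over the most recent window of steps with the mean over the window before it. The function name `loss_plateaued` and the parameters `window` and `min_rel_improvement` are hypothetical and are not taken from the author's training setup.

```python
def loss_plateaued(losses, window=50, min_rel_improvement=0.01):
    """Return True when the recent loss window barely improved on the previous one.

    Hypothetical helper: `losses` is a list of per-step training losses.
    `window` and `min_rel_improvement` are illustrative defaults, not values
    used for this model.
    """
    # Not enough history to compare two full windows.
    if len(losses) < 2 * window:
        return False

    previous_mean = sum(losses[-2 * window:-window]) / window
    recent_mean = sum(losses[-window:]) / window

    # Relative improvement of the recent window over the previous one.
    # The small floor avoids division by zero if the loss reaches 0.
    denominator = max(abs(previous_mean), 1e-12)
    relative_improvement = (previous_mean - recent_mean) / denominator

    return relative_improvement < min_rel_improvement


# Example: a loss that falls steadily for 100 steps, then stays flat.
falling = [2.0 - 0.01 * step for step in range(100)]
flat = [1.0] * 100

still_improving = loss_plateaued(falling)
has_plateaued = loss_plateaued(falling + flat)
```

A check like this only looks at training loss. To judge overfitting directly, the same comparison would usually be run on a held-out evaluation loss instead.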