qingy2024 committed on
Commit
e71fd11
1 Parent(s): 44c844c

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -22,6 +22,8 @@ This model is a fine-tuned version of **Qwen 2.5-14B**, trained on QwQ 32B Previ
 
 **Note:** This model uses the standard ChatML template.
 
+At 500 steps, the loss was plateauing so I decided to stop training to prevent excessive overfitting.
+
 ---
 
 #### Training Details
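
The added README line describes stopping at 500 steps because the loss was plateauing. The commit does not say how the plateau was judged, so the following is only a minimal sketch of one common heuristic: compare the mean loss over the most recent window of steps with the window before it. The function name `loss_has_plateaued` and the parameters `window` and `min_rel_improvement` are hypothetical and are not taken from the author's training setup.

```python
from statistics import mean


def loss_has_plateaued(losses, window=50, min_rel_improvement=0.01):
    """Return True when the recent mean loss barely improved on the previous window.

    Hypothetical helper: `window` and `min_rel_improvement` are illustrative
    defaults, not values from the original training run.
    """
    if len(losses) < 2 * window:
        # Not enough history to compare two full windows.
        return False
    previous = mean(losses[-2 * window:-window])
    recent = mean(losses[-window:])
    # Plateau if the absolute improvement is at most a small fraction of the
    # previous window's mean loss.
    return (previous - recent) <= min_rel_improvement * abs(previous)
```

A training loop could call this on the logged per-step losses every few steps and stop once it returns True. Note that this only tracks training loss; guarding against overfitting in general would also involve watching a held-out evaluation loss.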