Update README.md
Browse files
README.md
CHANGED
@@ -22,6 +22,8 @@ This model is a fine-tuned version of **Qwen 2.5-14B**, trained on QwQ 32B Previ
|
|
22 |
|
23 |
**Note:** This model uses the standard ChatML template.
|
24 |
|
|
|
|
|
25 |
---
|
26 |
|
27 |
#### Training Details
|
|
|
22 |
|
23 |
**Note:** This model uses the standard ChatML template.
|
24 |
|
25 |
+
At 500 steps, the loss was plateauing so I decided to stop training to prevent excessive overfitting.
|
26 |
+
|
27 |
---
|
28 |
|
29 |
#### Training Details
|