Update README.md
README.md
CHANGED
@@ -4,6 +4,8 @@ license: "apache-2.0"
 
 *This model was trained as part of a series of experiments testing the performance of pure DPO vs SFT vs ORPO, all supported by Unsloth/Huggingface TRL.*
 
+Note: It seems pretty broken.
+
 **Benchmarks**
 
 TBA