Commit 3734d86 (parent: 5cf0426)
Committed by TheDrummer
Update README.md
README.md
CHANGED
@@ -37,7 +37,7 @@ base_model:
 
 *Refer to [Lazarus 2407 100B](https://huggingface.co/TheDrummer/Lazarus-2407-100B) for pruning details.*
 
-Endurance
+Endurance used the same hyperparameters as Behemoth. Training loss indicates that they are exactly the same albeit with lower confidence.
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/s0uELhSkSSwseyBrFzw7q.png)
 