Commit 9ee4765, committed by lordspline
1 parent: 914b165

End of training

Files changed:
- README.md +6 -6
- pytorch_model.bin +1 -1
README.md CHANGED
@@ -26,9 +26,9 @@ strict: false
 
 chat_template: chatml
 datasets:
-
-
-
+  - path: lordspline/scidata
+    type: sharegpt
+    conversation: chatml
   - path: lordspline/wizard_v2_196k_unfiltered
     type: sharegpt
     conversation: chatml
@@ -80,7 +80,7 @@ warmup_steps: 10
 evals_per_epoch: 1
 eval_table_size:
 eval_max_new_tokens: 128
-saves_per_epoch:
+saves_per_epoch: 300
 debug:
 
 # deepspeed: deepspeed_configs/zero3_bf16.json
@@ -101,7 +101,7 @@ tokens:
 
 This model is a fine-tuned version of [lordspline/mergestein](https://huggingface.co/lordspline/mergestein) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
+- Loss: 1.2069
 
 ## Model description
 
@@ -133,7 +133,7 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 1.
+| 1.4476 | 1.0 | 48435 | 1.2069 |
 
 
 ### Framework versions
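As a rough illustration of what `saves_per_epoch: 300` implies for this run, the sketch below spreads 300 checkpoints evenly over the 48435 steps that the results table reports at epoch 1.0. Even spacing is an assumption made here for the arithmetic, not a statement about how the trainer actually schedules saves, and `steps_between_saves` is a hypothetical helper name.

```python
def steps_between_saves(steps_per_epoch: int, saves_per_epoch: int) -> float:
    """Average number of optimizer steps between checkpoints,
    assuming saves are spread evenly across one epoch."""
    if saves_per_epoch <= 0:
        raise ValueError("saves_per_epoch must be positive")
    return steps_per_epoch / saves_per_epoch


# Values taken from the diff: step 48435 at epoch 1.0, saves_per_epoch: 300.
interval = steps_between_saves(48435, 300)
print(f"about one checkpoint every {interval:.2f} steps")
```

Under that assumption a checkpoint would be written roughly every 161 steps, which is worth keeping in mind for disk usage given that each `pytorch_model.bin` in this commit is 1589947346 bytes.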
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:33b1343643141d6472066aaebabd297f82b0f61ecd310ecb18d4d7c616dc8cb0
 size 1589947346
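The `pytorch_model.bin` entry in the repository is a Git LFS pointer, not the weights themselves: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. A minimal sketch of reading such a pointer and checking a downloaded file against it follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is copied from the new side of the diff above.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so a multi-gigabyte file is never held in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:33b1343643141d6472066aaebabd297f82b0f61ecd310ecb18d4d7c616dc8cb0
size 1589947346
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("pytorch_model.bin", pointer)` on a local download would confirm that the file corresponds to this commit. Note that the size is the same before and after the change, so only the digest distinguishes the two versions.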