End of training
Browse files- README.md +5 -6
- adapter_model.bin +1 -1
README.md
CHANGED
@@ -107,7 +107,7 @@ special_tokens:
|
|
107 |
|
108 |
This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on the None dataset.
|
109 |
It achieves the following results on the evaluation set:
|
110 |
-
- Loss: 0.
|
111 |
|
112 |
## Model description
|
113 |
|
@@ -144,11 +144,10 @@ The following hyperparameters were used during training:
|
|
144 |
|
145 |
| Training Loss | Epoch | Step | Validation Loss |
|
146 |
|:-------------:|:-----:|:----:|:---------------:|
|
147 |
-
| 0.
|
148 |
-
| 0.
|
149 |
-
| 0.
|
150 |
-
| 0.
|
151 |
-
| 0.0174 | 1.0 | 60 | 0.0202 |
|
152 |
|
153 |
|
154 |
### Framework versions
|
|
|
107 |
|
108 |
This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on the None dataset.
|
109 |
It achieves the following results on the evaluation set:
|
110 |
+
- Loss: 0.1634
|
111 |
|
112 |
## Model description
|
113 |
|
|
|
144 |
|
145 |
| Training Loss | Epoch | Step | Validation Loss |
|
146 |
|:-------------:|:-----:|:----:|:---------------:|
|
147 |
+
| 0.3241 | 0.02 | 1 | 0.3550 |
|
148 |
+
| 0.2785 | 0.25 | 11 | 0.2303 |
|
149 |
+
| 0.2129 | 0.51 | 22 | 0.1771 |
|
150 |
+
| 0.1803 | 0.76 | 33 | 0.1634 |
|
|
|
151 |
|
152 |
|
153 |
### Framework versions
|
adapter_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 848460690
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3819cb5d5e46941f03a7f51ca30705ee8c8cb14dc6cf5ef1b056b94e2798cde2
|
3 |
size 848460690
|