End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1400
 ## Model description
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- training_steps: 2350
 ### Training results
@@ -140,10 +140,20 @@ The following hyperparameters were used during training:
 | 2.2717        | 0.35  | 2200 | 2.2825          |
 | 2.1958        | 0.36  | 2225 | 2.2827          |
 | 2.1635        | 0.36  | 2250 | 2.2867          |
-| 2.3032        | 0.36  | 2275 | 2.2841          |
-| 2.3084        | 0.37  | 2300 | 2.2846          |
-| 2.1975        | 0.37  | 2325 | 2.2834          |
-| 2.2212        | 0.38  | 2350 | 2.2828          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1370
 ## Model description
 - total_eval_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- training_steps: 2600
 ### Training results
 | 2.2717        | 0.35  | 2200 | 2.2825          |
 | 2.1958        | 0.36  | 2225 | 2.2827          |
 | 2.1635        | 0.36  | 2250 | 2.2867          |
+| 2.3043        | 0.36  | 2275 | 2.2820          |
+| 2.3106        | 0.37  | 2300 | 2.2845          |
+| 2.1992        | 0.37  | 2325 | 2.2829          |
+| 2.2189        | 0.38  | 2350 | 2.2817          |
+| 2.3096        | 0.38  | 2375 | 2.2830          |
+| 2.2803        | 0.38  | 2400 | 2.2839          |
+| 2.1752        | 0.39  | 2425 | 2.2825          |
+| 2.1324        | 0.39  | 2450 | 2.2834          |
+| 2.207         | 0.4   | 2475 | 2.2846          |
+| 2.3369        | 0.4   | 2500 | 2.2844          |
+| 2.3512        | 0.4   | 2525 | 2.2853          |
+| 2.1432        | 0.41  | 2550 | 2.2861          |
+| 2.1743        | 0.41  | 2575 | 2.2832          |
+| 2.2696        | 0.42  | 2600 | 2.2832          |
 ### Framework versions

model-00001-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:46332f63aaed4988d57cd0a8430eb291dd6408f1a0edd2a3907cd5e636eabea5
 size 4938985352

 version https://git-lfs.github.com/spec/v1
+oid sha256:50e638e2dcee3f62f30d649753a4e0c5873f07f48fd0534ee5da240911197a62
 size 4938985352

model-00002-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:01046014a6305273fb40c751b7c90fa5f1e2f335f6cebd326c15b404992da4e8
 size 4947390880

 version https://git-lfs.github.com/spec/v1
+oid sha256:fc8827b836d19c7ef8519c38bf7492c1be686b7a84dc372b1235e1a11b1fde1f
 size 4947390880

model-00003-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1e3ee84cc62fac73a24dd8334a9ec4cc43b640ca08166177bc3bd936d43b9c3c
 size 3590488816

 version https://git-lfs.github.com/spec/v1
+oid sha256:c6f9a673c42afa9fbd42506dc3d04562c20fc2ff9b6dfca8bbd63e6f6bcd6e30
 size 3590488816