End of training

Files changed (5)

README.md CHANGED

@@ -2,8 +2,6 @@
 license: other
 library_name: peft
 tags:
-- trl
-- sft
 - generated_from_trainer
 base_model: meta-llama/Meta-Llama-3-8B
 datasets:
@@ -20,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the scitldr dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.4126
 ## Model description
@@ -53,9 +51,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 2.2246        | 0.5020 | 500  | 2.3259          |
-| 2.2211        | 1.0040 | 1000 | 2.3181          |
-| 1.684         | 1.5060 | 1500 | 2.4126          |
 ### Framework versions

 license: other
 library_name: peft
 tags:
 - generated_from_trainer
 base_model: meta-llama/Meta-Llama-3-8B
 datasets:
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the scitldr dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.4051
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 2.226         | 0.5020 | 500  | 2.3232          |
+| 2.2207        | 1.0040 | 1000 | 2.3130          |
+| 1.6901        | 1.5060 | 1500 | 2.4051          |
 ### Framework versions
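
Note on the README.md hunks: the only substantive change is the set of loss values. A minimal sketch for comparing the two runs, using only the validation losses shown in the removed and added table rows (the names `old_run`, `new_run`, and `loss_deltas` are illustrative and not part of the repository):

```python
# Validation losses copied from the removed (-) and added (+) table rows above.
# Keys are training steps, values are validation loss.
old_run = {500: 2.3259, 1000: 2.3181, 1500: 2.4126}
new_run = {500: 2.3232, 1000: 2.3130, 1500: 2.4051}


def loss_deltas(old, new):
    """Return (new - old) validation loss per step, rounded to 4 decimals."""
    return {step: round(new[step] - old[step], 4) for step in sorted(old)}


deltas = loss_deltas(old_run, new_run)
for step, delta in deltas.items():
    # A negative delta means the new run has the lower validation loss.
    print(f"step {step}: {delta:+.4f}")
```

In both runs the validation loss at step 1500 is higher than at step 1000, so the reported final loss is not the lowest value in either table.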

adapter_config.json CHANGED

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k_proj",
     "up_proj",
-    "o_proj",
     "v_proj",
     "q_proj",
-    "gate_proj",
-    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "gate_proj",
     "up_proj",
+    "down_proj",
     "v_proj",
+    "k_proj",
     "q_proj",
+    "o_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

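Note on the adapter_config.json hunk: the seven entries in `target_modules` are the same before and after, and only their order differs. A quick check, with both lists copied from the diff above (the reordering presumably comes from serializing an unordered collection, which is an assumption the diff itself does not confirm):

```python
# Order as it appears on the removed/context side of the hunk.
old_modules = ["k_proj", "up_proj", "o_proj", "v_proj",
               "q_proj", "gate_proj", "down_proj"]
# Order as it appears on the added/context side of the hunk.
new_modules = ["gate_proj", "up_proj", "down_proj", "v_proj",
               "k_proj", "q_proj", "o_proj"]

# Compare as sets so that ordering is ignored.
same_set = set(old_modules) == set(new_modules)
order_changed = old_modules != new_modules

print(f"same modules: {same_set}, order changed: {order_changed}")
```

If the check holds, this hunk is a formatting-only change and does not alter which layers the adapter targets.
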
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b0aaa9ac7a872e1ae5c60c4412ffb59d83417802336915a9a36097ebe85a05f7
 size 167832240

 version https://git-lfs.github.com/spec/v1
+oid sha256:b9b7955f3697b07107f191eb54ad1b4421f1bd91a41ef72583b046fe42e9a517
 size 167832240

runs/Apr25_21-32-23_4c1f1b88f73d/events.out.tfevents.1714080820.4c1f1b88f73d.55510.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:0e4d82767bc1bb1a7c882ae1ac75de756ca0ec189f444eba6a350775db5fa5bd
+size 8317

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d373a0484ed51cd82ead25f7eba9fcb483902b51f996ada263529c79449ce92d
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:a5e30dfb176d6a15fce26b0121eaf0b249e3d5a959bcb2304b365ade09850a5a
 size 4984
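
Note on the binary files: the diffs for adapter_model.safetensors, the events file, and training_args.bin show Git LFS pointer files rather than the binary contents. A minimal sketch of reading such a pointer and comparing two revisions, using the adapter_model.safetensors values from above (`parse_lfs_pointer` is a hypothetical helper, not a Git LFS or Hub API):

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file (key/value lines) into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:b0aaa9ac7a872e1ae5c60c4412ffb59d83417802336915a9a36097ebe85a05f7
size 167832240
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:b9b7955f3697b07107f191eb54ad1b4421f1bd91a41ef72583b046fe42e9a517
size 167832240
""")

# Same byte size but a different hash: the contents changed, the layout did not.
print(old["size"] == new["size"], old["digest"] != new["digest"])
```

An unchanged size with a changed `oid` is consistent with retrained adapter weights of the same shapes, although the pointer alone cannot confirm what changed inside the file.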