End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -102,7 +102,7 @@ xformers_attention: true
 This model is a fine-tuned version of [unsloth/SmolLM-1.7B-Instruct](https://huggingface.co/unsloth/SmolLM-1.7B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.1869
 ## Model description
@@ -140,8 +140,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 6.6306        | 0.0138 | 1    | 6.8053          |
-| 4.2874        | 0.3460 | 25   | 3.8760          |
-| 3.5258        | 0.6920 | 50   | 3.1869          |
 ### Framework versions

 This model is a fine-tuned version of [unsloth/SmolLM-1.7B-Instruct](https://huggingface.co/unsloth/SmolLM-1.7B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.1944
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 6.6306        | 0.0138 | 1    | 6.8053          |
+| 4.3006        | 0.3460 | 25   | 3.8811          |
+| 3.5398        | 0.6920 | 50   | 3.1944          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -21,12 +21,12 @@
   "revision": null,
   "target_modules": [
     "q_proj",
-    "v_proj",
-    "up_proj",
     "gate_proj",
     "down_proj",
     "o_proj",
-    "k_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "revision": null,
   "target_modules": [
     "q_proj",
     "gate_proj",
+    "k_proj",
     "down_proj",
     "o_proj",
+    "v_proj",
+    "up_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6198dc25f93848a8b32d31ba965d7ca0a66f5ff745b7168653322ba304c2bd64
 size 144824970

 version https://git-lfs.github.com/spec/v1
+oid sha256:dec1ad0ee3abf0ffc8a21d0e98af81338e22e2b965346f96c9e096c956d3732c
 size 144824970

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a334af04127f5624a904703d6ecadb97c848b90372ed8a7447abecd288cd4255
 size 144748392

 version https://git-lfs.github.com/spec/v1
+oid sha256:77de871694caae9be418270cfe1846fd732ec0169d8d218391fbaec100b30020
 size 144748392

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:297365864432a66351de4a28dcab5df4abc8350f5eb8210d38b1e903e733b0b8
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:4a76349aa92ef48b300c140adc465b946c6362057acd6daf4a99536f902f6066
 size 6776