Dev-SriramB/qa_final01
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
+- Loss: 1.2424
 
 ## Model description
 
@@ -44,26 +44,29 @@ The following hyperparameters were used during training:
 - optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- num_epochs:
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-
-
-
-
-
-
-
+| 2.4997 | 1.0 | 25 | 2.1442 |
+| 1.982 | 2.0 | 50 | 1.7214 |
+| 1.5052 | 3.0 | 75 | 1.3375 |
+| 1.2921 | 4.0 | 100 | 1.2731 |
+| 1.2246 | 5.0 | 125 | 1.2574 |
+| 1.18 | 6.0 | 150 | 1.2512 |
+| 1.1403 | 7.0 | 175 | 1.2428 |
+| 1.1051 | 8.0 | 200 | 1.2458 |
+| 1.0811 | 9.0 | 225 | 1.2420 |
+| 1.0686 | 10.0 | 250 | 1.2424 |
 
 
 ### Framework versions
 
 - PEFT 0.14.0
-- Transformers 4.
+- Transformers 4.48.2
 - Pytorch 2.5.1+cu124
 - Datasets 3.2.0
 - Tokenizers 0.21.0
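For context, the hyperparameters visible in this hunk map onto `transformers.TrainingArguments` roughly as follows. This is a minimal sketch, not the run's actual configuration: `output_dir`, the learning rate, and batch sizes are not shown in this diff and are placeholders.

```python
from transformers import TrainingArguments

# Sketch of the hyperparameters listed in the README hunk; values the
# diff does not show (output_dir, learning_rate, batch sizes) are
# placeholders, not values from the actual run.
args = TrainingArguments(
    output_dir="qa_final01",      # placeholder
    num_train_epochs=10,          # num_epochs: 10
    optim="paged_adamw_8bit",     # OptimizerNames.PAGED_ADAMW_8BIT
    adam_beta1=0.9,               # betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-08,           # epsilon=1e-08
    lr_scheduler_type="linear",
    warmup_steps=2,               # lr_scheduler_warmup_steps: 2
    fp16=True,                    # mixed_precision_training: Native AMP
)
```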
adapter_config.json CHANGED
@@ -19,12 +19,12 @@
   "megatron_core": "megatron.core",
   "modules_to_save": null,
   "peft_type": "LORA",
-  "r":
+  "r": 8,
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "
-    "
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
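The hunk above pins the LoRA rank and target modules. A minimal sketch of the equivalent `peft.LoraConfig` follows; `lora_alpha` and `lora_dropout` sit outside the visible hunk, so PEFT defaults are assumed here.

```python
from peft import LoraConfig

# Reconstructed from the visible hunk; unlisted fields left at defaults.
config = LoraConfig(
    r=8,                                  # "r": 8
    target_modules=["q_proj", "v_proj"],  # attention query/value projections
    task_type="CAUSAL_LM",                # "task_type": "CAUSAL_LM"
)
```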
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:0af397b7982754cda5a356966cc848e69c764c39688e81a0ec3915060e569a93
+size 13648432
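The new 13,648,432-byte size is consistent with an r=8 adapter on `q_proj` and `v_proj`, assuming the base model's published Mistral-7B shape (32 layers, hidden size 4096, 1024-dim `v_proj` output from 8 KV heads) and fp32 adapter weights; the small remainder would be safetensors header metadata.

```python
# Back-of-the-envelope size check, assuming Mistral-7B dimensions.
r, hidden, kv_dim, layers = 8, 4096, 1024, 32
q_proj = r * hidden + hidden * r     # LoRA A + B matrices for q_proj
v_proj = r * hidden + kv_dim * r     # LoRA A + B matrices for v_proj
params = layers * (q_proj + v_proj)
print(params)                        # 3,407,872 trainable parameters
print(params * 4)                    # 13,631,488 bytes at fp32, ~= 13,648,432
```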
runs/Feb07_15-29-35_4f00189539be/events.out.tfevents.1738942178.4f00189539be.446.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:561b7e355a76f9b4c97b0e62788119e4ec45476a338551c4d0aa0a6440dc74cb
+size 10855
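This file, like the weight files above, is checked in as a Git LFS pointer rather than the payload itself: three `key value` lines giving the spec version, the SHA-256 of the real blob, and its byte size. A throwaway sketch of reading one such pointer (the `parse_lfs_pointer` helper is hypothetical, not part of any library):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer file."""
    return dict(line.split(" ", 1) for line in text.strip().splitlines())

pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:561b7e355a76f9b4c97b0e62788119e4ec45476a338551c4d0aa0a6440dc74cb
size 10855"""
fields = parse_lfs_pointer(pointer)
print(fields["oid"], fields["size"])  # sha256:561b... 10855
```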
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:08470a030982ef2f9b1879f96c8aa61a8fe1c05160157935e64ff2c6b68c957f
 size 5304
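Taken together, the commit completes a usable checkpoint: a LoRA adapter plus its config on top of the GPTQ base. A sketch of how such an adapter is typically loaded with PEFT (untested against this exact repo; loading a GPTQ checkpoint additionally requires a GPTQ-capable stack such as optimum/auto-gptq alongside transformers):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ"
# Load the quantized base model, then attach the fine-tuned adapter.
base = AutoModelForCausalLM.from_pretrained(
    base_id, device_map="auto", torch_dtype=torch.float16
)
model = PeftModel.from_pretrained(base, "Dev-SriramB/qa_final01")
tokenizer = AutoTokenizer.from_pretrained(base_id)
```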