Revert "merge"

Browse files

This reverts commit d2a4baf26cc9b085e92fc52e41b49d5c388af957.

Files changed (4) hide show

README.md +29 -0
adapter_config.json +2 -2
adapter_model.safetensors +2 -2
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -12,9 +12,12 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # starcoder2-3b-peft-lora
 This model is a fine-tuned version of [bigcode/starcoder2-3b](https://huggingface.co/bigcode/starcoder2-3b) on an unknown dataset.
 ## Model description
@@ -42,6 +45,32 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 30
 - training_steps: 2000
 ### Framework versions
 - PEFT 0.10.1.dev0

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/hvl-ml/huggingface/runs/71217l2g)
 # starcoder2-3b-peft-lora
 This model is a fine-tuned version of [bigcode/starcoder2-3b](https://huggingface.co/bigcode/starcoder2-3b) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.7576
 ## Model description
 - lr_scheduler_warmup_steps: 30
 - training_steps: 2000
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.5348        | 0.05  | 100  | 0.9196          |
+| 0.8854        | 0.1   | 200  | 0.8079          |
+| 0.6684        | 0.15  | 300  | 0.7977          |
+| 0.7444        | 0.2   | 400  | 0.7962          |
+| 0.2232        | 0.25  | 500  | 0.8324          |
+| 0.4756        | 0.3   | 600  | 0.7965          |
+| 0.4507        | 0.35  | 700  | 0.7983          |
+| 0.747         | 0.4   | 800  | 0.7863          |
+| 0.4285        | 0.45  | 900  | 0.7854          |
+| 0.3905        | 0.5   | 1000 | 0.8041          |
+| 0.7737        | 0.55  | 1100 | 0.7641          |
+| 0.5301        | 0.6   | 1200 | 0.7599          |
+| 0.6514        | 0.65  | 1300 | 0.7561          |
+| 0.1931        | 0.7   | 1400 | 0.7659          |
+| 0.4201        | 0.75  | 1500 | 0.7567          |
+| 0.4191        | 0.8   | 1600 | 0.7566          |
+| 0.6998        | 0.85  | 1700 | 0.7530          |
+| 0.4025        | 0.9   | 1800 | 0.7528          |
+| 0.3656        | 0.95  | 1900 | 0.7605          |
+| 0.6816        | 1.0   | 2000 | 0.7576          |
 ### Framework versions
 - PEFT 0.10.1.dev0

adapter_config.json CHANGED Viewed

@@ -22,8 +22,8 @@
   "target_modules": [
     "c_attn",
     "c_proj",
-    "c_fc",
-    "q_attn"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "target_modules": [
     "c_attn",
     "c_proj",
+    "q_attn",
+    "c_fc"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e44ce263e6fd885f50d82ca515b9325375b43ee36ededb75acf161ce88bc2e41
-size 48

 version https://git-lfs.github.com/spec/v1
+oid sha256:948ead8d16696715ccdb9f30397f7bb750919502d1c6fe3f71837bad086cbfca
+size 29506408

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b19229e76492c9f2c5c4061c4c8ba272990d2c64d9788e6f8743f15877965176
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:f4d8e5fb078487928a16d1178a0db1480e026e840d1270cbb9a4fdfa08845b4c
 size 5048