cwaud committed on
Commit
d1912bf
1 Parent(s): 6ef1018

End of training

Files changed (2)
  1. README.md +6 -12
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -25,13 +25,7 @@ chat_template: llama3
  dataset_prepared_path: null
  datasets:
  - path: mhenrichsen/alpaca_2k_test
- type:
- field_input: input
- field_instruction: instruction
- field_output: output
- field_system: text
- system_format: '{system}'
- system_prompt: you are helpful
+ type: alpaca
  debug: null
  deepspeed: null
  early_stopping_patience: null
@@ -96,7 +90,7 @@ xformers_attention: null
  
  This model is a fine-tuned version of [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.4253
+ - Loss: 1.0050
  
  ## Model description
  
@@ -130,10 +124,10 @@ The following hyperparameters were used during training:
  
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:------:|:----:|:---------------:|
- | 6.0039 | 0.0042 | 1 | 5.7351 |
- | 6.0484 | 0.0126 | 3 | 5.6486 |
- | 4.8758 | 0.0253 | 6 | 3.8893 |
- | 1.3682 | 0.0379 | 9 | 0.4253 |
+ | 4.8197 | 0.0042 | 1 | 4.6394 |
+ | 4.6489 | 0.0126 | 3 | 4.5547 |
+ | 4.0712 | 0.0253 | 6 | 2.9871 |
+ | 1.3689 | 0.0379 | 9 | 1.0050 |
  
  
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9a5f1f3db3449ac4064e08005aafebb4800d250142238c83063af55fd3fefffd
+ oid sha256:a0f5480285728eabc395dd7b2f6aa5f5e8e8ad30c8acb9171dd1a5c5b6cae604
  size 982663982