kennethli319
/

distilgpt2-finetuned-wikitext2

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

kennethli319 commited on Jan 11

Commit

f8ddfd5

•

1 Parent(s): 2460103

End of training

Files changed (2) hide show

README.md +9 -7
adapter_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,9 @@
 ---
 license: apache-2.0
-base_model: distilgpt2
 tags:
 - generated_from_trainer
 model-index:
 - name: distilgpt2-finetuned-wikitext2
   results: []
@@ -15,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.3608
 ## Model description
@@ -46,14 +47,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 290  | 3.3948          |
-| 3.5536        | 2.0   | 580  | 3.3654          |
-| 3.5536        | 3.0   | 870  | 3.3608          |
 ### Framework versions
 - Transformers 4.36.2
 - Pytorch 2.1.2+cu121
-- Datasets 2.15.0
-- Tokenizers 0.15.0

 ---
 license: apache-2.0
+library_name: peft
 tags:
 - generated_from_trainer
+base_model: distilgpt2
 model-index:
 - name: distilgpt2-finetuned-wikitext2
   results: []
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.6455
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 290  | 3.7059          |
+| 3.8948        | 2.0   | 580  | 3.6574          |
+| 3.8948        | 3.0   | 870  | 3.6455          |
 ### Framework versions
+- PEFT 0.7.1
 - Transformers 4.36.2
 - Pytorch 2.1.2+cu121
+- Datasets 2.16.1
+- Tokenizers 0.15.0

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2a75e6626f5f89bf739588d2e686dfb8f982f51555fff5c43b2ecf2705e5a99c
 size 591352

 version https://git-lfs.github.com/spec/v1
+oid sha256:d26ce68d5802fc6814224511c3bdc4e52583b87b0d609c85fa57d07ce5a3604a
 size 591352