Commit 73b685a by tiennm
1 parent: 92056fc

End of training

README.md CHANGED
@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8365
+- Loss: 3.8235
 
 ## Model description
 
@@ -48,14 +48,14 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.9504 | 1.0 | 1308 | 3.8461 |
-| 3.8509 | 2.0 | 2616 | 3.8372 |
-| 3.8114 | 3.0 | 3924 | 3.8365 |
+| 3.946 | 1.0 | 1323 | 3.8340 |
+| 3.8542 | 2.0 | 2646 | 3.8252 |
+| 3.8166 | 3.0 | 3969 | 3.8235 |
 
 
 ### Framework versions
 
 - Transformers 4.42.4
-- Pytorch 2.3.1+cu121
+- Pytorch 2.4.0+cu121
 - Datasets 2.21.0
 - Tokenizers 0.19.1
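For reference, the evaluation losses in the README diff can be read as perplexities. The sketch below assumes the reported loss is a mean per-token cross-entropy in nats, which is the usual convention for causal language-model fine-tuning but is not stated in this commit. The function name `perplexity` is illustrative only.

```python
import math


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


# Final evaluation losses copied from the README diff (old run vs. new run).
old_loss = 3.8365
new_loss = 3.8235

old_ppl = perplexity(old_loss)
new_ppl = perplexity(new_loss)

print(f"old perplexity: {old_ppl:.2f}")
print(f"new perplexity: {new_ppl:.2f}")
print(f"relative change: {(new_ppl - old_ppl) / old_ppl:.2%}")
```

Under that assumption, the lower loss in the new run corresponds to a slightly lower perplexity on the evaluation set.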
runs/Aug28_04-17-23_28acd65ec4c0/events.out.tfevents.1724818644.28acd65ec4c0.307.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a6d3cb9551e9ba02b0d315574808a034c56ddd9b60651896bf31c6be49605bf
-size 7227
+oid sha256:ea502adfe3824fa6eae94914ecfcc94675bc83262208f78d04fb1ed74ed8f4bc
+size 7852
runs/Aug28_04-17-23_28acd65ec4c0/events.out.tfevents.1724819557.28acd65ec4c0.307.1 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:49372390acb9c97c3f489b4503bf285e4bf6a5326cbb11d835752ddfea8acf78
+size 359
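The TensorBoard event files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the object hash, and the size in bytes. The sketch below reads such a pointer, assuming only the three-field layout shown in this diff. `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any Hugging Face tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file with 'version', 'oid', and 'size' lines."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real object in bytes
    }


# Pointer contents of the newly added event file in this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:49372390acb9c97c3f489b4503bf285e4bf6a5326cbb11d835752ddfea8acf78
size 359
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

The pointer records only metadata; the event data itself lives in LFS storage and is fetched separately.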