Model save

Files changed (3)

README.md

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8440
+- Loss: 1.2898
 ## Model description
@@ -33,8 +33,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 256
-- eval_batch_size: 256
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 2
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
@@ -541,6 +541,15 @@ The following hyperparameters were used during training:
 | 0.3182        | 22.1943 | 494000 | 3.8617          |
 | 0.3633        | 22.2392 | 495000 | 3.8879          |
 | 0.3282        | 22.2841 | 496000 | 3.8440          |
+| 0.4121        | 29.7712 | 497000 | 1.3002          |
+| 0.4185        | 29.8311 | 498000 | 1.3007          |
+| 0.4           | 29.8910 | 499000 | 1.2910          |
+| 0.4009        | 29.9509 | 500000 | 1.2899          |
+| 0.3744        | 30.0108 | 501000 | 1.2981          |
+| 0.3691        | 30.0707 | 502000 | 1.2914          |
+| 0.3775        | 30.1306 | 503000 | 1.2877          |
+| 0.3604        | 30.1905 | 504000 | 1.2967          |
+| 0.3934        | 30.2504 | 505000 | 1.2898          |
 ### Framework versions
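
The added table rows can be checked against the new `Loss: 1.2898` line with a short script. The following is a minimal sketch, not part of the commit: it assumes the four columns are training loss, epoch, step, and validation loss (the table header is outside the diff hunk), and `parse_rows` is a hypothetical helper name.

```python
# Rows copied from the lines added to README.md in this commit.
# Assumed column order (header not visible in the hunk):
# training loss | epoch | step | validation loss
ROWS = """\
| 0.4121        | 29.7712 | 497000 | 1.3002          |
| 0.4185        | 29.8311 | 498000 | 1.3007          |
| 0.4           | 29.8910 | 499000 | 1.2910          |
| 0.4009        | 29.9509 | 500000 | 1.2899          |
| 0.3744        | 30.0108 | 501000 | 1.2981          |
| 0.3691        | 30.0707 | 502000 | 1.2914          |
| 0.3775        | 30.1306 | 503000 | 1.2877          |
| 0.3604        | 30.1905 | 504000 | 1.2967          |
| 0.3934        | 30.2504 | 505000 | 1.2898          |
"""


def parse_rows(text):
    """Parse markdown table rows into (train_loss, epoch, step, val_loss) tuples."""
    rows = []
    for line in text.strip().splitlines():
        # Drop the outer pipes, then split the remaining cells.
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        train_loss, epoch, step, val_loss = cells
        rows.append((float(train_loss), float(epoch), int(step), float(val_loss)))
    return rows


rows = parse_rows(ROWS)
last = rows[-1]                       # most recent logged evaluation
best = min(rows, key=lambda r: r[3])  # lowest value in the fourth column

print(f"last: step {last[2]}, loss {last[3]}")
print(f"best: step {best[2]}, loss {best[3]}")
```

Under that column assumption, the loss reported at the top of the README matches the last logged row (step 505000), while the lowest value among the added rows appears at step 503000.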

last-checkpoint/trainer_state.json

The diff for this file is too large to render.

training_args.bin

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:90522abbdd95ad74158c8fe9b382e088ef9a6cb34be2bf478efc3daa7d826730
+oid sha256:39526ab562e7eecfc574042300c36bed45d9a29bbf93315c727e39bef593c57a
 size 5240
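
The `training_args.bin` entry is a Git LFS pointer, so only the `oid` line changes while `size` stays at 5240 bytes. As a rough illustration of how such a pointer is structured, here is a small sketch that splits the new pointer into its fields. `parse_lfs_pointer` is a hypothetical helper written for this example, not a Git LFS API.

```python
# New pointer contents, copied from the diff above (diff prefixes dropped).
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:39526ab562e7eecfc574042300c36bed45d9a29bbf93315c727e39bef593c57a
size 5240
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its key/value fields."""
    # Each pointer line is "<key> <value>", separated by a single space.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    # The oid value has the form "<hash algorithm>:<hex digest>".
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


pointer = parse_lfs_pointer(POINTER)
print(pointer["hash_algo"], pointer["size"])
```

An unchanged size with a different digest is consistent with the serialized training arguments keeping the same layout while some stored values changed, such as the batch sizes shown in the README diff. That link is an inference from the diff, not something the commit states.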