End of training

Files changed (11) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1487
 ## Model description
@@ -46,9 +46,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 495  | 1.2865          |
-| 1.6273        | 2.0   | 990  | 1.1770          |
-| 1.3219        | 3.0   | 1485 | 1.1487          |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.2714
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 3    | 3.5388          |
+| No log        | 2.0   | 6    | 3.3409          |
+| No log        | 3.0   | 9    | 3.2714          |
 ### Framework versions

config.json CHANGED Viewed

@@ -6,10 +6,12 @@
   ],
   "attn_pdrop": 0.1,
   "bos_token_id": 50256,
   "embd_pdrop": 0.1,
   "eos_token_id": 50256,
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
   "model_type": "gpt2",
   "n_ctx": 1024,
   "n_embd": 768,

   ],
   "attn_pdrop": 0.1,
   "bos_token_id": 50256,
+  "do_sample": true,
   "embd_pdrop": 0.1,
   "eos_token_id": 50256,
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
+  "max_length": 50,
   "model_type": "gpt2",
   "n_ctx": 1024,
   "n_embd": 768,

generation_config.json CHANGED Viewed

@@ -1,6 +1,9 @@
 {
   "_from_model_config": true,
   "bos_token_id": 50256,
   "eos_token_id": 50256,
   "transformers_version": "4.41.2"
 }

 {
   "_from_model_config": true,
   "bos_token_id": 50256,
+  "do_sample": true,
   "eos_token_id": 50256,
+  "max_length": 50,
+  "pad_token_id": 50256,
   "transformers_version": "4.41.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c2116a6390a20e1d15f39736be47286f60308ac8c0a21fadf8484bced1542c2f
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:49c4672d24e79732c642c6803783920d9b18e29aa70b71ced578c5be7f4b0f66
 size 497774208

runs/Jun15_11-37-46_8423c480c5d4/events.out.tfevents.1718451467.8423c480c5d4.3030.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:63a8feba4ecd2b4058f611c4c9a7321f1067c18de90a9e868d8d88fb5e79feb2
+size 5116

runs/Jun15_11-38-20_8423c480c5d4/events.out.tfevents.1718451501.8423c480c5d4.3030.3 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:98eb91ac308465de21d3eec43032b70f6c2084379d57116dfbbb678f3c0d3ce7
+size 5116

runs/Jun15_11-41-21_8423c480c5d4/events.out.tfevents.1718451681.8423c480c5d4.3030.4 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:6c65768e041b731fe4f65604fe5dedae86b5aec4a83055ce62f20f7942e0fa1e
+size 5116

runs/Jun15_11-45-10_8423c480c5d4/events.out.tfevents.1718451910.8423c480c5d4.3030.5 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:36e3a3003f1796531ba8fd2cb5f545a13bc98bbef284726eaf4b997d16bb78f6
+size 5116

runs/Jun15_11-51-39_8423c480c5d4/events.out.tfevents.1718452299.8423c480c5d4.3030.6 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4c8c0f5b486e5db013375e94f8c9398e517c3e03b05a3edaa22e0edd173f877d
+size 6262

runs/Jun15_11-51-39_8423c480c5d4/events.out.tfevents.1718452320.8423c480c5d4.3030.7 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:436ac86bd38772dffb592815916175ab7abe4b72230a1c3886a9fed7edef1386
+size 354

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7756109a9c95e98e3b9452ad49f0dbebfe093dcbc4d51a3f21f715bd510a374a
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:766b3c4cd493c10475b8fddb906959a4abee22819d32a7ec7188882a59e595f2
 size 5176