jackoyoungblood
/

TinyStoriesProject

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jackoyoungblood commited on Dec 11, 2023

Commit

30b2178

·

1 Parent(s): 8d55a5b

End of training

Files changed (2) hide show

README.md +6 -1
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -14,6 +14,8 @@ should probably proofread and complete it, then remove this comment. -->
 # TinyStoriesProject
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 ## Model description
@@ -40,12 +42,15 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 256
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 100
 - num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 ### Framework versions

 # TinyStoriesProject
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.3478
 ## Model description
 - total_train_batch_size: 256
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 1000
 - num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 1.8529        | 0.63  | 5000 | 1.3478          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:df43dee36fb557819531b98e8b763250ea3c836d6cc878a7f7605da471bc1f11
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:ef5e525cf99a699b822983e9cd04aab27105007b6f6157dc38d3909c91780571
 size 497774208