Toflamus
/

GPT-2_para3M

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Toflamus commited on Sep 1, 2023

Commit

7e39477

•

1 Parent(s): 0daf75f

Update README.md

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 # GPT-2_para3M
-This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.3207
@@ -23,8 +23,9 @@ More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
 More information needed

 # GPT-2_para3M
+This model is a pretrained version of [gpt2](https://huggingface.co/gpt2) on an [Tinystory](https://huggingface.co/datasets/roneneldan/TinyStories) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.3207
 ## Intended uses & limitations
+The limitation of this model are mainly 2 aspects.
+* The number of parameter of the model is only around 3.6 million which is not large. As a result the model cannot generate text in all perspectives.
+* The dataset is only composed of stories, this greatly hinder the performance of the model. Only stories can be generated.
 ## Training and evaluation data
 More information needed