jed351/gpt2_tiny_zh-hk-shikoto


jed351 committed on Feb 3, 2023

Commit 7dcaf95 (1 parent: 924e4c5)

Update README.md

Files changed (1)

README.md +4 -3


@@ -27,12 +27,13 @@ should probably proofread and complete it, then remove this comment. -->
 # gpt2-shikoto
 This model was trained on a dataset I obtained from an online novel site.
-**Please be aware that the stories (training data) might contain inappropriate content, and thus the model is for research purposes only**
+**Please be aware that the stories (training data) might contain inappropriate content. This model is intended for research purposes only.**
 The base model can be found [here](https://huggingface.co/jed351/gpt2-tiny-zh-hk), which was obtained by
 patching a [GPT2 Chinese model](https://huggingface.co/ckiplab/gpt2-tiny-chinese) and its tokenizer with Cantonese characters.
-Refer to the base model for info of the patching process.
+Refer to the base model for info on the patching process.
@@ -43,7 +44,7 @@ Please refer to the [script](https://github.com/huggingface/transformers/tree/ma
 provided by Huggingface.
-The model was trained for 400,000 steps on 2 NVIDIA Quadro RTX6000 for around 15 hours.
+The model was trained for 400,000 steps on 2 NVIDIA Quadro RTX6000 for around 15 hours at the Research Computing Services of Imperial College London.
 ### Training hyperparameters