uer committed on
Commit
204e52c
1 Parent(s): 01a2a7e

Update README.md

  1. README.md +11 -1
README.md CHANGED
@@ -12,7 +12,9 @@ widget:
 
  ## Model description
 
- The model is used to generate Chinese ancient poems. You can download the model either from the [GPT2-Chinese Github page](https://github.com/Morizeyao/GPT2-Chinese), or via HuggingFace from the link [gpt2-chinese-poem](https://huggingface.co/uer/gpt2-chinese-poem]).
+ The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/), which is introduced in [this paper](https://arxiv.org/abs/1909.05658). Besides, the model could also be pre-trained by [TencentPretrain](https://github.com/Tencent/TencentPretrain) introduced in [this paper](https://arxiv.org/abs/2212.06385), which inherits UER-py to support models with parameters above one billion, and extends it to a multimodal pre-training framework.
+
+ The model is used to generate Chinese ancient poems. You can download the model from the [UER-py Modelzoo page](https://github.com/dbiir/UER-py/wiki/Modelzoo), or [GPT2-Chinese Github page](https://github.com/Morizeyao/GPT2-Chinese), or via HuggingFace from the link [gpt2-chinese-poem](https://huggingface.co/uer/gpt2-chinese-poem]).
 
  Since the parameter skip_special_tokens is used in the pipelines.py, special tokens such as [SEP], [UNK] will be deleted, the output results of Hosted inference API (right) may not be properly displayed.
 
@@ -90,4 +92,12 @@ python3 scripts/convert_gpt2_from_uer_to_huggingface.py --input_model_path poem_
  pages={241},
  year={2019}
  }
+
+ @article{zhao2023tencentpretrain,
+ title={TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities},
+ author={Zhao, Zhe and Li, Yudong and Hou, Cheng and Zhao, Jing and others},
+ journal={ACL 2023},
+ pages={217},
+ year={2023}
+ }
  ```
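The README context in this diff notes that `skip_special_tokens` causes tokens such as `[SEP]` and `[UNK]` to be dropped from the displayed output. A minimal sketch of that effect follows. It is not the code in `pipelines.py`: the `decode` helper, the `SPECIAL_TOKENS` set, and the token sequence are all hypothetical and exist only to show why text decoded with the flag set can look different from the raw model output.

```python
# Hypothetical illustration only; this is not the transformers implementation.
# SPECIAL_TOKENS is an assumed set of BERT-style special tokens.
SPECIAL_TOKENS = {"[CLS]", "[SEP]", "[UNK]", "[PAD]", "[MASK]"}


def decode(tokens, skip_special_tokens=False):
    """Join tokens into one string, optionally dropping special tokens."""
    if skip_special_tokens:
        tokens = [t for t in tokens if t not in SPECIAL_TOKENS]
    return "".join(tokens)


# Invented token sequence, used purely for demonstration.
tokens = ["[CLS]", "A", "B", "[SEP]", "C", "[UNK]", "D"]

raw_text = decode(tokens)                               # special tokens kept
clean_text = decode(tokens, skip_special_tokens=True)   # special tokens removed

print(raw_text)
print(clean_text)
```

With the flag set, any separator or unknown-token marker the model emitted is silently removed, so boundaries that `[SEP]` may have marked are no longer visible in the displayed text.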