File size: 574 Bytes
eb4f161 9b9b353 eb4f161 9b9b353 eb4f161 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
# japanese-gpt2-medium
![rinna-icon](./rinna.png)
This repository provides a medium-sized Japanese GPT-2 model trained on [Japanese CC-100](http://data.statmt.org/cc-100/ja.txt.xz). The model is provided by [rinna](https://corp.rinna.co.jp/).
# Use the model
*NOTE:* Use `T5Tokenizer` to initiate the tokenizer with argument `extra_ids=0`.
~~~~
from transformers import T5Tokenizer, AutoModelForCausalLM
tokenizer = T5Tokenizer.from_pretrained("rinna/japanese-gpt2-medium", extra_ids=0)
model = AutoModelForCausalLM.from_pretrained("rinna/japanese-gpt2-medium")
~~~~ |