File size: 574 Bytes
eb4f161
 
 
 
 
 
 
 
 
9b9b353
eb4f161
 
 
 
9b9b353
eb4f161
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18

# japanese-gpt2-medium

![rinna-icon](./rinna.png)

This repository provides a medium-sized Japanese GPT-2 model trained on [Japanese CC-100](http://data.statmt.org/cc-100/ja.txt.xz). The model is provided by [rinna](https://corp.rinna.co.jp/).

# Use the model

*NOTE:* Use `T5Tokenizer` to initiate the tokenizer with argument `extra_ids=0`.

~~~~
from transformers import T5Tokenizer, AutoModelForCausalLM

tokenizer = T5Tokenizer.from_pretrained("rinna/japanese-gpt2-medium", extra_ids=0)

model = AutoModelForCausalLM.from_pretrained("rinna/japanese-gpt2-medium")
~~~~