ERNIE-Layout-Pytorch / tokenizer_config.json
Norm's picture
Update tokenizer_config.json
c76ba7e
raw
history blame
No virus
255 Bytes
{
"do_tokenize_postprocess": false,
"sep_token": "[SEP]",
"cls_token": "[CLS]",
"unk_token": "[UNK]",
"pad_token": "[PAD]",
"mask_token": "[MASK]",
"do_lower_case": true,
"model_max_length": 512,
"vocab_file": "sentencepiece.bpe.model"
}