bart-ocr-correction-ga-en / tokenizer_config.json
ancatmara's picture
Upload tokenizer
3998b4b verified
raw
history blame contribute delete
206 Bytes
{
"clean_up_tokenization_spaces": true,
"model_max_length": 64,
"special_tokens": [
"<s>",
"<pad>",
"</s>",
"<unk>",
"<mask>"
],
"tokenizer_class": "PreTrainedTokenizerFast"
}