bert-base-thai / tokenizer_config.json
monsoon-nlp's picture
Thai post-sentence-segment model from github.com/ThAIKeras
296a0a9
raw
history blame
181 Bytes
{"unk_token": "<unk>", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "strip_accents": false, "lowercase": false, "do_lower_case": false}