stefan-it commited on
Commit
ad60d7b
1 Parent(s): 0685c7d

tokenizer: fix number of extra ids

Browse files

It's 300, as specified here:
https://github.com/google-research/t5x/blob/main/t5x/examples/scalable_t5/umt5/vocab.gin#L5

Files changed (1) hide show
  1. tokenizer_config.json +1 -1
tokenizer_config.json CHANGED
@@ -1 +1 @@
1
- {"eos_token": "</s>", "unk_token": "<unk>", "pad_token": "<pad>", "extra_ids": 384}
 
1
+ {"eos_token": "</s>", "unk_token": "<unk>", "pad_token": "<pad>", "extra_ids": 300}