[Train] pad_token_id set as "!"

#7
by eungizoa - opened

According to the special_tokens_map, the pad_token is set as "!".
I want to fine-tune this model to perform specific task, and rom the trl.SFTTrainer document, it says that pad_token_id should be set differently as eos_token_id.

"Make sure to have a pad_token_id which is different from eos_token_id which can result in the model not properly predicting EOS (End of Sentence) tokens during generation." (https://huggingface.co/docs/trl/sft_trainer)

But the string "!" is a very common string, so I wonder if I can use this pad_token during training.

Can I train this model as below?
"tokenizer.pad_token_id = tokenizer.eos_token_id"

Or should I use this pad_token ("!") without any modification?

Sign up or log in to comment