Why is the "add_bos_token" set to True in tokenizer_config.json?

#17
by heya5 - opened

Should I keep the bos_token_id in my input_ids for downstream tasks?

Hey @heyaa,

OPT uses a GPT2Tokenizer but prepends every prompt with a BOS TOKEN (e.g. <s> Hello there instead of Hello there)

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment