Why is the "add_bos_token" set to True in tokenizer_config.json?
#17
by
heya5
- opened
Should I keep the bos_token_id
in my input_ids for downstream tasks?
Hey @heyaa,
OPT uses a GPT2Tokenizer but prepends every prompt with a BOS TOKEN (e.g. <s> Hello there
instead of Hello there
)