codegen-16B-multi-6-parts / tokenizer_config.json
Andrei Panferov
tokenizer and such
d6e09e8
raw
history blame
240 Bytes
{"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "model_max_length": 2048, "special_tokens_map_file": null, "name_or_path": "gpt2", "tokenizer_class": "CodeGenTokenizer"}