Commit e33b2da by iwontbecreative
1 Parent(s): 2a5a828
Update tokenizer config to match latest bugfixes, add tokenizer.json
Files changed:
- tokenizer.json +0 -0
- tokenizer_config.json +1 -1
tokenizer.json
ADDED
The diff for this file is too large to render.
tokenizer_config.json
CHANGED
@@ -1 +1 @@
-{"do_lower_case":
+{"do_lower_case": false, "remove_space": true, "keep_accents": true, "bos_token": "[CLS]", "eos_token": "[SEP]", "unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "tokenizer_class": "RemBertTokenizer"}
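The updated tokenizer_config.json is a single-line JSON object, so its settings are easier to review once parsed. The following is a minimal sketch using only the Python standard library: it parses the new line from the diff above and groups the special-token roles by the token string they map to. The names `NEW_CONFIG`, `special_tokens`, and `roles_by_token` are illustrative and not part of the commit.

```python
import json

# New contents of tokenizer_config.json, copied from the "+" line of the diff.
NEW_CONFIG = (
    '{"do_lower_case": false, "remove_space": true, "keep_accents": true, '
    '"bos_token": "[CLS]", "eos_token": "[SEP]", "unk_token": "[UNK]", '
    '"sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", '
    '"mask_token": "[MASK]", "tokenizer_class": "RemBertTokenizer"}'
)

config = json.loads(NEW_CONFIG)

# Keys ending in "_token" hold the special-token settings.
special_tokens = {k: v for k, v in config.items() if k.endswith("_token")}

# Group the roles that share the same token string.
roles_by_token = {}
for role, token in special_tokens.items():
    roles_by_token.setdefault(token, []).append(role)

for token, roles in roles_by_token.items():
    print(token, "<-", ", ".join(roles))
```

Read this way, the config reuses "[CLS]" for both bos_token and cls_token, and "[SEP]" for both eos_token and sep_token, while lowercasing is disabled and accents are kept.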