crossdelenna's picture
add tokenizer
13873ea
raw
history blame
286 Bytes
{"b": 0, "o": 1, "q": 2, "l": 3, "w": 4, "y": 5, "x": 6, "e": 7, "a": 8, "z": 9, "-": 10, "f": 11, "k": 12, "u": 13, "c": 14, "p": 15, "'": 16, "n": 17, "t": 18, "g": 19, "d": 20, "m": 22, ",": 23, "r": 24, "v": 25, "s": 26, "i": 27, "j": 28, "h": 29, "|": 21, "[UNK]": 30, "[PAD]": 31}