kmfoda's picture
First version of the wav2vec2-large-xlsr-arabice model and tokenizer.
30e4171
raw
history blame
760 Bytes
{"t": 0, "ه": 1, "ح": 2, "؛": 3, "ى": 4, "ف": 5, "غ": 6, "ج": 7, "ع": 8, "چ": 9, ".": 10, "ﻻ": 11, "ٌ": 12, "e": 13, "ْ": 14, ":": 15, "،": 16, "خ": 17, "ئ": 18, "َ": 19, "ﺃ": 20, "ب": 21, "ڨ": 22, "أ": 23, "د": 24, "ا": 25, "ً": 26, ";": 27, "ذ": 28, "و": 29, "ة": 30, "-": 31, "_": 32, "ّ": 34, "\"": 35, "?": 36, "آ": 37, "ء": 38, "—": 39, "”": 40, "ش": 41, "“": 42, "ر": 43, "ٍ": 44, "ـ": 45, "ك": 46, "!": 47, "ي": 48, ",": 49, "ط": 50, "ھ": 51, "ز": 52, "م": 53, "ض": 54, "ِ": 55, "ؤ": 56, "ث": 57, "ُ": 58, "ق": 59, "ٰ": 60, "؟": 61, "ص": 62, "ۚ": 63, "g": 64, "ۖ": 65, "☭": 66, "ل": 67, "ی": 68, "ن": 69, "س": 70, "ظ": 71, "ک": 72, "ت": 73, "إ": 74, "|": 33, "[UNK]": 75, "[PAD]": 76}