Model | DarijaMMLU | DarijaHellaSwag | Belebele Ary | DarijaAlpacaEval |
jais-family-1p3b-chat | 35.39 | 32.51 | 38.33 | 35.56 |
jais-family-2p7b-chat | 37.44 | 34.49 | 44.11 | 52.97 |
gemma-2-2b-it | 28.58 | 32.42 | 25.22 | 58.67 |
Llama-3.2-1B-Instruct | 27.66 | 26.88 | 28.89 | 23.57 |
Llama-3.2-3B-Instruct | 32.60 | 28.33 | 38.00 | 47.62 |
Atlas-Chat-2B | 44.97 | 41.48 | 53.89 | 92.31 |
jais-family-6p7b-chat | 39.96 | 41.57 | 51.22 | 65.18 |
jais-adapted-7b-chat | 39.30 | 35.19 | 43.67 | 61.84 |
jais-family-13b-chat | 45.11 | 43.90 | 58.67 | 69.93 |
jais-adapted-13b-chat | 45.20 | 40.65 | 49.67 | 77.52 |
AceGPT-7b-chat | 35.98 | 36.57 | 30.11 | 47.31 |
AceGPT-13b-chat | 41.09 | 38.35 | 33.11 | 52.79 |
gemma-2-9b-it | 35.91 | 42.43 | 31.00 | 90.86 |
Llama-3.1-8B-Instruct | 44.13 | 38.24 | 47.00 | 78.08 |
Atlas-Chat-9B | 58.23 | 57.75 | 74.56 | 95.62 |
jais-family-30b-8k-chat | 51.88 | 35.61 | 65.67 | 24.64 |
gemma-2-27b-it | 36.47 | 37.04 | 35.78 | 95.07 |
Atlas-Chat-27B | 61.95 | 48.37 | 75.67 | 96.58 |
Model | DODa-10k (Translation) | MADAR (Translation) | FLORES+ (Translation) | NLLB-Seed (Translation) | DODa-10k (Transliteration) | MArSum (Summarization) (LLM as a judge) |
Sentiment Analysis | |||||
BLEU | chrF | BLEU | chrF | BLEU | chrF | BLEU | chrF | BLEU | chrF | |||
jais-family-1p3b-chat | 00.13 | 06.18 | 00.50 | 15.43 | 02.44 | 19.14 | 01.99 | 12.60 | 00.01 | 03.01 | 00.50 | 45.29 |
jais-family-2p7b-chat | 00.25 | 07.46 | 00.62 | 16.36 | 04.25 | 18.22 | 03.10 | 08.19 | 00.01 | 03.27 | 00.90 | 51.56 |
gemma-2-2b-it | 00.10 | 04.96 | 00.12 | 06.66 | 01.55 | 18.59 | 02.78 | 23.69 | 00.01 | 02.08 | 06.80 | 53.36 |
Llama-3.2-1B-Instruct | 00.07 | 05.95 | 00.80 | 18.71 | 04.53 | 18.39 | 04.52 | 17.06 | 00.02 | 03.74 | 08.23 | 46.27 |
Llama-3.2-3B-Instruct | 00.62 | 13.67 | 01.18 | 22.12 | 08.59 | 35.21 | 13.75 | 43.63 | 00.21 | 09.68 | 08.23 | 49.20 |
Atlas-Chat-2B | 22.76 | 44.86 | 16.67 | 41.64 | 14.92 | 43.03 | 23.88 | 52.19 | 08.18 | 21.54 | 55.22 | 73.99 |
jais-family-6p7b-chat | 00.73 | 11.85 | 01.88 | 23.22 | 04.25 | 18.22 | 04.62 | 20.22 | 00.02 | 03.79 | 03.02 | 56.78 |
jais-adapted-7b-chat | 00.60 | 09.43 | 03.45 | 25.88 | 07.25 | 23.21 | 01.25 | 02.22 | 00.04 | 03.24 | 02.82 | 52.72 |
jais-family-13b-chat | 00.92 | 11.71 | 04.01 | 28.48 | 05.70 | 27.24 | 04.50 | 22.56 | 00.03 | 03.57 | 01.77 | 41.73 |
jais-adapted-13b-chat | 00.87 | 10.52 | 04.02 | 25.29 | 06.66 | 23.46 | 20.14 | 47.87 | 0.04 | 04.77 | 01.92 | 66.68 |
AceGPT-7b-chat | 00.44 | 11.33 | 01.05 | 19.24 | 06.92 | 36.03 | 11.05 | 44.55 | 00.06 | 04.74 | 02.28 | 40.23 |
AceGPT-13b-chat | 00.98 | 16.70 | 00.81 | 20.23 | 08.73 | 40.76 | 14.02 | 48.28 | 00.12 | 06.32 | 02.80 | 59.58 |
gemma-2-9b-it | 03.10 | 19.16 | 01.72 | 24.35 | 05.18 | 36.96 | 08.23 | 43.57 | 00.17 | 09.14 | 13.81 | 59.87 |
Llama-3.1-8B-Instruct | 00.92 | 14.19 | 01.46 | 23.82 | 08.89 | 33.08 | 11.85 | 35.51 | 00.11 | 06.02 | 16.14 | 44.08 |
Atlas-Chat-9B | 28.08 | 50.48 | 18.16 | 43.91 | 18.63 | 47.53 | 29.98 | 58.26 | 22.08 | 34.17 | 59.76 | 81.89 |
jais-family-30b-8k-chat | 01.10 | 14.40 | 01.67 | 23.37 | 08.52 | 35.41 | 13.71 | 41.33 | 00.05 | 04.48 | 00.46 | 56.73 |
gemma-2-27b-it | 00.67 | 13.04 | 01.74 | 24.63 | 05.17 | 37.08 | 07.36 | 42.49 | 00.03 | 04.94 | 11.10 | 57.59 |
Atlas-Chat-27B | 29.55 | 51.74 | 19.66 | 45.65 | 20.34 | 49.19 | 31.61 | 59.37 | 33.03 | 40.95 | 60.70 | 73.00 |