Collection of Arabic Tokenizers with different sizes based on SentencePiece & PBE Encodings suitable for training LLMs
Robotics and Interne-of-Things
riotu-lab
AI & ML interests
None yet
Recent Activity
liked
a Space
7 days ago
Omartificial-Intelligence-Space/Arabic-Reranking-Eval
liked
a model
7 days ago
NAMAA-Space/GATE-Reranker-V1
liked
a model
11 days ago
Omartificial-Intelligence-Space/GATE-AraBert-v1
Organizations
None yet
models
19
riotu-lab/ArabianGPT-1.5B-FT-SA-v2
Updated
•
7
riotu-lab/Aranizer-PBE-64k
Updated
•
1
riotu-lab/Aranizer-SP-32k
Updated
riotu-lab/Aranizer-SP-64k
Updated
riotu-lab/Aranizer-SP-86k
Updated
riotu-lab/Aranizer-PBE-32k
Updated
•
1
riotu-lab/Aranizer-PBE-86k
Updated
riotu-lab/ArabianGPT-0.8B-Sum-FT
Updated
•
7
riotu-lab/ArabianGPT-0.8B-FT-QA
Updated
•
6
riotu-lab/ArabianGPT1.5B-QA-FT
Text Generation
•
Updated
•
13
•
1
datasets
6
riotu-lab/combined-arabic-dataset
Viewer
•
Updated
•
523k
•
38
•
1
riotu-lab/ARABIC-RAW-TEXT
Viewer
•
Updated
•
100M
•
105
•
4
riotu-lab/ArabicQA_2.1M
Viewer
•
Updated
•
2.14M
•
86
•
2
riotu-lab/Arabic-books-and-research-dataset
Viewer
•
Updated
•
37k
•
135
•
4
riotu-lab/Synthetic-UAV-Flight-Trajectories
Viewer
•
Updated
•
766k
•
74
•
1
riotu-lab/Quran-Tafseers
Viewer
•
Updated
•
56.1k
•
66
•
6