~1.69M raw Swahili text samples from news, government, education, and legal domains, ideal for LLM pretraining and unsupervised NLP research.
Samwel Ngusa
ngusadeep
·
AI & ML interests
None yet
Recent Activity
updated a Space about 21 hours ago
lengai-ai/README published a Space about 21 hours ago
lengai-ai/README updated a dataset about 22 hours ago
lengai-ai/Swahili-FineTome-Dataset