Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
madoss 's Collections
MT Quality Estimation
Language ID
Synthetic Data Gen
Tokenization
African Languages Datasets
Audio
MT Models
SLM
LLMs Distillation
IE and Entity Linking
NL2SQL Models
Text to sql papers

African Languages Datasets

updated 29 days ago
Upvote
-

  • google/WaxalNLP

    Viewer • Updated 4 days ago • 2.55M • 9.66k • 142

  • openlanguagedata/flores_plus

    Viewer • Updated 26 days ago • 877k • 15.1k • 107

  • McGill-NLP/african_celtic_dataset

    Viewer • Updated 27 days ago • 57.5k • 421 • 1

  • HPLT/HPLT3.0

    Updated Nov 14, 2025 • 83 • 15

  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 1.95k • 88

  • CohereLabs/Global-MMLU

    Viewer • Updated Aug 14, 2025 • 602k • 13.8k • 150

  • allenai/c4

    Viewer • Updated Jan 9, 2024 • 10.4B • 465k • 527

  • cis-lmu/Glot500

    Viewer • Updated Dec 10, 2025 • 1.23B • 4.69k • 42

  • facebook/omnilingual-asr-corpus

    Viewer • Updated Nov 14, 2025 • 548k • 1.09k • 189
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs