FLORES-extensions Collection Partial translations of the FLORES(+) dataset and translations into non-textual modalities (speech, ASL). • 5 items • Updated 3 days ago
jasonrichdarmawan/nllb-primary-datasets-public-data-embedding Viewer • Updated Sep 24, 2025 • 10.7M • 80 • 1
OLDI and friends Collection This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task. • 5 items • Updated Mar 25 • 5