HuggingFaceFW/fineweb-2
A collection of Arabic text datasets sourced from the FineWeb2 project, covering diverse content in both Modern Standard Arabic (MSA) and dialectal Arabic.
Note: This is the original FineWeb2 repository, which includes thousands of languages. Find the Arabic portions below.