MultiSynt

community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

gramirez-prompsit  updated a Space about 22 hours ago
MultiSynt/README
gramirez-prompsit  updated a dataset about 22 hours ago
MultiSynt/MT-Nemotron-CC
maxidl  updated a dataset 3 days ago
MultiSynt/Nemotron-CC-sample-2
View all activity

MultiSynt is a collaborative initiative between OpenEuroLLM and EuroLLM focused on developing high-quality multilingual synthetic datasets for language model pretraining. By combining expertise from both organizations, MultiSynt aims to advance the creation of multilingual synthetic training data that supports diverse European languages to enable more inclusive AI development across languages.