AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published 15 days ago • 6
Building Foundations for Natural Language Processing of Historical Turkish: Resources and Models Paper • 2501.04828 • Published 21 days ago • 11
view post Post 3198 🇸🇰 Hovorte po slovensky? Help build better AI for Slovak! We only need 90 more annotations to include Slovak in the next Hugging Face FineWeb2-C dataset ( data-is-better-together/fineweb-c) release! Your contribution will help create better language models for 5+ million Slovak speakers.Annotate here: data-is-better-together/fineweb-c.Read more about why we're doing it: https://huggingface.co/blog/davanstrien/fineweb2-community See translation 3 replies · ❤️ 10 10 🤝 1 1 🚀 1 1 😔 1 1 + Reply
U-MATH and μ-MATH - University-level math evaluation Collection Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS • 4 items • Updated 16 days ago • 15