Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗Datasets: NLP, Multimodal data processing and sharing

Recent Activity

upvoted an article about 13 hours ago
updated a Space about 19 hours ago
lhoestq/dataset-spreadsheets
New activity about 21 hours ago
MLCommons/peoples_speech

Articles

Organizations

lhoestq's activity

upvoted an article about 13 hours ago
upvoted an article 8 days ago
view article
Article

Releasing the largest multilingual open pretraining dataset

94
upvoted an article 30 days ago
view article
Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

62
upvoted an article about 1 month ago
upvoted 2 articles about 1 month ago
view article
Article

Scaling AI-based Data Processing with Hugging Face + Dask

23
view article
Article

Improving Parquet Dedupe on Hugging Face Hub

30
upvoted an article about 2 months ago
view article
Article

🥐CroissantLLM: A Truly Bilingual French-English Language Model

By manu
10
upvoted an article about 2 months ago
view article
Article

FineVideo: behind the scenes

23
upvoted an article about 2 months ago
view article
Article

🌟 Easy Fine-Tuning with Hugging Face SQL Console, Notebook Creator, and SFT

By asoria
12
upvoted an article 2 months ago
view article
Article

Introducing the SQL Console on Datasets

18
upvoted an article 3 months ago
view article
Article

Scaling robotics datasets with video encoding

34