Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
86
11
15
Guilherme Penedo
guipenedo
Follow
casiimir's profile picture
Dharmmy22ab's profile picture
mhrnsltni's profile picture
856 followers
·
6 following
gui_penedo
guipenedo
AI & ML interests
None yet
Articles
FineWeb2-C: Help Build Better Language Models in Your Language
Dec 23, 2024
•
18
Organizations
guipenedo
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
about 2 months ago
HuggingFaceFW/fineweb-2
Viewer
•
Updated
21 days ago
•
12.5B
•
71.4k
•
398
liked
a Space
2 months ago
Runtime error
34
💬
Discussion Forum
liked
a model
3 months ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
Updated
22 days ago
•
88.6k
•
482
liked
a Space
3 months ago
Running
53
📝
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
liked
a Space
4 months ago
Running
96
📖
TxT360: Trillion Extracted Text
liked
a model
4 months ago
cis-lmu/glotlid
Text Classification
•
Updated
Oct 26, 2024
•
6.97k
•
55
liked
a dataset
4 months ago
tiiuae/falcon-refinedweb
Viewer
•
Updated
Jun 20, 2023
•
968M
•
20.7k
•
829
liked
a Space
6 months ago
Running
375
🧽
Finegrain Object Eraser
Erase any object just by naming it!
liked
3 models
7 months ago
HuggingFaceTB/SmolLM-1.7B
Text Generation
•
Updated
Oct 16, 2024
•
14.1k
•
167
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
•
Updated
Aug 18, 2024
•
41.9k
•
107
AI-MO/NuminaMath-7B-TIR
Text Generation
•
Updated
Aug 14, 2024
•
2.82k
•
331
liked
a dataset
8 months ago
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
23 days ago
•
3.24B
•
304k
•
603
liked
a Space
8 months ago
Running
558
🍷
FineWeb: decanting the web for the finest text data at scale
liked
a dataset
9 months ago
HuggingFaceFW/fineweb
Viewer
•
Updated
26 days ago
•
48.6B
•
431k
•
1.83k
liked
a Space
about 1 year ago
Running
206
🚀
GPT Baker