Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
gaunernst
's Collections
Mini BERT models
Face Recognition Models
LLMs < 1B
LLMs 1B - 2B
LLMs 2B - 4B
Smallish LLM pre-training datasets
Llama2-compatible
Llama3-compatible
Smallish LLM pre-training datasets
updated
4 days ago
Upvote
-
roneneldan/TinyStories
Viewer
•
Updated
Aug 12
•
2.14M
•
32.3k
•
534
Note
V2 - 2GB
allenai/c4
Viewer
•
Updated
Jan 9
•
10.4B
•
284k
•
284
Note
realnewslike subset - 15GB
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
Aug 25
•
3B
•
80.6k
•
493
Note
sample-10BT subset - 28GB
HuggingFaceTB/smollm-corpus
Viewer
•
Updated
28 days ago
•
237M
•
4.28k
•
213
cerebras/SlimPajama-627B
Preview
•
Updated
Jul 7, 2023
•
4.5k
•
416
Upvote
-
Share collection
View history
Collection guide
Browse collections