FineWeb Concept Datasets
Updated Mar 2
Concept datasets extracted from FineWeb.
BEE-spoke-data/SaunaWeb-50k • Viewer • Updated Dec 29, 2025 • 50k • 12
BEE-spoke-data/FineMeme-100k • Viewer • Updated Dec 29, 2025 • 100k • 25
BEE-spoke-data/beeweb-5k • Viewer • Updated Dec 29, 2025 • 5k • 5
BEE-spoke-data/fineweb-synergy-20k • Viewer • Updated Dec 29, 2025 • 20k • 6
BEE-spoke-data/MoistWeb-25k • Viewer • Updated Dec 29, 2025 • 25k • 8 • 1
BEE-spoke-data/fineweb-cryptid-5k • Viewer • Updated Dec 29, 2025 • 5k • 11
BEE-spoke-data/fineweb-literature-100k • Viewer • Updated Dec 29, 2025 • 100k • 17 • 1
BEE-spoke-data/fineweb-cinema-100k • Viewer • Updated Dec 29, 2025 • 100k • 20