Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Lumees
company
https://lumees.io
Activity Feed
Follow
4
AI & ML interests
LLM, OCR, Embedding Models, Private Intelligence
Recent Activity
hasankursun
new
activity
about 1 month ago
lumees/github-code-2025-language-split:
[bot] Conversion to Parquet
hasankursun
updated
a dataset
about 1 month ago
lumees/github-code-2025-language-split
hasankursun
updated
a collection
about 1 month ago
Global Corpus
View all activity
Team members
2
lumees
's datasets
10
Sort: Recently updated
lumees/github-code-2025-language-split
Viewer
•
Updated
Dec 1, 2025
•
148M
•
12.8k
•
4
lumees/dutch-corpus-200b
Viewer
•
Updated
Dec 1, 2025
•
170M
•
377
•
3
lumees/bulgarian-corpus-33b
Viewer
•
Updated
Nov 30, 2025
•
34.9M
•
885
•
3
lumees/turkish-corpus-100b
Viewer
•
Updated
Nov 30, 2025
•
107M
•
1.29k
•
3
lumees/turkish-legislation-corpus
Viewer
•
Updated
Nov 30, 2025
•
899
•
39
•
2
lumees/codesearchnet-hard-negatives
Viewer
•
Updated
Nov 28, 2025
•
955k
•
33
•
2
lumees/wikipedia-turkish-synthetic-query
Viewer
•
Updated
Nov 28, 2025
•
19.8k
•
26
•
3
lumees/ms-marco-tr-hard-negatives
Viewer
•
Updated
Nov 27, 2025
•
786k
•
48
•
2
lumees/multilingual-safety-classification-dataset
Viewer
•
Updated
Oct 24, 2025
•
213k
•
236
•
2
lumees/age-specific-text-simplification
Viewer
•
Updated
Aug 13, 2025
•
17.2k
•
43
•
2