Daniel van Strien's picture

Building on HF

Daniel van Strien PRO

davanstrien

huggingface

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset 8 minutes ago

librarian-bots/model_cards_with_metadata

updated a dataset 22 minutes ago

librarian-bots/dataset_cards_with_metadata

updated a dataset about 2 hours ago

librarian-bots/dataset-columns

View all activity

Organizations

New activity in uv-scripts/transcription 5 days ago

Add easytranscriber-transcribe.py for word-level alignment

#1 opened 5 days ago by

New activity in toad-hf-inference-explorers/README 10 days ago

DIdn't get Inference Provider credits

#1 opened 3 months ago by

Remove free credits messaging

#2 opened 10 days ago by

New activity in LabelStudio/LabelStudio 11 days ago

Add HF Storage Bucket persistence support

#8 opened 11 days ago by

New activity in llamaindex/ParseBench 11 days ago

Fix YAML indentation in eval.yaml

#2 opened 11 days ago by

New activity in datalab-to/chandra 30 days ago

Add new model version metadata

#10 opened 30 days ago by

New activity in TheBritishLibrary/blbooksgenre 30 days ago

Convert to Parquet format (remove legacy loading script)

#4 opened 30 days ago by

New activity in davanstrien/ocr-bench-britannica about 1 month ago

Add Qianfan-OCR results (10 samples) [qianfan-ocr]

#7 opened about 1 month ago by

Add rednote-hilab/dots.mocr OCR results (10 samples) [dots-mocr]

#6 opened about 1 month ago by

New activity in datalab-to/chandra-ocr-2 about 1 month ago

Add olmOCR-bench evaluation results

#1 opened about 1 month ago by

commented a paper about 1 month ago

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Paper • 2502.10341 • Published Feb 14, 2025 • 3 •

New activity in davanstrien/isl-finepdfs-ocr about 2 months ago

Add deepseek-ai/DeepSeek-OCR OCR results (10 samples) [deepseek-ocr]

#3 opened about 2 months ago by

Add rednote-hilab/dots.ocr OCR results (10 samples) [dots-ocr]

#2 opened about 2 months ago by

Add zai-org/GLM-OCR OCR results (10 samples) [glm-ocr]

#1 opened about 2 months ago by

New activity in davanstrien/ocr-bench-britannica about 2 months ago

Add FireRedTeam/FireRed-OCR OCR results (50 samples) [firered-ocr]

#5 opened about 2 months ago by

New activity in davanstrien/bpl-ocr-bench about 2 months ago

Add rednote-hilab/dots.ocr OCR results (50 samples) [dots-ocr]

#2 opened 2 months ago by

Add rednote-hilab/dots.ocr OCR results (50 samples) [dots-ocr]

#5 opened about 2 months ago by

New activity in davanstrien/ocr-bench-britannica about 2 months ago

Add zai-org/GLM-OCR OCR results (50 samples) [glm-ocr]

#4 opened about 2 months ago by

Add deepseek-ai/DeepSeek-OCR OCR results (50 samples) [deepseek-ocr]

#3 opened about 2 months ago by

Add rednote-hilab/dots.ocr OCR results (50 samples) [dots-ocr]

#2 opened about 2 months ago by