Daniel van Strien's picture

Building on HF

Daniel van Strien PRO

davanstrien

huggingface

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset about 5 hours ago

davanstrien/ocr-bench-ufo-judge-30b

published a dataset about 5 hours ago

davanstrien/ocr-bench-ufo-judge-30b

new activity about 5 hours ago

davanstrien/ocr-bench-ufo:Add rednote-hilab/dots.ocr OCR results (50 samples) [dots-ocr]

View all activity

Organizations

upvoted an article 3 days ago

Article

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling

3 days ago

•

42

upvoted a collection 4 days ago

LLaDA2.1

3 items • Updated 3 days ago • 18

upvoted an article 4 days ago

Article

Transformers.js v4 Preview: Now Available on NPM!

7 days ago

•

66

upvoted 2 papers 6 days ago

EvasionBench: Detecting Evasive Answers in Financial Q&A via Multi-Model Consensus and LLM-as-Judge

Paper • 2601.09142 • Published Jan 14 • 10

compar:IA: The French Government's LLM arena to collect French-language human prompts and preference data

Paper • 2602.06669 • Published 9 days ago • 6

upvoted an article 10 days ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

+5

12 days ago

•

67

upvoted a collection 11 days ago

GLiNER-bi-V2

4 items • Updated 16 days ago • 5

upvoted 2 collections 12 days ago

Qwen3-Coder-Next

4 items • Updated 12 days ago • 78

🌌 Borealis Preview

Preview release of the Borealis family of instruction tuned models by the National Library of Norway. • 18 items • Updated 2 days ago • 8

upvoted a paper 16 days ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published 17 days ago • 26

upvoted a collection 17 days ago

Qwen3-ASR

4 items • Updated 17 days ago • 48

upvoted 2 articles 20 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

Dec 1, 2025

•

298

Article

Why Your AI Strategy Needs Hugging Face Storage

20 days ago

•

12

upvoted a collection 22 days ago

Qwen3-TTS

7 items • Updated 24 days ago • 294

upvoted an article 27 days ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

27 days ago

•

81

upvoted a paper 27 days ago

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Paper • 2512.14698 • Published Dec 16, 2025 • 21

upvoted a collection about 1 month ago

TranslateGemma

3 items • Updated Jan 15 • 209

upvoted a paper about 1 month ago

Perceptual Taxonomy: Evaluating and Guiding Hierarchical Scene Reasoning in Vision-Language Models

Paper • 2511.19526 • Published Nov 24, 2025 • 3

upvoted a collection about 1 month ago

Qwen3-VL-Embedding

2 items • Updated Jan 8 • 59

upvoted an article about 1 month ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

+1

Mar 22, 2024

•

128