7 132 47

Frank Sommers PRO

fsommers

fsommers

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

leo-vnuuet/ColQwen3.5-0.8B-Embedding

upvoted an article 2 days ago

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

updated a collection 3 days ago

Misc papers

View all activity

Organizations

liked a model 2 days ago

leo-vnuuet/ColQwen3.5-0.8B-Embedding

Feature Extraction • Updated about 19 hours ago • 94 • 3

upvoted an article 2 days ago

Article

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

14 days ago

•

updated a collection 3 days ago

Misc papers

Collection

19 items • Updated 3 days ago

upvoted a paper 3 days ago

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

Paper • 2603.16932 • Published 14 days ago • 84

upvoted a paper 9 days ago

Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published 17 days ago • 150

liked a model 9 days ago

Qwen/Qwen3.5-4B

Image-Text-to-Text • 5B • Updated 26 days ago • 2.28M • 411

liked a model 10 days ago

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated 26 days ago • 3.95M • • 1.05k

liked a model 12 days ago

ModernVBERT/colmodernvbert

Visual Document Retrieval • Updated Oct 2, 2025 • 4k • 29

upvoted a collection 21 days ago

Qwen3.5

Collection

21 items • Updated 19 days ago • 1.33k

upvoted an article about 1 month ago

Article

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

Feb 4

•

upvoted an article about 2 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025

•

789

liked a model about 2 months ago

zai-org/GLM-OCR

Image-to-Text • Updated 16 days ago • 3.85M • • 1.48k

upvoted a paper about 2 months ago

DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published Jan 28 • 66

upvoted 2 papers 2 months ago

GutenOCR: A Grounded Vision-Language Front-End for Documents

Paper • 2601.14490 • Published Jan 20 • 37

Typhoon OCR: Open Vision-Language Model For Thai Document Extraction

Paper • 2601.14722 • Published Jan 21 • 15

upvoted a collection 2 months ago

PP-OCRv5

Collection

PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 52

upvoted a paper 4 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 264

upvoted an article 4 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

307

upvoted 2 papers 4 months ago

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 25

TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

Paper • 2511.16528 • Published Nov 20, 2025 • 24

Frank Sommers PRO

AI & ML interests

Recent Activity

Organizations

fsommers's activity

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Transformers v5: Simple model definitions powering the AI ecosystem