mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data • Paper • 2502.08468 • Published 5 days ago • 12 upvotes
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU • Paper • 2502.08910 • Published 5 days ago • 125 upvotes
Scaling Pre-training to One Hundred Billion Data for Vision Language Models • Paper • 2502.07617 • Published 6 days ago • 24 upvotes
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling • Paper • 2502.06703 • Published 7 days ago • 125 upvotes
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 • Paper • 2502.03544 • Published 12 days ago • 40 upvotes
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • Paper • 2502.02737 • Published 13 days ago • 175 upvotes
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models • Paper • 2502.01061 • Published 14 days ago • 176 upvotes
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch • Paper • 2501.18512 • Published 18 days ago • 26 upvotes
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 3 items • Updated 21 days ago • 345 upvotes