12 12 28

Kaiyan Zhang

iseesaw

iseesaw

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Diverse Inference and Verification for Advanced Reasoning

commented on a paper 1 day ago

Diverse Inference and Verification for Advanced Reasoning

liked a Space 2 days ago

gaia-benchmark/leaderboard

View all activity

Organizations

iseesaw's activity

upvoted a paper 1 day ago

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published 4 days ago • 11

upvoted an article 2 days ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1, 2024

• 66

upvoted 3 articles 7 days ago

Article

Open-source DeepResearch – Freeing our search agents

14 days ago

• 1.01k

Article

What is test-time compute and how to scale it?

and 1 other •

12 days ago

• 31

Article

Open R1: Update #2

and 6 others •

8 days ago

• 173

upvoted a paper 7 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 8 days ago • 128

upvoted a paper 18 days ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published 19 days ago • 21

upvoted an article about 1 month ago

Article

Process Reinforcement through Implicit Rewards

and 1 other •

Jan 3

• 24

upvoted a collection about 1 month ago

Reasoning Datasets

Collection

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 24

upvoted a paper about 2 months ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 41

upvoted a paper 3 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32

upvoted a paper 7 months ago

Towards Building Specialized Generalist AI with System 1 and System 2 Fusion

Paper • 2407.08642 • Published Jul 11, 2024 • 10