Sijun Tan

sijuntan

jeffreysijuntan

sijuntan's activity

upvoted a paper 10 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 12 days ago • 236

upvoted a paper 18 days ago

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published 21 days ago • 21

liked a dataset 3 months ago

ScalerLab/JudgeBench

Viewer • Updated Oct 9, 2024 • 620 • 149 • 4

updated a dataset 3 months ago

IAMJB/paper-central-pr

Viewer • Updated Oct 29, 2024 • 15 • 277

New activity in IAMJB/paper-central-pr 3 months ago

Add new entry for arXiv ID 2410.12784

#8 opened 3 months ago by sijuntan

authored a paper 3 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 43

liked a Space 3 months ago

🏆 JudgeBench Leaderboard

Space • Running

upvoted a paper 3 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 43

commented on a paper 3 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 43

upvoted a paper 4 months ago

LLoCO: Learning Long Contexts Offline

Paper • 2404.07979 • Published Apr 11, 2024 • 21

authored a paper 9 months ago

LLoCO: Learning Long Contexts Offline

Paper • 2404.07979 • Published Apr 11, 2024 • 21