OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization Paper • 2212.12017 • Published Dec 22, 2022 • 1 • 1
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64 • 4
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference Paper • 2403.04132 • Published Mar 7, 2024 • 40 • 2
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 5 days ago • 75 • 6
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 127 • 11
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published 6 days ago • 129 • 5
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published Dec 11, 2025 • 45 • 6
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 4 days ago • 150 • 3
CloneMem: Benchmarking Long-Term Memory for AI Clones Paper • 2601.07023 • Published 8 days ago • 2 • 1
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models Paper • 2408.00724 • Published Aug 1, 2024 • 2 • 1
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 5 days ago • 110 • 3
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 7 days ago • 107 • 4
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 15 days ago • 41 • 3
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 10 days ago • 159 • 6