QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 6 days ago • 156
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems Paper • 2510.11652 • Published 6 days ago • 26
Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution Paper • 2509.25301 • Published 20 days ago • 16
Towards Personalized Deep Research: Benchmarks and Evaluations Paper • 2509.25106 • Published 20 days ago • 27
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8 • 186
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published Aug 6 • 127
mmoradi/Robust-Biomed-RoBERTa-RelationClassification Feature Extraction • Updated Oct 6, 2021 • 2 • 2