Yury Panikov

panikov

panikov

AI & ML interests

None yet

Recent Activity

commentedon a paper 20 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

commentedon a paper 20 days ago

Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces

commentedon a paper 20 days ago

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

View all activity

Organizations

None yet

upvoted 20 papers 21 days ago

Chain of Evidence: Pixel-Level Visual Attribution for Iterative Retrieval-Augmented Generation

Paper • 2605.01284 • Published 28 days ago • 3

How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum

Paper • 2604.25907 • Published Apr 28 • 3

A Benchmark for Interactive World Models with a Unified Action Generation Framework

Paper • 2605.03941 • Published 25 days ago • 5

Healthcare AI GYM for Medical Agents

Paper • 2605.02943 • Published 29 days ago • 4

Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO

Paper • 2604.27488 • Published 30 days ago • 7

SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion

Paper • 2605.01466 • Published 28 days ago • 6

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

Paper • 2605.01371 • Published 28 days ago • 6

TCDA: Thread-Constrained Discourse-Aware Modeling for Conversational Sentiment Quadruple Analysis

Paper • 2605.01717 • Published 27 days ago • 6

StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing

Paper • 2605.02904 • Published Apr 5 • 8

Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces

Paper • 2605.02801 • Published 26 days ago • 8

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies

Paper • 2605.03596 • Published 25 days ago • 10

PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination

Paper • 2605.03571 • Published 25 days ago • 7

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Paper • 2605.02913 • Published Apr 8 • 9

SVGS: Enhancing Gaussian Splatting Using Primitives with Spatially Varying Colors

Paper • 2411.18966 • Published 26 days ago • 9

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 29 days ago • 49

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

Paper • 2605.04036 • Published 25 days ago • 68

Yury Panikov

AI & ML interests

Recent Activity

Organizations

panikov's activity