jasmineWang's picture

jasmineWang

Jessamine

·

AI & ML interests

None yet

Recent Activity

commentedon a paper 1 day ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

updated a dataset 1 day ago

NJU-LINK/TELBench

updated a collection 1 day ago

View all activity

Organizations

upvoted a collection 1 day ago

Agent Papers

4 items • Updated 1 day ago • 1

upvoted 3 papers 1 day ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 5 days ago • 83

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Paper • 2606.01993 • Published 4 days ago • 13

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 5 days ago • 49

upvoted 2 papers 3 days ago

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

Paper • 2606.02320 • Published 5 days ago • 13

Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding

Paper • 2605.29707 • Published 9 days ago • 139

upvoted a paper 4 days ago

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Paper • 2605.30351 • Published 9 days ago • 26

upvoted a paper 10 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 12 days ago • 38

upvoted a paper 25 days ago

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published 30 days ago • 52

upvoted 6 papers about 2 months ago

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published Apr 20 • 81

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published Apr 20 • 22

Vero: An Open RL Recipe for General Visual Reasoning

Paper • 2604.04917 • Published Apr 6 • 33

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Paper • 2604.14683 • Published Apr 16 • 36

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 109

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published Apr 13 • 38

upvoted a paper 2 months ago

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Paper • 2604.03016 • Published Apr 3 • 37

upvoted a paper 3 months ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 49

upvoted 3 papers 4 months ago

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

Paper • 2601.21296 • Published Jan 29 • 21

SERA: Soft-Verified Efficient Repository Agents

Paper • 2601.20789 • Published Jan 28 • 13

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published Jan 28 • 119