Submitted by RTT1 73 Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving · 18 authors 346 1
Submitted by tytyt 55 OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion · 10 authors 1
Submitted by taiwang 46 StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling · 12 authors 210 2
Submitted by happzy2633 42 CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization · 19 authors 42 1
Submitted by judge 31 RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents · 16 authors 2
Submitted by wangrongsheng 26 MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos · 7 authors 21 1
Submitted by yxlu0 20 Is Diversity All You Need for Scalable Robotic Manipulation? · 10 authors 2.33k 1
Submitted by zsytony 20 Coding Triangle: How Does Large Language Model Understand Code? · 6 authors 1
Submitted by guokan-shang 19 Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts · 10 authors 1
Submitted by songtingyu 13 Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers · 5 authors 1
Submitted by BestWishYsh 12 Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation · 5 authors 2
Submitted by xinyu1205 11 High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning · 6 authors 1
Submitted by acharkq 11 PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs · 12 authors 1
Submitted by songdj 11 SAMed-2: Selective Memory Enhanced Medical Segment Anything Model · 14 authors 1
Submitted by ZetangForward 10 LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework · 8 authors 18 1
Submitted by Xuandong 5 The Landscape of Memorization in LLMs: Mechanisms, Measurement, and Mitigation · 4 authors 1
Submitted by ChristophReich1996 4 Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion · 7 authors 61 2
Submitted by mderakhshani 4 NeoBabel: A Multilingual Open Tower for Visual Generation · 4 authors 20 1
Submitted by Gigglingface 1 Does Data Scaling Lead to Visual Compositional Generalization? · 3 authors 1
Submitted by nielsr 1 AXLearn: Modular Large Model Training on Heterogeneous Infrastructure · 37 authors 2.23k 1