Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Paper • 2504.19413 • Published Apr 28, 2025 • 41
MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources Paper • 2601.22054 • Published 6 days ago • 5
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 29 days ago • 146
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models Paper • 2601.07372 • Published 23 days ago • 40
UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented Generation Paper • 2504.08761 • Published Mar 31, 2025 • 7
What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge Paper • 2601.10922 • Published 20 days ago • 3
Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning Paper • 2601.13697 • Published 16 days ago • 3
FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation Paper • 2601.13976 • Published 15 days ago • 21
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization Paper • 2601.12993 • Published 16 days ago • 75
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published 28 days ago • 52
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection Paper • 2512.23273 • Published Dec 29, 2025 • 14
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published Dec 18, 2025 • 95
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published Dec 18, 2025 • 216