VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Paper • 2503.09402 • Published 3 days ago • 6
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published 3 days ago • 14
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval Paper • 2503.08644 • Published 4 days ago • 16
Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published 4 days ago • 25
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published 5 days ago • 73
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 4 days ago • 89
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Paper • 2503.04812 • Published 11 days ago • 12
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models Paper • 2503.06749 • Published 5 days ago • 21
Automated Movie Generation via Multi-Agent CoT Planning Paper • 2503.07314 • Published 5 days ago • 36
Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning Paper • 2503.07002 • Published 5 days ago • 36
An Empirical Study on Eliciting and Improving R1-like Reasoning Models Paper • 2503.04548 • Published 9 days ago • 8
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published 9 days ago • 14
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning Paper • 2503.05379 • Published 8 days ago • 31
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper • 2503.05132 • Published 8 days ago • 48
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching Paper • 2503.05179 • Published 8 days ago • 42
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published 8 days ago • 104
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Paper • 2503.05592 • Published 8 days ago • 25
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper • 2503.03983 • Published 9 days ago • 22
IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval Paper • 2503.04644 • Published 9 days ago • 20