MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control Paper • 2411.13807 • Published 4 days ago • 7
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Paper • 2411.14432 • Published 3 days ago • 15
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published 4 days ago • 33
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published 3 days ago • 32
Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published 3 days ago • 34
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published 9 days ago • 50
Continuous Speculative Decoding for Autoregressive Image Generation Paper • 2411.11925 • Published 6 days ago • 13
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 5 days ago • 42
PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation Paper • 2411.08307 • Published 12 days ago • 6
Direct Preference Optimization Using Sparse Feature-Level Constraints Paper • 2411.07618 • Published 12 days ago • 15
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 12 days ago • 59
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM Paper • 2411.04954 • Published 17 days ago • 7
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published 18 days ago • 30
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published 17 days ago • 34
Game-theoretic LLM: Agent Workflow for Negotiation Games Paper • 2411.05990 • Published 16 days ago • 7
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization Paper • 2411.06208 • Published 15 days ago • 18
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models Paper • 2411.05830 • Published 19 days ago • 20