Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 5 days ago • 43
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 4 days ago • 118
The Pitfalls of Memorization: When Memorization Hurts Generalization Paper • 2412.07684 • Published 7 days ago • 1
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published 6 days ago • 4
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Paper • 2412.09596 • Published 5 days ago • 87
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published 8 days ago • 22
Elucidating the Design Space of Diffusion-Based Generative Models Paper • 2206.00364 • Published Jun 1, 2022 • 14
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published 9 days ago • 62
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published 8 days ago • 54 • 7
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published 8 days ago • 54