DynaSaur: Large Language Agents Beyond Predefined Actions Paper • 2411.01747 • Published 2 days ago • 13
Survey of Cultural Awareness in Language Models: Text and Beyond Paper • 2411.00860 • Published 7 days ago • 20
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published 2 days ago • 19
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Paper • 2411.02337 • Published 2 days ago • 27
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models Paper • 2410.23266 • Published 7 days ago • 19
Can Language Models Replace Programmers? REPOCOD Says 'Not Yet' Paper • 2410.21647 • Published 8 days ago • 11
RARe: Retrieval Augmented Retrieval with In-Context Examples Paper • 2410.20088 • Published 11 days ago • 5
Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback Paper • 2410.21242 • Published 9 days ago • 6
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization Paper • 2410.19609 • Published 12 days ago • 14
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation Paper • 2410.23090 • Published 7 days ago • 52
CLEAR: Character Unlearning in Textual and Visual Modalities Paper • 2410.18057 • Published 14 days ago • 196
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines Paper • 2410.21220 • Published 9 days ago • 8
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback Paper • 2410.19133 • Published 13 days ago • 11
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data Paper • 2410.18558 • Published 13 days ago • 17
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published 15 days ago • 86