DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 13 days ago • 45
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published 14 days ago • 71
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks Paper • 2408.03615 • Published Aug 7 • 30
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17 • 48
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Paper • 2407.04363 • Published Jul 5 • 27
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding Paper • 2403.11481 • Published Mar 18 • 12
Evaluating Very Long-Term Conversational Memory of LLM Agents Paper • 2402.17753 • Published Feb 27 • 18
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement Paper • 2401.14215 • Published Jan 25 • 2
Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries Paper • 2402.13043 • Published Feb 20 • 2
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published 13 days ago • 24
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published 14 days ago • 56
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published 14 days ago • 64
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published 13 days ago • 24
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published 17 days ago • 116
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published 17 days ago • 47
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 19 days ago • 43
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection Paper • 2412.04455 • Published 18 days ago • 35
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules Paper • 2310.08992 • Published Oct 13, 2023 • 10
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published 18 days ago • 9
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 19 days ago • 118
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published 19 days ago • 12
Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs Paper • 2411.08719 • Published Nov 10
Little Giants: Synthesizing High-Quality Embedding Data at Scale Paper • 2410.18634 • Published Oct 24
A Survey on Data Synthesis and Augmentation for Large Language Models Paper • 2410.12896 • Published Oct 16
Self-Improvement in Language Models: The Sharpening Mechanism Paper • 2412.01951 • Published 20 days ago
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published Nov 12 • 62
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published 24 days ago • 55
MALT: Improving Reasoning with Multi-Agent LLM Training Paper • 2412.01928 • Published 21 days ago • 38
Multi-Agent Large Language Models for Conversational Task-Solving Paper • 2410.22932 • Published Oct 30
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning Paper • 2412.03248 • Published 19 days ago • 25
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation Paper • 2412.02592 • Published 20 days ago • 20
Scaling Image Tokenizers with Grouped Spherical Quantization Paper • 2412.02632 • Published 20 days ago • 10
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models Paper • 2412.01824 • Published 21 days ago • 65
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published 25 days ago • 32
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning Paper • 2412.00568 • Published 23 days ago • 14
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos Paper • 2412.01800 • Published 21 days ago • 6
A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models Paper • 2411.19477 • Published 24 days ago • 5
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting Paper • 2412.00869 • Published 22 days ago • 4
World-consistent Video Diffusion with Explicit 3D Modeling Paper • 2412.01821 • Published 21 days ago • 4
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset Paper • 2411.15640 • Published 30 days ago • 4
Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens Paper • 2411.17691 • Published 27 days ago • 9
Learning 3D Representations from Procedural 3D Programs Paper • 2411.17467 • Published 28 days ago • 8
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published 27 days ago • 47
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published 28 days ago • 40
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge Paper • 2411.16594 • Published 28 days ago • 36
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published Nov 20 • 25
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published 28 days ago • 8
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models Paper • 2411.15671 • Published 29 days ago • 7
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published about 1 month ago • 55
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20 • 20
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games Paper • 2411.13543 • Published Nov 20 • 18
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21 • 57
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21 • 28
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20 • 38
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models Paper • 2411.14257 • Published Nov 21 • 9
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper • 2411.13503 • Published Nov 20 • 30
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper • 2411.10958 • Published Nov 17 • 50
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Paper • 2411.13476 • Published Nov 20 • 14
Continuous Speculative Decoding for Autoregressive Image Generation Paper • 2411.11925 • Published Nov 18 • 15
Building Trust: Foundations of Security, Safety and Transparency in AI Paper • 2411.12275 • Published Nov 19 • 10
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages Paper • 2411.12240 • Published Nov 19 • 6
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published Nov 16 • 44
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper • 2411.11504 • Published Nov 18 • 19
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published Nov 18 • 17
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering Paper • 2411.09213 • Published Nov 14 • 6
Evaluating the role of `Constitutions' for learning from AI feedback Paper • 2411.10168 • Published Nov 15 • 5
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use Paper • 2411.10323 • Published Nov 15 • 31
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published Nov 14 • 71
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11 • 34
Scaling Properties of Diffusion Models for Perceptual Tasks Paper • 2411.08034 • Published Nov 12 • 13
GRS-QA -- Graph Reasoning-Structured Question Answering Dataset Paper • 2411.00369 • Published Nov 1 • 6
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models Paper • 2410.13080 • Published Oct 16
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published Oct 31 • 59
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference Paper • 2410.21465 • Published Oct 28 • 11
RARe: Retrieval Augmented Retrieval with In-Context Examples Paper • 2410.20088 • Published Oct 26 • 5
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published Nov 6 • 30
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7 • 111
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7 • 49
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model Paper • 2411.04496 • Published Nov 7 • 22
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5 • 60
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published Nov 4 • 46
How Far is Video Generation from World Model: A Physical Law Perspective Paper • 2411.02385 • Published Nov 4 • 33
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published Nov 4 • 24
Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published Nov 4 • 23
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks Paper • 2411.01192 • Published Nov 2 • 3
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published Oct 30 • 46
Survey of User Interface Design and Interaction Techniques in Generative AI Applications Paper • 2410.22370 • Published Oct 28 • 11
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments Paper • 2410.23918 • Published Oct 31 • 18
On Memorization of Large Language Models in Logical Reasoning Paper • 2410.23123 • Published Oct 30 • 18
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions Paper • 2410.20424 • Published Oct 27 • 38
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization Paper • 2410.19609 • Published Oct 25 • 17
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant Paper • 2410.18603 • Published Oct 24 • 31
Counting Ability of Large Language Models and Impact of Tokenization Paper • 2410.19730 • Published Oct 25 • 10
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22 • 89
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch Paper • 2410.18693 • Published Oct 24 • 40
Unbounded: A Generative Infinite Game of Character Life Simulation Paper • 2410.18975 • Published Oct 24 • 35
Multi-Draft Speculative Sampling: Canonical Architectures and Theoretical Limits Paper • 2410.18234 • Published Oct 23 • 3
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published Oct 23 • 18
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Paper • 2311.05556 • Published Nov 9, 2023 • 82
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Paper • 2310.04378 • Published Oct 6, 2023 • 19
Aligning Text-to-Image Diffusion Models with Reward Backpropagation Paper • 2310.03739 • Published Oct 5, 2023 • 21
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published 11 days ago • 6
The Role of Summarization in Generative Agents: A Preliminary Perspective Paper • 2305.01253 • Published May 2, 2023
Generative Agents: Interactive Simulacra of Human Behavior Paper • 2304.03442 • Published Apr 7, 2023 • 12
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents Paper • 2403.08715 • Published Mar 13 • 20
DRLC: Reinforcement Learning with Dense Rewards from LLM Critic Paper • 2401.07382 • Published Jan 14 • 2
Secrets of RLHF in Large Language Models Part II: Reward Modeling Paper • 2401.06080 • Published Jan 11 • 26
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs Paper • 2403.05020 • Published Mar 8 • 2
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards Paper • 2403.07708 • Published Mar 12
Large Language Model-based Human-Agent Collaboration for Complex Task Solving Paper • 2402.12914 • Published Feb 20
Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions Paper • 2408.15787 • Published Aug 28
Building Cooperative Embodied Agents Modularly with Large Language Models Paper • 2307.02485 • Published Jul 5, 2023 • 11
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents Paper • 2412.03563 • Published 19 days ago
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios Paper • 2410.19346 • Published Oct 25
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents Paper • 2411.00927 • Published Nov 1
Positive Experience Reflection for Agents in Interactive Text Environments Paper • 2411.02223 • Published Nov 4
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons Paper • 2412.08442 • Published 12 days ago
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23 • 68
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1 • 42
CodeNav: Beyond tool-use to using real-world codebases with LLM agents Paper • 2406.12276 • Published Jun 18
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale Paper • 2409.16299 • Published Sep 9 • 10