On Memorization of Large Language Models in Logical Reasoning Paper • 2410.23123 • Published Oct 30 • 18
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory Paper • 2410.10813 • Published Oct 14 • 9
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published Oct 14 • 15
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos Paper • 2410.02763 • Published Oct 3 • 7
Configurable Foundation Models: Building LLMs from a Modular Perspective Paper • 2409.02877 • Published Sep 4 • 27
Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs Paper • 2406.04460 • Published Jun 6 • 1