Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs Paper • 2509.22646 • Published Sep 26 • 16
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models Paper • 2412.03548 • Published Dec 4, 2024 • 17
Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment Paper • 2411.17188 • Published Nov 26, 2024 • 21