[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs Paper • 2412.05819 • Published Dec 8, 2024
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation Paper • 2412.03409 • Published Dec 4, 2024
RepViT: Revisiting Mobile CNN From ViT Perspective Paper • 2307.09283 • Published Jul 18, 2023 • 2