Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation Paper β’ 2506.21876 β’ Published Jun 27 β’ 28
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations Paper β’ 2402.02095 β’ Published Feb 3, 2024
Benchmarking Chinese Knowledge Rectification in Large Language Models Paper β’ 2409.05806 β’ Published Sep 9, 2024 β’ 15
Defining and Extracting generalizable interaction primitives from DNNs Paper β’ 2401.16318 β’ Published Jan 29, 2024 β’ 1
DCA-Bench: A Benchmark for Dataset Curation Agents Paper β’ 2406.07275 β’ Published Jun 11, 2024 β’ 1
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters Paper β’ 2406.05955 β’ Published Jun 10, 2024 β’ 27
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs Paper β’ 2402.03804 β’ Published Feb 6, 2024 β’ 4
PowerInfer-2: Fast Large Language Model Inference on a Smartphone Paper β’ 2406.06282 β’ Published Jun 10, 2024 β’ 38
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU Paper β’ 2312.12456 β’ Published Dec 16, 2023 β’ 44