Concurrent Adversarial Learning for Large-Batch Training Paper • 2106.00221 • Published Jun 1, 2021 • 1
Rethinking Architecture Selection in Differentiable NAS Paper • 2108.04392 • Published Aug 10, 2021
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning Paper • 2402.15751 • Published Feb 24, 2024
MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries? Paper • 2406.17806 • Published Jun 22, 2024 • 1
One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts Paper • 2407.00256 • Published Jun 28, 2024 • 1
Understanding the Impact of Negative Prompts: When and How Do They Take Effect? Paper • 2406.02965 • Published Jun 5, 2024
MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion Paper • 2402.12741 • Published Feb 20, 2024
LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs -- No Silver Bullet for LC or RAG Routing Paper • 2502.09977 • Published 28 days ago • 1
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper • 2503.05132 • Published 7 days ago • 47
GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers Paper • 2412.09722 • Published Dec 12, 2024 • 5
VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information Paper • 2412.00947 • Published Dec 1, 2024 • 8
AAAR-1.0: Assessing AI's Potential to Assist Research Paper • 2410.22394 • Published Oct 29, 2024 • 16
IMDb data from Two Generations, from 1979 to 2019; Part one, Dataset Introduction and Preliminary Analysis Paper • 2005.14147 • Published May 28, 2020
Evaluating LLMs at Detecting Errors in LLM Responses Paper • 2404.03602 • Published Apr 4, 2024 • 2
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data Paper • 2311.09805 • Published Nov 16, 2023 • 3
WiCE: Real-World Entailment for Claims in Wikipedia Paper • 2303.01432 • Published Mar 2, 2023 • 2
Is Prompt All You Need? No. A Comprehensive and Broader View of Instruction Learning Paper • 2303.10475 • Published Mar 18, 2023 • 2
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following Paper • 2312.02436 • Published Dec 5, 2023 • 1
UMIE: Unified Multimodal Information Extraction with Instruction Tuning Paper • 2401.03082 • Published Jan 5, 2024 • 1