INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models Paper • 2306.04757 • Published Jun 7, 2023 • 6
Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation Paper • 2308.01240 • Published Aug 2, 2023 • 2
Can Large Language Models Understand Real-World Complex Instructions? Paper • 2309.09150 • Published Sep 17, 2023 • 2
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection Paper • 2308.10819 • Published Aug 17, 2023
InFoBench: Evaluating Instruction Following Ability in Large Language Models Paper • 2401.03601 • Published Jan 7 • 7
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models Paper • 2310.20410 • Published Oct 31, 2023 • 1
Is Prompt All You Need? No. A Comprehensive and Broader View of Instruction Learning Paper • 2303.10475 • Published Mar 18, 2023 • 2
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once? Paper • 2402.11597 • Published Feb 18 • 1
Diverse and Fine-Grained Instruction-Following Ability Exploration with Synthetic Data Paper • 2407.03942 • Published Jul 4
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models Paper • 2409.16191 • Published Sep 24 • 41
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19 • 133
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning Paper • 2312.01552 • Published Dec 4, 2023 • 30
Eliciting Instruction-tuned Code Language Models' Capabilities to Utilize Auxiliary Function for Code Generation Paper • 2409.13928 • Published Sep 20 • 1
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10 • 3
From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification Paper • 2403.06326 • Published Mar 10 • 1
Batch Prompting: Efficient Inference with Large Language Model APIs Paper • 2301.08721 • Published Jan 19, 2023 • 1
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models Paper • 2409.18943 • Published Sep 27 • 26
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data Paper • 2410.01560 • Published Oct 2 • 3
Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization Paper • 2410.04717 • Published about 1 month ago • 17
Rethinking Data Selection at Scale: Random Selection is Almost All You Need Paper • 2410.09335 • Published 25 days ago • 14