Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations • Paper • 2411.00640 • Published 23 days ago • 3
The Prompt Report: A Systematic Survey of Prompting Techniques • Paper • 2406.06608 • Published Jun 6 • 55
Training language models to follow instructions with human feedback • Paper • 2203.02155 • Published Mar 4, 2022 • 16
RULER: What's the Real Context Size of Your Long-Context Language Models? • Paper • 2404.06654 • Published Apr 9 • 34
Cosmos Tokenizer • Collection • A suite of image and video tokenizers • 10 items • Updated 18 days ago • 19