RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 5 days ago • 42
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18 • 10
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models Paper • 2402.07865 • Published Feb 12 • 12
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation Paper • 2311.08877 • Published Nov 15, 2023 • 6
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records Paper • 2308.14089 • Published Aug 27, 2023 • 28
Robust Distortion-free Watermarks for Language Models Paper • 2307.15593 • Published Jul 28, 2023 • 8
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training Paper • 2305.14342 • Published May 23, 2023
SQuAD: 100,000+ Questions for Machine Comprehension of Text Paper • 1606.05250 • Published Jun 16, 2016 • 3
Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs Paper • 2305.02440 • Published May 3, 2023 • 1
Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models Paper • 2305.17311 • Published May 27, 2023 • 1