view article Article There is no such thing as a tokenizer-free lunch By catherinearnett • 28 days ago • 83
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published Dec 4, 2024 • 15