Loss-to-Loss Prediction: Scaling Laws for All Datasets Paper • 2411.12925 • Published 3 days ago
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training Paper • 2406.10670 • Published Jun 15