RegMix: Data Mixture as Regression for Language Model Pre-training Paper โข 2407.01492 โข Published Jul 1, 2024 โข 40 โข 7