arxiv:2501.18965
Alexander Hägele
haeggee
AI & ML interests
None yet
Recent Activity
authored
a paper
about 19 hours ago
The Surprising Agreement Between Convex Optimization Theory and
Learning-Rate Scheduling for Large Model Training
authored
a paper
8 months ago
Scaling Laws and Compute-Optimal Training Beyond Fixed Training
Durations