arxiv:2412.11768
Lichuan
CodexXiang
AI & ML interests
None yet
Recent Activity
Commented on a paper about 23 hours ago: No More Adam: Learning Rate Scaling at Initialization is All You Need
Authored a paper about 24 hours ago: No More Adam: Learning Rate Scaling at Initialization is All You Need
Organizations
Papers: 1
Models: None public yet
Datasets: None public yet