On the SDEs and Scaling Rules for Adaptive Gradient Algorithms Paper • 2205.10287 • Published May 20, 2022
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? Paper • 2501.02669 • Published Jan 5 • 1
AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models Paper • 2505.00147 • Published Apr 30 • 4
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs Paper • 2509.22646 • Published Sep 26 • 16
OLMo-150M and OLMo-1B Pretrained Models Collection Pretrained models from scratch used in "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining". • 12 items • Updated Jul 7 • 3
Task-Specific Skill Localization in Fine-tuned Language Models Paper • 2302.06600 • Published Feb 13, 2023