Models pretrained from scratch, used in "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining".
Rosie Zhao
rosieyzh
AI & ML interests
theory of machine learning, deep learning
Recent Activity
updated a model 28 days ago: rosieyzh/finemath4_full_150m
published a model 28 days ago: rosieyzh/finemath4_full_150m
updated a model about 1 month ago: rosieyzh/qwen2.5-vl-7b-seed1-step1050