Yifan Hao
yifanhao99
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM
Training
upvoted
a
paper
about 1 month ago
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL
Training
upvoted
a
paper
4 months ago
Chain-of-Experts: Unlocking the Communication Power of
Mixture-of-Experts Models