AlexLI's picture
1

AlexLI

AlexLINB
ยท

AI & ML interests

None yet

Recent Activity

Organizations

None yet

AlexLINB's activity

replied to di-zhang-fdu's post about 1 month ago
reacted to di-zhang-fdu's post with ๐Ÿ‘€ about 1 month ago
view post
Post
2605
LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.
We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.
ยท