Penghui Qi
QPHutu
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
upvoted
a
paper
11 days ago
Variational Reasoning for Language Models
liked
a dataset
12 days ago
SynthLabsAI/Big-Math-RL-Verified