wx13
wx13
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
14 days ago
Self-rewarding correction for mathematical reasoning
liked
a dataset
10 months ago
RLHFlow/prompt-collection-v0.1
upvoted
a
collection
10 months ago
Online RLHF
Organizations
None yet
models
None public yet
datasets
None public yet