Max
reciprocate
AI & ML interests
Reward models
Recent Activity
liked a dataset 17 days ago: microsoft/orca-math-word-problems-200k
liked a model 29 days ago: stabilityai/stable-diffusion-3.5-large
reciprocate's activity
fix(readme): rename `map` -> `filter` in code for selecting subset
#3 opened 6 months ago by reciprocate
change mt bench plot
#1 opened 12 months ago by reciprocate
is it reward model? how can we use it?
#1 opened over 1 year ago by Asaf-Yehudai