PSFT+RL models
SII-Wenhong
wh-zhu
AI & ML interests
None yet
Recent Activity
Updated a dataset about 8 hours ago: wh-zhu/train_openr1_8k
Published a dataset about 8 hours ago: wh-zhu/train_openr1_8k
Authored a paper 6 days ago: Flexible Realignment of Language Models
Organizations
None yet