AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

Recent Activity

TungHoi updated a dataset 1 day ago: PKU-Alignment/DollyTails-12K
TungHoi published a dataset 1 day ago: PKU-Alignment/DollyTails-12K
XuyaoWang updated a model 21 days ago: PKU-Alignment/AnyRewardModel