PaTaRM is a Generative Reward Model (GRM) for RLHF alignment.
JianAi
AIJian
AI & ML interests
None yet
Recent Activity
upvoted a paper 41 minutes ago
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting updated a model 12 days ago
AIJian/PaTaRM-8B updated a model 13 days ago
AIJian/PaTaRM