RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). β’ 4 items β’ Updated Oct 1 β’ 5