RLHF Models

sr5434 's Collections

updated Mar 10

A set of models from my experiments with Reinforcement Learning from Human Feedback