reward-gpt-b6 / checkpoint-5000
bradmin
Training in progress, step 5000, checkpoint
6c4acc3