reward-gpt-b6 / checkpoint-500
bradmin's picture
Training in progress, step 500, checkpoint
035d30c