gpt2-tulu2-DPO / README.md
kejian's picture
Update README.md
beaadeb verified

gpt-2 small (124M) on tulu-sft-v2 mixture + DPO 40k on UltraFeedback