gpt2-tulu2-DPO / README.md
kejian's picture
Update README.md
ad664bc verified
|
raw
history blame
69 Bytes
gpt-2 small (124M) on tulu2-sft mixture
Then DPO 40k on UltraFeedback