qwerrwe / docs /rlhf.md

Commit History

Update rlhf.md (#1237) [skip ci]
52c83d3
unverified

hamel commited on

more dpo fixes for dataset loading and docs (#1185) [skip ci]
5bce45f
unverified

winglian commited on

Update rlhf.md (#1178) [skip ci]
dc051b8
unverified

Aleksey Korshuk commited on

feat: enable trl's autounwrap (#1060)
b432889
unverified

Nanobit commited on

RL/DPO (#935)
f243c21

winglian commited on