Commit History

add handling for argilla dpo-mix (#1397)
8a82d2e
unverified

winglian commited on

Support user-defined prompt processing strategies for dpo (#1248)
1e3d530
unverified

nopperl winglian commited on

precompute dpo logprobs setting and fixes (#1199) [skip ci]
33e1170
unverified

winglian commited on

DPO cleanup (#1126)
7523d1f
unverified

winglian plaguss HF staff commited on