Support user-defined prompt processing strategies for dpo (#1248) 1e3d530 nopperl winglian committed on Feb 26
precompute dpo logprobs setting and fixes (#1199) [skip ci] 33e1170 winglian committed on Jan 25