DUAL-GPO
/

phi-2-dpo-chatml-lora-10k-30k-i1

alignment-handbook

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

phi-2-dpo-chatml-lora-10k-30k-i1 / runs /Sep10_19-37-45_gpu4-119-5

1 contributor

History: 3 commits

BraylonDash's picture

Model save

18011e1 verified 3 months ago

events.out.tfevents.1725961119.gpu4-119-5.645890.0

25.6 kB
LFS

Model save 3 months ago