sarthakrw
/

dpo_model

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

dpo_model / runs /Sep15_17-37-17_b1311fcc90ca

1 contributor

History: 1 commit

sarthakrw's picture

sarthakrw/SmolLM-FT-CoEdIT-DPO

5975bd4 verified 5 months ago

events.out.tfevents.1726421859.b1311fcc90ca.1827.2

10.1 kB
LFS

sarthakrw/SmolLM-FT-CoEdIT-DPO 5 months ago