Qwen1.5-0.5B-dpo-mix-7k-3000 / trl /test_orpo_trainer_demo.py

Commit History

Upload folder using huggingface_hub
4ad32d0
verified

burtenshaw HF staff commited on