Luka A1150
Collection
Models relating to ticket A1150: How to train closed-source models on negative examples via SFT?
W&B logs: https://wandb.ai/slingshot-ai/luka_A1150/runs/2wmzzpc7/workspace?nw=nwuserlukasmyth96
This model was fine-tuned from slingshot/alfie-10 using SFT on the chosen responses from the preference dataset that team Alfie created during the Llamathon.
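As a rough illustration of what "SFT on the chosen responses" means, the sketch below turns preference pairs into plain supervised examples by keeping the chosen response and discarding the rejected one. The field names `prompt`, `chosen`, and `rejected`, the helper `preference_to_sft`, and the toy records are all assumptions made for this example; the actual schema of team Alfie's dataset and the training code used for this model are not shown here.

```python
from typing import Dict, List


def preference_to_sft(records: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Convert preference pairs into SFT examples.

    Each input record is assumed (hypothetically) to have the keys
    "prompt", "chosen", and "rejected". Only the chosen response is
    kept as the training target; the rejected response is dropped.
    """
    sft_examples = []
    for record in records:
        sft_examples.append(
            {"prompt": record["prompt"], "completion": record["chosen"]}
        )
    return sft_examples


# Toy records for illustration only, not taken from the real dataset.
preference_data = [
    {
        "prompt": "I had a rough day.",
        "chosen": "I'm sorry to hear that. Do you want to talk about it?",
        "rejected": "Okay.",
    },
    {
        "prompt": "Can you help me plan my week?",
        "chosen": "Sure. What are your top priorities?",
        "rejected": "No.",
    },
]

sft_data = preference_to_sft(preference_data)
```

Note that this conversion uses only the positive half of each pair, so the rejected responses (the negative examples the ticket asks about) play no part in the training signal.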