Uploaded model
- Developed by: hoonikoo
- License: apache-2.0
- Finetuned from model : maywell/TinyWand-DPO
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
Model tree for hoonikoo/lora_model_1115_1b
Base model
maywell/TinyWand-DPO