--- base_model: Goekdeniz-Guelmez/Josiefied-Qwen2.5-1.5B-Instruct-abliterated-v1 language: - en license: apache-2.0 tags: - text-generation-inference - transformers - unsloth - qwen2 - trl - dpo --- # Uploaded model - **Developed by:** Goekdeniz-Guelmez - **License:** apache-2.0 - **Finetuned from model :** Goekdeniz-Guelmez/Josiefied-Qwen2.5-1.5B-Instruct-abliterated-v1 This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library. [

](https://github.com/unslothai/unsloth) ## A experimental DPO training with a custom dataset.