library_name: transformers | |
base_model: | |
- microsoft/Phi-3.5-mini-instruct | |
datasets: | |
- jondurbin/gutenberg-dpo-v0.1 | |
license: mit | |
![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ) | |
# QuantFactory/phi3.5-gutenberg-4B-GGUF | |
This is quantized version of [nbeerbower/phi3.5-gutenberg-4B](https://huggingface.co/nbeerbower/phi3.5-gutenberg-4B) created using llama.cpp | |
# Original Model Card | |
# phi3.5-gutenberg-4B | |
[microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) finetuned on [jondurbin/gutenberg-dpo-v0.1](https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1). | |
### Method | |
Finetuned using 2x RTX 4060 Ti for 3 epochs. | |
[Fine-tune Llama 3 with ORPO](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html) | |