simonbutt
/

am_llama3_dpo

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

simonbutt commited on Apr 23

Commit

745ed25

•

1 Parent(s): 1c1d51b

Update README.md

Files changed (1) hide show

README.md +11 -4

README.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 language:
 - en
 license: apache-2.0
 tags:
 - text-generation-inference
@@ -8,16 +9,22 @@ tags:
 - unsloth
 - llama
 - trl
-- dpo
 base_model: unsloth/llama-3-8b-bnb-4bit
 ---
-# Uploaded  model
 - **Developed by:** simonbutt
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/llama-3-8b-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
 language:
 - en
+- am
 license: apache-2.0
 tags:
 - text-generation-inference
 - unsloth
 - llama
 - trl
+- sft
 base_model: unsloth/llama-3-8b-bnb-4bit
+datasets:
+- iocuydi/amharic-alpaca
+- iocuydi/amharic-dolly-15k
 ---
+# Llama3 Amharic DPO
+[Amharic Llama3 8B Alpaca](simonbutt/am_llama3_alpaca) further DPO tuned on an amharic translated dolly-15k [dataset](https://huggingface.co/datasets/iocuydi/amharic-dolly-15k) to always respond in Amharic.
+Very token inefficient.
 - **Developed by:** simonbutt
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/llama-3-8b-bnb-4bit
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)