swap-uniba
/

LLaMAntino-3-ANITA-8B-Inst-DPO-ITA

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

m-polignano-uniba commited on May 14

Commit

80717db

•

1 Parent(s): a52dca9

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -50,7 +50,7 @@ wants to provide Italian NLP researchers with an improved model for the Italian
 ## Specifications
 - **Model developers**: <br><a href="https://marcopoli.github.io/">Ph.D. Marco Polignano</a> - University of Bari Aldo Moro, Italy <br> <a href="https://huggingface.co/swap-uniba">SWAP Research Group</a> <br>
-- **Variations**: The model release has been **supervised fine-tuning (SFT)** using **QLoRA** 4bit, on two instruction-based datasets. **DPO** approach over the *jondurbin/truthy-dpo-v0.1* dataset is used to align with human preferences for helpfulness and safety.
 - **Input**: Models input text only.
 - **Language**: Multilingual 🏁 + Italian 🇮🇹
 - **Output**: Models generate text and code only.

 ## Specifications
 - **Model developers**: <br><a href="https://marcopoli.github.io/">Ph.D. Marco Polignano</a> - University of Bari Aldo Moro, Italy <br> <a href="https://huggingface.co/swap-uniba">SWAP Research Group</a> <br>
+- **Variations**: The model release has been **supervised fine-tuning (SFT)** using **QLoRA** 4bit, on two instruction-based datasets. **DPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
 - **Input**: Models input text only.
 - **Language**: Multilingual 🏁 + Italian 🇮🇹
 - **Output**: Models generate text and code only.