m-polignano-uniba
commited on
Commit
โข
80717db
1
Parent(s):
a52dca9
Update README.md
Browse files
README.md
CHANGED
@@ -50,7 +50,7 @@ wants to provide Italian NLP researchers with an improved model for the Italian
|
|
50 |
## Specifications
|
51 |
|
52 |
- **Model developers**: <br><a href="https://marcopoli.github.io/">Ph.D. Marco Polignano</a> - University of Bari Aldo Moro, Italy <br> <a href="https://huggingface.co/swap-uniba">SWAP Research Group</a> <br>
|
53 |
-
- **Variations**: The model release has been **supervised fine-tuning (SFT)** using **QLoRA** 4bit, on two instruction-based datasets. **DPO** approach over the *
|
54 |
- **Input**: Models input text only.
|
55 |
- **Language**: Multilingual ๐ + Italian ๐ฎ๐น
|
56 |
- **Output**: Models generate text and code only.
|
|
|
50 |
## Specifications
|
51 |
|
52 |
- **Model developers**: <br><a href="https://marcopoli.github.io/">Ph.D. Marco Polignano</a> - University of Bari Aldo Moro, Italy <br> <a href="https://huggingface.co/swap-uniba">SWAP Research Group</a> <br>
|
53 |
+
- **Variations**: The model release has been **supervised fine-tuning (SFT)** using **QLoRA** 4bit, on two instruction-based datasets. **DPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
|
54 |
- **Input**: Models input text only.
|
55 |
- **Language**: Multilingual ๐ + Italian ๐ฎ๐น
|
56 |
- **Output**: Models generate text and code only.
|