e-palmisano commited on
Commit
9452cf0
1 Parent(s): 70219be

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -2
README.md CHANGED
@@ -2,6 +2,7 @@
2
  base_model: model
3
  language:
4
  - en
 
5
  license: apache-2.0
6
  tags:
7
  - text-generation-inference
@@ -9,14 +10,34 @@ tags:
9
  - unsloth
10
  - qwen2
11
  - trl
 
 
 
12
  ---
13
 
 
 
 
 
 
14
  # Uploaded model
15
 
16
  - **Developed by:** e-palmisano
17
  - **License:** apache-2.0
18
- - **Finetuned from model :** model
 
 
 
 
 
 
 
 
 
 
 
 
19
 
20
  This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
21
 
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
2
  base_model: model
3
  language:
4
  - en
5
+ - it
6
  license: apache-2.0
7
  tags:
8
  - text-generation-inference
 
10
  - unsloth
11
  - qwen2
12
  - trl
13
+ datasets:
14
+ - gsarti/clean_mc4_it
15
+ - FreedomIntelligence/alpaca-gpt4-italian
16
  ---
17
 
18
+ This model has been fine-tuned with the continuous pretraining mode of Unsloth on gsarti/clean_mc4_it dataset (only 100k row).
19
+ The second fine-tuning was performed on the instructed dataset FreedomIntelligence/alpaca-gpt4-italian.
20
+
21
+
22
+
23
  # Uploaded model
24
 
25
  - **Developed by:** e-palmisano
26
  - **License:** apache-2.0
27
+ - **Finetuned from model :** unsloth/Qwen2-1.5B-Instruct-bnb-4bit
28
+
29
+ ## Evaluation
30
+
31
+ For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
32
+
33
+ Here's a breakdown of the performance metrics:
34
+
35
+ | Metric | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
36
+ |:----------------------------|:----------------------|:----------------|:---------------------|:--------|
37
+ | **Accuracy Normalized** | | | 0.4689 | |
38
+
39
+
40
 
41
  This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
42
 
43
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)