Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -1,10 +1,10 @@
|
|
1 |
---
|
2 |
-
language:
|
3 |
-
- fi
|
4 |
-
license: apache-2.0
|
5 |
base_model: TurkuNLP/gpt3-finnish-xl
|
|
|
6 |
datasets:
|
7 |
- TurkuNLP/squad_v2_fi
|
|
|
|
|
8 |
pipeline_tag: text-generation
|
9 |
---
|
10 |
|
@@ -76,7 +76,7 @@ Use pipeline with task text-generation and the recommended format.
|
|
76 |
|
77 |
### Training Data
|
78 |
|
79 |
-
Trained with
|
80 |
|
81 |
### Training Procedure
|
82 |
|
@@ -84,7 +84,7 @@ Training was done for 4-bit base model with supervised fine-tuning and Lora.
|
|
84 |
|
85 |
#### Training Hyperparameters
|
86 |
|
87 |
-
- **Training regime:** 4-bit, batch size 4, max steps
|
88 |
|
89 |
## Evaluation
|
90 |
|
@@ -94,7 +94,7 @@ Evaluation has not been done properly yet.
|
|
94 |
|
95 |
#### Testing Data
|
96 |
|
97 |
-
Evaluated with
|
98 |
|
99 |
#### Factors
|
100 |
|
|
|
1 |
---
|
|
|
|
|
|
|
2 |
base_model: TurkuNLP/gpt3-finnish-xl
|
3 |
+
license: apache-2.0
|
4 |
datasets:
|
5 |
- TurkuNLP/squad_v2_fi
|
6 |
+
language:
|
7 |
+
- fi
|
8 |
pipeline_tag: text-generation
|
9 |
---
|
10 |
|
|
|
76 |
|
77 |
### Training Data
|
78 |
|
79 |
+
Trained with 40000 random samples from test data in: [TurkuNLP/squad_v2_fi](https://huggingface.co/datasets/TurkuNLP/squad_v2_fi).
|
80 |
|
81 |
### Training Procedure
|
82 |
|
|
|
84 |
|
85 |
#### Training Hyperparameters
|
86 |
|
87 |
+
- **Training regime:** 4-bit, batch size 4, max steps 20000, data collator for completion only
|
88 |
|
89 |
## Evaluation
|
90 |
|
|
|
94 |
|
95 |
#### Testing Data
|
96 |
|
97 |
+
Evaluated with 1000 random samples from test data in: [TurkuNLP/squad_v2_fi](https://huggingface.co/datasets/TurkuNLP/squad_v2_fi).
|
98 |
|
99 |
#### Factors
|
100 |
|