marttisu-futurice committed on
Commit 4bedbab · verified · 1 Parent(s): 414d6e4

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -1,10 +1,10 @@
  ---
- language:
- - fi
- license: apache-2.0
  base_model: TurkuNLP/gpt3-finnish-xl
+ license: apache-2.0
  datasets:
  - TurkuNLP/squad_v2_fi
+ language:
+ - fi
  pipeline_tag: text-generation
  ---
 
@@ -76,7 +76,7 @@ Use pipeline with task text-generation and the recommended format.
 
  ### Training Data
 
- Trained with 10000 random samples from test data in: [TurkuNLP/squad_v2_fi](https://huggingface.co/datasets/TurkuNLP/squad_v2_fi).
+ Trained with 40000 random samples from test data in: [TurkuNLP/squad_v2_fi](https://huggingface.co/datasets/TurkuNLP/squad_v2_fi).
 
  ### Training Procedure
 
@@ -84,7 +84,7 @@ Training was done for 4-bit base model with supervised fine-tuning and Lora.
 
  #### Training Hyperparameters
 
- - **Training regime:** 4-bit, batch size 4, max steps 1000, data collator for completion only
+ - **Training regime:** 4-bit, batch size 4, max steps 20000, data collator for completion only
 
  ## Evaluation
 
@@ -94,7 +94,7 @@ Evaluation has not been done properly yet.
 
  #### Testing Data
 
- Evaluated with 500 random samples from test data in: [TurkuNLP/squad_v2_fi](https://huggingface.co/datasets/TurkuNLP/squad_v2_fi).
+ Evaluated with 1000 random samples from test data in: [TurkuNLP/squad_v2_fi](https://huggingface.co/datasets/TurkuNLP/squad_v2_fi).
 
  #### Factors
 