luelhagos
commited on
Commit
•
f7970c6
1
Parent(s):
e353eb5
model updated
Browse files- README.md +6 -6
- merges.txt +0 -0
- pytorch_model.bin +1 -1
- tokenizer.json +0 -0
- training_args.bin +1 -1
- vocab.json +0 -0
README.md
CHANGED
@@ -19,16 +19,16 @@ model-index:
|
|
19 |
metrics:
|
20 |
- name: Perplexity
|
21 |
type: perplexity
|
22 |
-
value:
|
23 |
- name: Training Loss
|
24 |
type: loss
|
25 |
-
value: 3.
|
26 |
---
|
27 |
|
28 |
# Model Card for GPT-2 Tigrinya Medium
|
29 |
|
30 |
## Model Summary
|
31 |
-
This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained on 20 million tokens, primarily from news sources. The model is specifically designed for generating Tigrinya text using the Hugging Face Transformers library.
|
32 |
|
33 |
#### Model Description
|
34 |
- Model type: GPT-2
|
@@ -43,12 +43,12 @@ This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained
|
|
43 |
#### Training Details
|
44 |
- Training regime: fp16 mixed precision
|
45 |
- Number of Epochs: 12
|
46 |
-
- Batch Size:
|
47 |
- Learning Rate: 5e-4
|
48 |
|
49 |
#### Evaluation
|
50 |
-
- Training Perplexity:
|
51 |
-
- Training Loss: 3.
|
52 |
|
53 |
#### Usage
|
54 |
|
|
|
19 |
metrics:
|
20 |
- name: Perplexity
|
21 |
type: perplexity
|
22 |
+
value: 28.6
|
23 |
- name: Training Loss
|
24 |
type: loss
|
25 |
+
value: 3.12
|
26 |
---
|
27 |
|
28 |
# Model Card for GPT-2 Tigrinya Medium
|
29 |
|
30 |
## Model Summary
|
31 |
+
This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained on 20.6 million tokens, primarily from news sources. The model is specifically designed for generating Tigrinya text using the Hugging Face Transformers library.
|
32 |
|
33 |
#### Model Description
|
34 |
- Model type: GPT-2
|
|
|
43 |
#### Training Details
|
44 |
- Training regime: fp16 mixed precision
|
45 |
- Number of Epochs: 12
|
46 |
+
- Batch Size: 6 (with gradient accumulation steps of 8)
|
47 |
- Learning Rate: 5e-4
|
48 |
|
49 |
#### Evaluation
|
50 |
+
- Training Perplexity: 28.6
|
51 |
+
- Training Loss: 3.12
|
52 |
|
53 |
#### Usage
|
54 |
|
merges.txt
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 207671258
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f167374c31b3a30aacd1703be0ab4e2af2391c4c8133b342be464fa9681515b9
|
3 |
size 207671258
|
tokenizer.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4344
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8a6487b98ea3fcc3fdb934f815119cf125fc27b4452ba1e3e3a2bdef802ffd7c
|
3 |
size 4344
|
vocab.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|