luelhagos commited on
Commit
f7970c6
1 Parent(s): e353eb5

model updated

Browse files
Files changed (6) hide show
  1. README.md +6 -6
  2. merges.txt +0 -0
  3. pytorch_model.bin +1 -1
  4. tokenizer.json +0 -0
  5. training_args.bin +1 -1
  6. vocab.json +0 -0
README.md CHANGED
@@ -19,16 +19,16 @@ model-index:
19
  metrics:
20
  - name: Perplexity
21
  type: perplexity
22
- value: 37.35
23
  - name: Training Loss
24
  type: loss
25
- value: 3.03
26
  ---
27
 
28
  # Model Card for GPT-2 Tigrinya Medium
29
 
30
  ## Model Summary
31
- This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained on 20 million tokens, primarily from news sources. The model is specifically designed for generating Tigrinya text using the Hugging Face Transformers library.
32
 
33
  #### Model Description
34
  - Model type: GPT-2
@@ -43,12 +43,12 @@ This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained
43
  #### Training Details
44
  - Training regime: fp16 mixed precision
45
  - Number of Epochs: 12
46
- - Batch Size: 4 (with gradient accumulation steps of 8)
47
  - Learning Rate: 5e-4
48
 
49
  #### Evaluation
50
- - Training Perplexity: 37.35
51
- - Training Loss: 3.03
52
 
53
  #### Usage
54
 
 
19
  metrics:
20
  - name: Perplexity
21
  type: perplexity
22
+ value: 28.6
23
  - name: Training Loss
24
  type: loss
25
+ value: 3.12
26
  ---
27
 
28
  # Model Card for GPT-2 Tigrinya Medium
29
 
30
  ## Model Summary
31
+ This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained on 20.6 million tokens, primarily from news sources. The model is specifically designed for generating Tigrinya text using the Hugging Face Transformers library.
32
 
33
  #### Model Description
34
  - Model type: GPT-2
 
43
  #### Training Details
44
  - Training regime: fp16 mixed precision
45
  - Number of Epochs: 12
46
+ - Batch Size: 6 (with gradient accumulation steps of 8)
47
  - Learning Rate: 5e-4
48
 
49
  #### Evaluation
50
+ - Training Perplexity: 28.6
51
+ - Training Loss: 3.12
52
 
53
  #### Usage
54
 
merges.txt CHANGED
The diff for this file is too large to render. See raw diff
 
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e89ceaedec11b8d4afb4843ab0fb9bf0a9af7698284ec344bedf9cdbba02b144
3
  size 207671258
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f167374c31b3a30aacd1703be0ab4e2af2391c4c8133b342be464fa9681515b9
3
  size 207671258
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1e4e6d66f8f60a2705605b075325fcb23d5428f5038c6c42a653d4792fb38c7a
3
  size 4344
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a6487b98ea3fcc3fdb934f815119cf125fc27b4452ba1e3e3a2bdef802ffd7c
3
  size 4344
vocab.json CHANGED
The diff for this file is too large to render. See raw diff