model updated

Files changed (6) hide show

README.md CHANGED Viewed

@@ -19,16 +19,16 @@ model-index:
     metrics:
     - name: Perplexity
       type: perplexity
-      value: 37.35
     - name: Training Loss
       type: loss
-      value: 3.03
 ---
 # Model Card for GPT-2 Tigrinya Medium
 ## Model Summary
-This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained on 20 million tokens, primarily from news sources. The model is specifically designed for generating Tigrinya text using the Hugging Face Transformers library.
 #### Model Description
 - Model type: GPT-2
@@ -43,12 +43,12 @@ This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained
 #### Training Details
 - Training regime: fp16 mixed precision
 - Number of Epochs: 12
-- Batch Size: 4 (with gradient accumulation steps of 8)
 - Learning Rate: 5e-4
 #### Evaluation
-- Training Perplexity: 37.35
-- Training Loss: 3.03
 #### Usage

     metrics:
     - name: Perplexity
       type: perplexity
+      value: 28.6
     - name: Training Loss
       type: loss
+      value: 3.12
 ---
 # Model Card for GPT-2 Tigrinya Medium
 ## Model Summary
+This is a GPT-2 model trained from scratch on Tigrinya text data. It was trained on 20.6 million tokens, primarily from news sources. The model is specifically designed for generating Tigrinya text using the Hugging Face Transformers library.
 #### Model Description
 - Model type: GPT-2
 #### Training Details
 - Training regime: fp16 mixed precision
 - Number of Epochs: 12
+- Batch Size: 6 (with gradient accumulation steps of 8)
 - Learning Rate: 5e-4
 #### Evaluation
+- Training Perplexity: 28.6
+- Training Loss: 3.12
 #### Usage

merges.txt CHANGED Viewed

The diff for this file is too large to render. See raw diff

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e89ceaedec11b8d4afb4843ab0fb9bf0a9af7698284ec344bedf9cdbba02b144
 size 207671258

 version https://git-lfs.github.com/spec/v1
+oid sha256:f167374c31b3a30aacd1703be0ab4e2af2391c4c8133b342be464fa9681515b9
 size 207671258

tokenizer.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1e4e6d66f8f60a2705605b075325fcb23d5428f5038c6c42a653d4792fb38c7a
 size 4344

 version https://git-lfs.github.com/spec/v1
+oid sha256:8a6487b98ea3fcc3fdb934f815119cf125fc27b4452ba1e3e3a2bdef802ffd7c
 size 4344

vocab.json CHANGED Viewed

The diff for this file is too large to render. See raw diff