e-hossam96/arabic-nano-gpt-v1

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints


e-hossam96 committed about 1 month ago

Commit 62c866d • 1 Parent(s): 1965f36

Update README.md

Files changed (1)

README.md +17 -8

@@ -7,16 +7,18 @@ tags:
 model-index:
 - name: arabic-nano-gpt-v1
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # arabic-nano-gpt-v1
 This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 3.0267
 ## Model description
@@ -46,9 +48,9 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_ratio: 0.01
 - num_epochs: 24
-### Training results
-| Training Loss | Epoch   | Step   | Validation Loss |
 |:-------------:|:-------:|:------:|:---------------:|
 | 4.1743        | 0.5849  | 5000   | 3.6616          |
 | 3.6165        | 1.1698  | 10000  | 3.4256          |
@@ -90,12 +92,19 @@ The following hyperparameters were used during training:
 | 3.1775        | 22.2260 | 190000 | 3.0295          |
 | 3.1747        | 22.8109 | 195000 | 3.0284          |
 | 3.1724        | 23.3957 | 200000 | 3.0273          |
-| 3.1757        | 23.9806 | 205000 | 3.0267          |
 ### Framework versions
 - Transformers 4.45.2
 - Pytorch 2.5.0
 - Datasets 3.0.1
-- Tokenizers 0.20.1

 model-index:
 - name: arabic-nano-gpt-v1
   results: []
+datasets:
+- wikimedia/wikipedia
+language:
+- ar
 ---
 # arabic-nano-gpt-v1
 This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
+It achieves the following results on the held-out test set:
+- Loss: 3.02885
 ## Model description
 - lr_scheduler_warmup_ratio: 0.01
 - num_epochs: 24
+<!-- ### Training results -->
+<!-- | Training Loss | Epoch   | Step   | Validation Loss |
 |:-------------:|:-------:|:------:|:---------------:|
 | 4.1743        | 0.5849  | 5000   | 3.6616          |
 | 3.6165        | 1.1698  | 10000  | 3.4256          |
 | 3.1775        | 22.2260 | 190000 | 3.0295          |
 | 3.1747        | 22.8109 | 195000 | 3.0284          |
 | 3.1724        | 23.3957 | 200000 | 3.0273          |
+| 3.1757        | 23.9806 | 205000 | 3.0267          | -->
+### Training Loss
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/63ccee86374057a338e03c1e/WIQvnj-VCCBqvsUlJZ1K_.png)
+### Validation Loss
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/63ccee86374057a338e03c1e/DmTh4sIODlv1wrxXcedxL.png)
 ### Framework versions
 - Transformers 4.45.2
 - Pytorch 2.5.0
 - Datasets 3.0.1
+- Tokenizers 0.20.1
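
The losses in the updated card are easier to compare with other language models when expressed as perplexity. The following is a minimal sketch, assuming the reported values are mean per-token cross-entropy in nats (the usual convention for causal language model training with the Trainer, though the card does not state it). The constant names and the `perplexity` helper are illustrative and not part of the repository.

```python
import math

# Losses copied from the model card diff above.
TEST_LOSS = 3.02885       # held-out test set loss added in this commit
FINAL_VAL_LOSS = 3.0267   # validation loss at step 205000


def perplexity(mean_nll_nats: float) -> float:
    """Convert a mean per-token negative log-likelihood (in nats) to perplexity.

    Assumption: the loss is averaged over tokens and uses the natural log.
    """
    return math.exp(mean_nll_nats)


test_ppl = perplexity(TEST_LOSS)
val_ppl = perplexity(FINAL_VAL_LOSS)

print(f"test perplexity: {test_ppl:.2f}")
print(f"validation perplexity: {val_ppl:.2f}")
```

Under that assumption both values come out a little above 20, with the test figure slightly higher than the validation figure. Perplexity depends on the tokenizer, so it is only comparable between models that share one.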