pszemraj/griffin-1024-llama3t-8layer-simplewiki-silu

Text Generation

recurrent_gemma

Generated from Trainer

pszemraj committed on Apr 26

Commit daeb05f • 1 Parent(s): adf1478

Update README.md

Files changed (1)

README.md +5 -2

@@ -7,6 +7,9 @@ metrics:
 model-index:
 - name: griffin-1024-llama3t-8layer-simple_wikipedia_LM-vN
   results: []
+license: apache-2.0
+datasets:
+- pszemraj/simple_wikipedia_LM
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -14,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 # griffin-1024-llama3t-8layer-simple_wikipedia_LM-vN
-This model is a fine-tuned version of [griffin-1024-llama3t-8layer](https://huggingface.co/griffin-1024-llama3t-8layer) on the pszemraj/simple_wikipedia_LM dataset.
+pretraining experiment on the pszemraj/simple_wikipedia_LM dataset.
 It achieves the following results on the evaluation set:
 - Loss: 4.3584
 - Accuracy: 0.3789
@@ -66,4 +69,4 @@ The following hyperparameters were used during training:
 - Transformers 4.40.1
 - Pytorch 2.3.0+cu121
 - Datasets 2.19.0
-- Tokenizers 0.19.1
+- Tokenizers 0.19.1