lennart-finke commited on
Commit
2f63422
·
verified ·
1 Parent(s): 05726de

Push model using huggingface_hub.

Browse files
Files changed (3) hide show
  1. README.md +14 -0
  2. config.json +10 -0
  3. model.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - distillation
7
+ - model_hub_mixin
8
+ - pytorch_model_hub_mixin
9
+ - simple-stories
10
+ ---
11
+
12
+ This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
13
+ - Library: https://github.com/danbraunai/simple_stories_train
14
+ - Docs: [More Information Needed]
config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "block_size": 1024,
3
+ "flash_attention": true,
4
+ "n_embd": 768,
5
+ "n_head": 12,
6
+ "n_key_value_heads": 3,
7
+ "n_layer": 12,
8
+ "rotary_dim": 64,
9
+ "vocab_size": 50257
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:106080be9e43f57776e48610bb6f7d2b6b9ec6e36760c98d3d741fa01b22d680
3
+ size 508375792