Push model using huggingface_hub.

Files changed (3) hide show

README.md ADDED Viewed

+---
+language:
+- en
+pipeline_tag: text-generation
+tags:
+- distillation
+- model_hub_mixin
+- pytorch_model_hub_mixin
+- simple-stories
+---
+This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+- Library: https://github.com/danbraunai/simple_stories_train
+- Docs: [More Information Needed]

config.json ADDED Viewed

+{
+  "block_size": 1024,
+  "flash_attention": true,
+  "n_embd": 768,
+  "n_head": 12,
+  "n_key_value_heads": 3,
+  "n_layer": 12,
+  "rotary_dim": 64,
+  "vocab_size": 50257
+}

model.safetensors ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:106080be9e43f57776e48610bb6f7d2b6b9ec6e36760c98d3d741fa01b22d680
+size 508375792