tim-lawson/mlsae-pythia-70m-deduped-x64-k32-tfm

tim-lawson committed on Nov 19, 2024

Commit 338acfd · verified · 1 Parent(s): 42a94df

Push model using huggingface_hub.

Files changed (2)

README.md +1 -4
config.json +4 -2

README.md CHANGED

@@ -1,12 +1,9 @@
 ---
-language: en
-library_name: mlsae
-license: mit
 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 ---
 This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
-- Library: https://github.com/tim-lawson/mlsae
+- Library: [More Information Needed]
 - Docs: [More Information Needed]

config.json CHANGED

@@ -8,11 +8,13 @@
   "dead_tokens_threshold": 10000000,
   "expansion_factor": 64,
   "k": 32,
-  "layers": null,
+  "layers": [
+    0
+  ],
   "lr": 0.0001,
   "max_length": 2048,
   "model_name": "EleutherAI/pythia-70m-deduped",
   "skip_special_tokens": true,
   "standardize": true,
   "tuned_lens": false
-}
+}
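
To check which config values this commit actually changes, the two sides of the hunk can be compared key by key. The following is a minimal sketch using only the Python standard library. It covers only the fields visible in the hunk above (the first lines of config.json are not shown in the diff), and the names `OLD_FRAGMENT`, `NEW_FRAGMENT`, and `changed_keys` are hypothetical helpers introduced for illustration, not part of mlsae or huggingface_hub.

```python
import json

# Fields visible in the config.json hunk, before the commit.
# Assumption: keys above line 8 of the file are unchanged and omitted here.
OLD_FRAGMENT = """
{
  "dead_tokens_threshold": 10000000,
  "expansion_factor": 64,
  "k": 32,
  "layers": null,
  "lr": 0.0001,
  "max_length": 2048,
  "model_name": "EleutherAI/pythia-70m-deduped",
  "skip_special_tokens": true,
  "standardize": true,
  "tuned_lens": false
}
"""

# The same fields after the commit: only "layers" differs.
NEW_FRAGMENT = OLD_FRAGMENT.replace('"layers": null', '"layers": [0]')

_MISSING = object()  # distinguishes an absent key from an explicit null


def changed_keys(old: dict, new: dict) -> dict:
    """Return {key: (old_value, new_value)} for every key whose value differs."""
    result = {}
    for key in sorted(set(old) | set(new)):
        before = old.get(key, _MISSING)
        after = new.get(key, _MISSING)
        if before != after:
            result[key] = (
                None if before is _MISSING else before,
                None if after is _MISSING else after,
            )
    return result


old_config = json.loads(OLD_FRAGMENT)
new_config = json.loads(NEW_FRAGMENT)
changes = changed_keys(old_config, new_config)
print(changes)
```

Under these assumptions, the only semantic change is that `layers` goes from `null` to `[0]`. The removed and re-added closing brace does not alter the parsed JSON.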