tim-lawson
/

mlsae-pythia-70m-deduped-x64-k32-tfm

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions Community

tim-lawson commited on Nov 19, 2024

Commit

732f3e4

·

verified ·

1 Parent(s): 338acfd

Upload folder using huggingface_hub

Files changed (2) hide show

README.md +4 -1
config.json +1 -4

README.md CHANGED Viewed

@@ -1,9 +1,12 @@
 ---
 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 ---
 This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
-- Library: [More Information Needed]
 - Docs: [More Information Needed]

 ---
+language: en
+library_name: mlsae
+license: mit
 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 ---
 This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+- Library: https://github.com/tim-lawson/mlsae
 - Docs: [More Information Needed]

config.json CHANGED Viewed

@@ -8,13 +8,10 @@
   "dead_tokens_threshold": 10000000,
   "expansion_factor": 64,
   "k": 32,
-  "layers": [
-    0
-  ],
   "lr": 0.0001,
   "max_length": 2048,
   "model_name": "EleutherAI/pythia-70m-deduped",
   "skip_special_tokens": true,
   "standardize": true,
   "tuned_lens": false
-}

   "dead_tokens_threshold": 10000000,
   "expansion_factor": 64,
   "k": 32,
   "lr": 0.0001,
   "max_length": 2048,
   "model_name": "EleutherAI/pythia-70m-deduped",
   "skip_special_tokens": true,
   "standardize": true,
   "tuned_lens": false
+}