tim-lawson
/

mlsae-pythia-160m-deduped-x4-k32-tfm

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions Community

Edit model card

mlsae-pythia-160m-deduped-x4-k32-tfm

A Multi-Layer Sparse Autoencoder (MLSAE) trained on the residual stream activation vectors from every layer of EleutherAI/pythia-160m-deduped with an expansion factor of 4 and k = 32, over 1 billion tokens from monology/pile-uncopyrighted. This model includes the underlying transformer.

For more details, see:

Paper: https://arxiv.org/abs/2409.04185
GitHub repository: https://github.com/tim-lawson/mlsae
Weights & Biases project: https://wandb.ai/timlawson-/mlsae

Downloads last month: 28

Safetensors

Model size

167M params

Tensor type

F32

·

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Dataset used to train tim-lawson/mlsae-pythia-160m-deduped-x4-k32-tfm

Collection including tim-lawson/mlsae-pythia-160m-deduped-x4-k32-tfm

Multi-Layer Sparse Autoencoders with Transformers

Single SAEs trained on the residual stream activation vectors from every transformer layer simultaneously (including the transformers). • 30 items • Updated Oct 7