tim-lawson 's Collections

Multi-Layer SAEs with Tuned Lens and Transformers

Single SAEs trained on the residual stream activation vectors from every layer simultaneously using tuned lenses, including the transformers.