A compilation of sparse auto-encoders trained on large language models.
David Louapre
dlouapre
AI & ML interests
Large Language Models, Mechanistic Interpretability, ML & Games, Education
Recent Activity
updated
a collection
5 days ago
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability
updated
a collection
5 days ago
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability
updated
a collection
5 days ago
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability