Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,7 @@ sdk: static
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
- **[2024-11]**
|
11 |
|
12 |
[GitHub](https://github.com/EvolvingLMMs-Lab/multimodal-sae)
|
13 |
|
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
+
- **[2024-11]** 🤯🤯 We introduce **Multimodal SAE**, the first framework designed to interpret learned features in large-scale multimodal models using Sparse Autoencoders. Through our approach, we leverage LLaVA-OneVision-72B to analyze and explain the SAE-derived features of LLaVA-NeXT-LLaMA3-8B. Furthermore, we demonstrate the ability to steer model behavior by clamping specific features to alleviate hallucinations and avoid safety-related issues.
|
11 |
|
12 |
[GitHub](https://github.com/EvolvingLMMs-Lab/multimodal-sae)
|
13 |
|