Spaces:

lmms-lab
/

README

Running

luodian commited on 2 days ago

Commit

3b1a8e9

•

1 Parent(s): 9d6082d

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ sdk: static
 pinned: false
 ---
-- **[2024-11]** 🔥🔥 We introduce **Multimodal SAE**, the first framework designed to interpret learned features in large-scale multimodal models using Sparse Autoencoders. Through our approach, we leverage LLaVA-OneVision-72B to analyze and explain the SAE-derived features of LLaVA-NeXT-LLaMA3-8B. Furthermore, we demonstrate the ability to steer model behavior by clamping specific features to alleviate hallucinations and avoid safety-related issues.
     [GitHub](https://github.com/EvolvingLMMs-Lab/multimodal-sae)

 pinned: false
 ---
+- **[2024-11]** 🤯🤯 We introduce **Multimodal SAE**, the first framework designed to interpret learned features in large-scale multimodal models using Sparse Autoencoders. Through our approach, we leverage LLaVA-OneVision-72B to analyze and explain the SAE-derived features of LLaVA-NeXT-LLaMA3-8B. Furthermore, we demonstrate the ability to steer model behavior by clamping specific features to alleviate hallucinations and avoid safety-related issues.
     [GitHub](https://github.com/EvolvingLMMs-Lab/multimodal-sae)