- ContextCite: Attributing Model Generation to Context
  Paper • 2409.00729 • Published • 13
- Residual Stream Analysis with Multi-Layer SAEs
  Paper • 2409.04185 • Published
- Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models
  Paper • 2408.06663 • Published • 15
- Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2
  Paper • 2408.05147 • Published • 36
Collections including paper arxiv:2404.07129