Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation Paper • 2412.07334 • Published 9 days ago • 16
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published 9 days ago • 70
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 21 days ago • 426