If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Paper • 2412.04144 • Published 14 days ago • 3
Source-Aware Training Enables Knowledge Attribution in Language Models Paper • 2404.01019 • Published Apr 1 • 1
Discriminator-Guided Multi-step Reasoning with Language Models Paper • 2305.14934 • Published May 24, 2023 • 1