Quantifying Generalization Complexity for Large Language Models Paper • 2410.01769 • Published Oct 2 • 13
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7 • 7
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published Jun 17 • 14
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models Paper • 2309.03883 • Published Sep 7, 2023 • 34
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Paper • 2309.10814 • Published Sep 19, 2023 • 3