Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1 • 144 • 17
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1 • 144 • 17
Quantifying Generalization Complexity for Large Language Models Paper • 2410.01769 • Published Oct 2 • 13 • 2
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1 • 144 • 17
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7 • 7 • 1