GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6 • 182 • 15
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 603 • 142
Seeing through the Brain: Image Reconstruction of Visual Perception from Human Brain Signals Paper • 2308.02510 • Published Jul 27, 2023 • 21 • 3
Accelerating LLM Inference with Staged Speculative Decoding Paper • 2308.04623 • Published Aug 8, 2023 • 23 • 4