RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 5 days ago • 42
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper • 2411.10958 • Published 7 days ago • 42