Language Models can Self-Lengthen to Generate Long Texts • Paper • 2410.23933 • Published 28 days ago • 16
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference • Paper • 2410.21465 • Published Oct 28 • 10