Long Context - a Julius-L Collection

Julius-L 's Collections

Memory Efficient Training

Model Architecture

LLM Technical Reports

Long Context

updated 25 days ago

Why Does the Effective Context Length of LLMs Fall Short?

Paper • 2410.18745 • Published Oct 24 • 16
Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published 28 days ago • 16
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Paper • 2410.21465 • Published Oct 28 • 10