Linear Transformers with Learnable Kernel Functions are Better In-Context Models (arXiv:2402.10644, published Feb 16, 2024)
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss (arXiv:2402.10790, published Feb 16, 2024)
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (arXiv:2402.17764, published Feb 27, 2024)