hazyresearch/lolcats-llama-3.1-8b-distill

Text Generation


This is a pure sub-quadratic, linear-attention model with 8B parameters, linearized from the Meta Llama 3.1 8B model.
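To make "sub-quadratic linear attention" concrete, here is a minimal sketch in plain Python. It is not the LoLCATS implementation: the `feature_map` below (elu(x) + 1) is an assumed stand-in for the feature maps the model actually learns, and both function names are hypothetical. It only illustrates why a kernelized attention can be computed with a running state, so that cost grows linearly with sequence length instead of quadratically.

```python
import math


def feature_map(x):
    # Assumed positive feature map, elu(x) + 1. The real model learns its own.
    return [v + 1.0 if v > 0 else math.exp(v) for v in x]


def quadratic_causal_attention(Q, K, V):
    """O(n^2) reference: each query explicitly weights every earlier token."""
    d_v = len(V[0])
    out = []
    for i, q in enumerate(Q):
        fq = feature_map(q)
        weights = [
            sum(a * b for a, b in zip(fq, feature_map(K[j])))
            for j in range(i + 1)
        ]
        norm = sum(weights)
        out.append(
            [sum(w * V[j][d] for j, w in enumerate(weights)) / norm
             for d in range(d_v)]
        )
    return out


def linear_causal_attention(Q, K, V):
    """O(n) version: keep running sums S = sum phi(k) v^T and z = sum phi(k)."""
    d_k, d_v = len(Q[0]), len(V[0])
    S = [[0.0] * d_v for _ in range(d_k)]  # running key-value state
    z = [0.0] * d_k                        # running normalizer state
    out = []
    for q, k, v in zip(Q, K, V):
        fq, fk = feature_map(q), feature_map(k)
        # Fold the current token into the state before reading from it.
        for a in range(d_k):
            z[a] += fk[a]
            for b in range(d_v):
                S[a][b] += fk[a] * v[b]
        norm = sum(fq[a] * z[a] for a in range(d_k))
        out.append(
            [sum(fq[a] * S[a][b] for a in range(d_k)) / norm
             for b in range(d_v)]
        )
    return out
```

Both functions return the same outputs up to floating-point error. The second one never materializes the n-by-n attention matrix, and its state has a fixed size regardless of how many tokens have been processed.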

Details on this model and how to train your own are provided at: https://github.com/HazyResearch/lolcats/tree/lolcats-scaled

Demo

A short GitHub Gist is available to help you run inference on the model checkpoints.

Paper

See the paper page: https://huggingface.co/papers/2410.10254


Collection including hazyresearch/lolcats-llama-3.1-8b-distill

LoLCATS

Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family (8B, 70B, and 405B) for the first time! 4 items, updated Oct 14, 2024.