hazyresearch
/

lolcats-llama-3.1-8b-distill

Text Generation

Model card Files Files and versions Community

lolcats-llama-3.1-8b-distill / README.md

ariG23498's picture

ariG23498 HF staff

adding a small gist to help run the demo

bd21747 verified 2 months ago

|

428 Bytes

	---
	language:
	- en
	---

	This is a pure sub-quadtratic linear attention 8B parameter model, linearized from the Meta Llama 3.1 8B model.

	Details on this model and how to train your own are provided at: https://github.com/HazyResearch/lolcats/tree/lolcats-scaled

	## Demo

	Here is a quick [GitHub GIST](https://gist.github.com/ariG23498/45b0c2afc95ca4c4b7cf64fbc161c1e7) that will help you run inference on the model checkpoints.