LoLCATs models.

#369

by jrell - opened about 1 month ago

Discussion

jrell

about 1 month ago

•

edited 30 days ago

This one might be a bit tricky.
Paper: https://hazyresearch.stanford.edu/blog/2024-10-14-lolcats-p1
Checkpoints: https://huggingface.co/collections/hazyresearch/lolcats-670ca4341699355b61238c37

These are base models, but it still would be cool to see how they work.

Thank you for your amazing quants!🫡

mradermacher

Owner 30 days ago

•

edited 30 days ago

As far as I can see, these are not even transformer models, but just the pure weights, without any of the meta data required to use them. So, without even knowing what architecture these are, conversion will be impossible. Maybe when it is supported by transformers, llama will catch up.

mradermacher changed discussion status to closed 30 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment