Llama.cpp fixes have been merged, requires gguf regen
#5
by
RamoreRemora
- opened
A few hours ago, llama.cpp merged fixes for Gemma 2.
Unfortunately, all GGUF quants have to be regenerated for both the 9B and 27B models:
https://github.com/ggerganov/llama.cpp/pull/8197
Thanks again for your amazing work!
They don't strictly have to be regenerated, since the missing metadata will be populated with default values - but I'm doing it anyway for completeness, and because the imatrix quants should improve in quality with these changes
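For anyone redoing their own quants, the regeneration loop looks roughly like the sketch below. This is a hedged outline, not the uploader's exact pipeline: the model directory, calibration file, and output names are placeholders, and the script/binary names (`convert-hf-to-gguf.py`, `llama-imatrix`, `llama-quantize`) assume a llama.cpp checkout recent enough to include the Gemma 2 fixes.

```shell
# Placeholder paths throughout; adjust to your local setup.
git -C llama.cpp pull                      # pick up the merged Gemma 2 fixes

# 1. Re-convert the HF checkpoint so the GGUF carries the corrected metadata
python llama.cpp/convert-hf-to-gguf.py gemma-2-9b/ \
    --outfile gemma-2-9b-f16.gguf

# 2. Recompute the importance matrix against the fixed compute graph
./llama.cpp/llama-imatrix -m gemma-2-9b-f16.gguf \
    -f calibration.txt -o gemma-2-9b.imatrix

# 3. Quantize using the fresh imatrix (repeat per quant type, e.g. Q4_K_M)
./llama.cpp/llama-quantize --imatrix gemma-2-9b.imatrix \
    gemma-2-9b-f16.gguf gemma-2-9b-Q4_K_M.gguf Q4_K_M
```

Redoing step 2 is the part that should matter most for quality here: an imatrix computed on the broken graph would bake the old behavior into the new quants.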
They've been remade!
My download was at 97% on a ~1 Mb/s connection 🥹 - but thanks for your work! 🫡