Quantized version please
Thanks! I'm making one now; it'll be uploaded in a few minutes, along with a chat Space to try it out.
That's great. I'll be waiting for both the files and the chat Space.
Sorry, it failed. It'll be uploaded in 5-6 hours.
Meanwhile, you can try my humble attempt, sudhir2016/NeuralBeagle14-7B-GGUF, until the original work from the master himself is available!
Thanks @sudhir2016! The Space is now available here: https://huggingface.co/spaces/mlabonne/NeuralBeagle14-7B-GGUF-Chat (GGUF: https://huggingface.co/mlabonne/NeuralBeagle14-7B-GGUF).
@mlabonne Q4_K_M or Q5_K_M for 7B models? Is there any significant difference? I see that the Space was running the Q5 model earlier, but you switched to Q4.
Q5_K_M is slightly better in quality, but I switched to Q4_K_M because inference was too slow on a CPU.
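For anyone who wants to compare the two quants locally, here's a minimal sketch using llama-cpp-python (not my exact setup; the GGUF filename is an assumption based on the repo's usual naming convention, so check the repo's file list):

```python
# Minimal sketch: download one quant from the GGUF repo and run a
# CPU chat completion with llama-cpp-python.
# Requires: pip install llama-cpp-python huggingface_hub
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Q4_K_M: smaller and faster on CPU; Q5_K_M: slightly higher quality, slower.
model_path = hf_hub_download(
    repo_id="mlabonne/NeuralBeagle14-7B-GGUF",
    filename="neuralbeagle14-7b.Q4_K_M.gguf",  # assumed filename; verify in the repo
)

llm = Llama(
    model_path=model_path,
    n_ctx=2048,    # context window
    n_threads=8,   # tune to your CPU core count
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain GGUF quantization in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```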
Thank you @mlabonne for the GGUF version, I really like it! My request has been fulfilled, but I'm leaving the discussion open because of the questions other users have asked.
Thanks @HR1777, sure, let's keep it open.