A lot of "G"s

by tastypear - opened Jul 2

Discussion

tastypear

Jul 2

I'm using the latest llama.cpp server to load this model, and it outputs "GGGGGGGGG..." (the same happens with the Very_Berry_Qwen2_7B GGUF).

Maybe it's a LoRA-related bug; I'm not sure.

jeiku

Owner Jul 2

I am running it fine in Llama.RN on mobile. Are you sure your llama.cpp is up to date?

jeiku

Owner Jul 2

edited Jul 2

@tastypear you could also try the official quants from mradermacher. The ones on my page are just meant for debugging and are generated automatically.

https://huggingface.co/mradermacher/Very_Berry_Qwen2_7B-i1-GGUF

Berry v2 is in the queue now.

tastypear

Jul 2

Oh, I figured out the issue; it's not a problem with the model.

When loading using the llama.cpp server, it requires the -fa argument (flash attention).

Anyway, thank you for your response 🤗
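
For anyone landing here with the same "GGGG..." output, here is a minimal sketch of the fix described above: launching the llama.cpp server with the -fa (flash attention) argument. The binary name `llama-server`, the helper `build_server_command`, and the model path are assumptions for illustration and are not taken from this thread. The exact flag syntax can also differ between llama.cpp versions, so check `llama-server --help` on your build.

```python
import shlex


def build_server_command(model_path, flash_attention=True, binary="llama-server"):
    """Build the argument list for launching the llama.cpp server.

    `binary` and `model_path` are placeholders (assumptions); adjust them
    to match your local build and the GGUF file you downloaded.
    """
    args = [binary, "-m", model_path]
    if flash_attention:
        # -fa enables flash attention, the argument reported above as the fix
        args.append("-fa")
    return args


# Hypothetical model path, for illustration only
cmd = build_server_command("path/to/model.gguf")

# Print a shell-ready command line that can be copied into a terminal
print(shlex.join(cmd))
```

To actually start the server from Python instead of copying the printed line, the same list can be passed to `subprocess.run(cmd)`.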
