Loading the model
#3
by PyrroAiakid - opened
I'm running into issues loading this model too. Gotta love our super helpful community, right?
Why is the context length 2048? Was it cut in half? The base Llama 2 model has a 4096 context length. If it's indeed 2048, it's not the first time a model has been massacred like that.
It's just the GQA (grouped-query attention), which should be 8 for 70B models. If you multiply 1024 by 8 you get 8192. Try adding the -gqa 8 parameter, or set gqa to 8.
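For reference, with GGML-era llama.cpp builds the flag was passed on the command line. A minimal sketch; the model filename here is hypothetical, so substitute your own:

```shell
# -gqa 8 tells the loader the 70B model uses grouped-query attention
# with 8 KV head groups, so the attention dimensions are read correctly.
# -c 4096 requests the full Llama 2 context length.
./main -m llama-2-70b.ggmlv3.q4_K_M.bin -gqa 8 -c 4096 -p "Hello"
```

Without -gqa 8, 70B GGML files fail to load or report mismatched tensor shapes, because the file format itself didn't store the GQA value.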