Seq len
#1
by
Hypersniper
- opened
So just to clarify: if the seq len of the quantized model shows 8k, does that mean I can't use the full 16k? What should I set my max tokens and truncation settings to? (text-generation-webui)
You can use it at 16K. The seq len listed there is only the sequence length of the calibration samples used during quantization; it doesn't limit the context length of the quantized model. Please see the details under "Explanation of GPTQ parameters".
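If you want to confirm the model's actual context window rather than the quantization seq len, it's recorded as `max_position_embeddings` in the repo's `config.json`. A minimal sketch (the JSON values here are illustrative, not taken from any specific model):

```python
import json

# A trimmed config.json as you'd find in a typical 16K Llama-style
# model repo (illustrative values, not from a real model card).
config_text = """
{
  "max_position_embeddings": 16384,
  "hidden_size": 4096
}
"""

config = json.loads(config_text)

# This, not the GPTQ calibration seq len, is the usable context window.
print(config["max_position_embeddings"])  # 16384
```

In text-generation-webui you'd then set the truncation length to match this value.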