Disabling/Reducing model reasoning
#22 opened by Abdallah1997
I have important CoT prompts that guide the LLM in how to think. Using them leads to high latency and large token outputs, so I'd like to reduce the model's internal reasoning for those reasons.
Abdallah1997 changed discussion title from "Disabling/Reducing reasoning" to "Disabling/Reducing model reasoning"
We hear the ask; you are not alone. We will add it in the next version.
Ideally there would also be a non-thinking version, or a non-thinking switch, to keep the model responsive for local usage on consumer hardware, or when latency is key to the application (such as using text-to-speech to hold a conversation).
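For reference, some open reasoning models already expose exactly this kind of switch through their chat template. Below is a minimal sketch of that pattern, assuming a model whose template accepts an `enable_thinking` flag (the Qwen3 family documents this); the flag name and the checkpoint used here are illustrative of how such a switch could look, not a confirmed feature of this model.

```python
# Sketch: skipping "thinking" via a chat-template flag, assuming the model's
# template supports one (Qwen3's does; this model may not yet).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen3-8B"  # illustrative checkpoint with an enable_thinking switch
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Give me a one-sentence weather report."}]

# enable_thinking=False asks the template to omit the <think>...</think> block,
# trading reasoning depth for lower latency and shorter outputs.
prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```

Until a native switch like this exists, capping `max_new_tokens` or instructing the model in the system prompt to answer directly can partially bound reasoning length, though neither is as reliable as a real template-level flag.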