Llama 3 using a fixed seed?
Whenever I regenerate responses while using Llama 3, it outputs exactly the same response every time. I think it's using a fixed seed. I tried it on groq.com and there the responses were different each time.
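If you want to reproduce this behavior locally, here's a minimal sketch (assuming the Hugging Face `transformers` library; the checkpoint name is just the public Llama 3 instruct id) of how a fixed seed pins the sampled output:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

inputs = tok("Write a haiku about the sea.", return_tensors="pt")

for _ in range(3):
    torch.manual_seed(42)  # resetting the seed before each call pins the RNG state
    out = model.generate(**inputs, do_sample=True, max_new_tokens=40)
    print(tok.decode(out[0], skip_special_tokens=True))

# All three generations come out identical; drop the manual_seed call
# and the sampled responses differ from run to run.
```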
I've noticed that Command R+ occasionally generates very similar or identical responses too, but I think that's just a bug.
Sounds like it could be a low temperature (randomness) setting. Good for precision if it doesn't hallucinate. Bad for creative writing or open-ended questions.
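A toy illustration of that (plain NumPy, no real model involved): temperature divides the logits before the softmax, so low values pile almost all of the probability onto the single top token, which makes every regeneration pick the same thing.

```python
import numpy as np

def softmax(logits, temperature):
    # Temperature scales the logits before normalizing into probabilities.
    z = np.asarray(logits) / temperature
    e = np.exp(z - z.max())  # subtract max for numerical stability
    return e / e.sum()

logits = [4.0, 2.0, 1.0]  # hypothetical scores for three candidate tokens
print(softmax(logits, temperature=1.0))  # ~[0.84, 0.11, 0.04]: some variety
print(softmax(logits, temperature=0.2))  # ~[1.00, 0.00, 0.00]: near-deterministic
```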
@EveryPizza Just add an instruction at the top like "You always generate Unique responses" or "Must create unique response every time"
That won't have much of an effect, because the model isn't told what the last sent message was.
@EveryPizza My assistants also use Llama 3, but they don't repeat themselves, thanks to the system prompt.
Model Link-> https://hf.co/chat/assistant/6612cb237c1e770b75c5ebad
A workaround for this is to use an assistant with less restrictive parameters.
Don't use a low temperature if you want varied responses; decrease it only if the output turns into gibberish or goes "off the rails".
Avoid a low Top P; values slightly below 1 should be safe.
Repetition penalty is a tricky one; decrease it if the output starts degrading into nonsense.
Top-K is straightforward in how it works: use lower values for more predictable output.
The options look like this, but it will take some trial and error to find the "right" values (the sketch after this post shows roughly how they map onto a sampling call).
Use whatever you like for a system prompt for your use case.
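For reference, here's roughly how those knobs map onto a sampling call if you run the model yourself (a sketch assuming the Hugging Face `transformers` API; the values are illustrative starting points, not recommendations):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
inputs = tok("Tell me a short story.", return_tensors="pt")

out = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.9,         # higher -> more varied responses
    top_p=0.95,              # slightly below 1 should be safe
    top_k=50,                # lower -> more predictable output
    repetition_penalty=1.1,  # lower it if output degrades into nonsense
    max_new_tokens=256,
)
print(tok.decode(out[0], skip_special_tokens=True))
```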
@LostSpirit thanks for the info! I wish it was possible to customize these parameters in regular chat because assistants don't have the web search toggle (it's either permanently enabled or disabled).
Is there a canonical list of these parameters for Llama 3 somewhere, with min/max/default values?