Sometimes I've empty answer
#13
by
Yarflam
- opened
The model replies in general, but sometimes nothing is generate.
I started my talk with a basic preprompt without any problem:
And 6 messages later:
I don't see any error in my runpod instance:
I've the impression that this happens more frequently when the end doesn't finish with punctuation (to be verified, of course).
The model is running with minimum parameters:
outputs = self.model.generate(
encodeds.to(self.device),
num_return_sequences=options['n'],
max_new_tokens=options['max_new_tokens'],
do_sample=options['do_sample'],
temperature=options['temperature'],
top_p=options['top_p'],
pad_token_id=self.tokenizer.eos_token_id
)
If you have any idea about the reasons or debug approach to determine the bug.