Model fails with error about missing generation config

#1
by RaivisDejus - opened

When this model is used in Buzz application https://github.com/chidiwilliams/buzz/ it fails with the following error.

ValueError: You are trying to return timestamps, but the generation config is not properly set. Make sure to initialize the generation config with the correct attributes that are needed such as no_timestamps_token_id. For more details on how to generate the approtiate config, refer to https://github.com/huggingface/transformers/issues/21878#issuecomment-1451902363

Please consider adding generation config to the model repository

Finnish-NLP org

I added the generation config as I would assume it to be the same as in this repo: https://huggingface.co/Finnish-NLP/whisper-large-finnish-v3-ct2
Let me know if it is ok

Thanks @RASMUS works like a charm now.

Maybe I can also kindly ask you to add the generation config also to https://huggingface.co/Finnish-NLP/whisper-large-v2-finnish

RaivisDejus changed discussion status to closed
Finnish-NLP org

Hmm now when I look into it I think there might be some issues in the generation config:
Fi is:
"<|fi|>": 50277,
but in the config I have now:
"forced_decoder_ids": [
[
1,
null
],
[
2,
50360
]
],

Shouldn't that 2nd value be 50277?
I need to take a look into this later maybe. I have been planning to do the finetuning on more "balanced" dataset

Finnish-NLP org
edited Aug 2

Maybe it still works.
I did not see errors in the training code.
Tested locally and seems to still work

Seems to work for me as well

Sign up or log in to comment