Tokenizer not available (OSError: Can't load tokenizer)
#1, opened by jwidmer
Hello
Thanks for your work on the Sinhala Whisper model. I want to use it for a friend, but when loading it with transformers.pipeline(), I'm getting an OSError (Can't load tokenizer).
There is a discussion about this: https://discuss.huggingface.co/t/tokenizer-not-created-when-training-whisper-small-model/61876
Do you think you could add the tokenizer to the repository?
Greetings and thanks,
jonas
I added the tokenizer.
Thanks a lot! Tokenizer loading works now.
Now I get: OSError: RRashmini/whisper-small-sinhala-26 does not appear to have a file named preprocessor_config.json.
I'm running the following code:
import torch  # needed for torch.float16 below
from transformers import pipeline

whisper = pipeline(
    "automatic-speech-recognition",
    "RRashmini/whisper-small-sinhala-26",
    torch_dtype=torch.float16,
)
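Until a preprocessor_config.json is added to the repository, one possible workaround is to borrow the feature extractor from the base checkpoint and pass it to the pipeline explicitly. This is only a sketch: it assumes the model was fine-tuned from openai/whisper-small (suggested by the repository name, but not confirmed in this thread), and the helper name build_asr_pipeline is made up for illustration.

```python
# Assumption: the fine-tuned model shares its audio preprocessing with this base checkpoint.
BASE_CHECKPOINT = "openai/whisper-small"
MODEL_ID = "RRashmini/whisper-small-sinhala-26"


def build_asr_pipeline(model_id=MODEL_ID, base_checkpoint=BASE_CHECKPOINT):
    """Build an ASR pipeline, taking the feature extractor from the base checkpoint."""
    # Imported lazily so that defining this helper does not load the heavy dependencies.
    import torch
    from transformers import WhisperFeatureExtractor, pipeline

    # Load preprocessor_config.json from the base checkpoint instead of the fine-tuned repo.
    feature_extractor = WhisperFeatureExtractor.from_pretrained(base_checkpoint)

    return pipeline(
        "automatic-speech-recognition",
        model=model_id,
        feature_extractor=feature_extractor,
        torch_dtype=torch.float16,
    )
```

Calling build_asr_pipeline() should then return a pipeline that can be used like the one above. The longer-term fix would still be for the model author to upload preprocessor_config.json to the repository, as was done for the tokenizer.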