--- license: cc-by-4.0 language: - is datasets: - language-and-voice-lab/samromur_asr - language-and-voice-lab/samromur_children - language-and-voice-lab/malromur_asr - language-and-voice-lab/althingi_asr tags: - audio - automatic-speech-recognition - icelandic - whisper - whisper-large - iceland - reykjavik - samromur - faster-whisper --- # whisper-large-icelandic-30k-steps-1000h-ct2 This is a faster-whisper version of [language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h](https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h). The model was created like described in [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master): ```bash ct2-transformers-converter --model language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h \ --output_dir whisper-large-icelandic-30k-steps-1000h-ct2 \ --quantization float16 ``` # Usage ```python from faster_whisper import WhisperModel model_size = "whisper-large-icelandic-30k-steps-1000h-ct2" # Run on GPU with FP16 model = WhisperModel(model_size, device="cuda", compute_type="float16") # or run on GPU with INT8 # model = WhisperModel(model_size, device="cuda", compute_type="int8_float16") # or run on CPU with INT8 # model = WhisperModel(model_size, device="cpu", compute_type="int8") segments, info = model.transcribe("audio.mp3", beam_size=5) print("Detected language '%s' with probability %f" % (info.language, info.language_probability)) for segment in segments: print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text)) ``` # BibTeX entry and citation info *When publishing results based on these models please refer to:* ```bibtex @misc{gunnarsson2023whisperlarge30kicelandicct2, title={Acoustic Model in Icelandic: whisper-large-icelandic-30k-steps-1000h-ct2.}, author={Gunnarsson, Thorsteinn Dadi and Hernandez Mena, Carlos Daniel}, url={https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h-ct2}, year={2023} } ```