	---
	license: cc-by-4.0
	language:
	- is
	datasets:
	- language-and-voice-lab/samromur_asr
	- language-and-voice-lab/samromur_children
	- language-and-voice-lab/malromur_asr
	- language-and-voice-lab/althingi_asr
	tags:
	- audio
	- automatic-speech-recognition
	- icelandic
	- whisper
	- whisper-large
	- iceland
	- reykjavik
	- samromur
	- faster-whisper
	---
	# whisper-large-icelandic-30k-steps-1000h-ct2

	This is a faster-whisper version of [language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h](https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h).

	The model was converted as described in the [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master) documentation:

	```bash
	ct2-transformers-converter --model language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h \
	--output_dir whisper-large-icelandic-30k-steps-1000h-ct2 \
	--quantization float16
	```
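
The same conversion can also be scripted from Python. The following is a minimal sketch, assuming the `ctranslate2` and `transformers` packages are installed; the helper name `convert_model` and the `SUPPORTED_QUANTIZATIONS` set are illustrative and not part of the original card, and the list of quantization names should be checked against the CTranslate2 documentation.

```python
# Quantization names accepted by CTranslate2 at the time of writing
# (assumption: verify against the CTranslate2 quantization documentation).
SUPPORTED_QUANTIZATIONS = {
    "int8", "int8_float32", "int8_float16", "int8_bfloat16",
    "int16", "float16", "bfloat16", "float32",
}

SOURCE_MODEL = "language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h"
OUTPUT_DIR = "whisper-large-icelandic-30k-steps-1000h-ct2"


def convert_model(model_id=SOURCE_MODEL, output_dir=OUTPUT_DIR, quantization="float16"):
    """Convert a Transformers Whisper checkpoint to the CTranslate2 format."""
    # Fail early on a typo instead of after the checkpoint has been downloaded.
    if quantization not in SUPPORTED_QUANTIZATIONS:
        raise ValueError("unsupported quantization: %r" % quantization)

    # Imported lazily so this module loads even where ctranslate2 is missing.
    from ctranslate2.converters import TransformersConverter

    converter = TransformersConverter(model_id)
    # convert() writes the model files and returns the output directory path.
    return converter.convert(output_dir, quantization=quantization)
```

Calling `convert_model()` with the defaults should produce the same directory as the command above; note that it downloads the full source checkpoint first.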

	# Usage

	```python
	from faster_whisper import WhisperModel

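	# Path to the local directory that holds the converted model
	# (the --output_dir of the conversion command above)
	model_size = "whisper-large-icelandic-30k-steps-1000h-ct2"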

	# Run on GPU with FP16
	model = WhisperModel(model_size, device="cuda", compute_type="float16")

	# or run on GPU with INT8
	# model = WhisperModel(model_size, device="cuda", compute_type="int8_float16")
	# or run on CPU with INT8
	# model = WhisperModel(model_size, device="cpu", compute_type="int8")

	segments, info = model.transcribe("audio.mp3", beam_size=5)

	print("Detected language '%s' with probability %f" % (info.language, info.language_probability))

	# segments is a generator: transcription runs as it is iterated
	for segment in segments:
	    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
	```
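
Since this model is fine-tuned for Icelandic, language detection can be skipped by passing `language="is"` to `transcribe`. The sketch below wraps the steps above in small helpers; `pick_compute_type`, `format_segment`, and `transcribe_icelandic` are hypothetical names introduced here for illustration, and `MODEL_DIR` is assumed to be the local directory produced by the conversion.

```python
# Assumption: the converted model sits in this local directory.
MODEL_DIR = "whisper-large-icelandic-30k-steps-1000h-ct2"


def pick_compute_type(device):
    """Mirror the options shown above: FP16 on GPU, INT8 on CPU."""
    return "float16" if device == "cuda" else "int8"


def format_segment(start, end, text):
    """Render one segment in the same layout as the print call above."""
    return "[%.2fs -> %.2fs] %s" % (start, end, text.strip())


def transcribe_icelandic(audio_path, device="cuda"):
    """Transcribe an audio file as Icelandic and return formatted lines."""
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(MODEL_DIR, device=device,
                         compute_type=pick_compute_type(device))
    # language="is" fixes the language instead of relying on auto-detection.
    segments, _info = model.transcribe(audio_path, beam_size=5, language="is")
    # Iterating the generator is what actually runs the transcription.
    return [format_segment(s.start, s.end, s.text) for s in segments]
```

For example, `transcribe_icelandic("audio.mp3", device="cpu")` would run the INT8 path on CPU and return one formatted string per segment.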

	# BibTeX entry and citation info
	When publishing results based on this model, please cite:
	```bibtex
	@misc{gunnarsson2023whisperlarge30kicelandicct2,
	title={Acoustic Model in Icelandic: whisper-large-icelandic-30k-steps-1000h-ct2.},
	author={Gunnarsson, Thorsteinn Dadi and Hernandez Mena, Carlos Daniel},
	url={https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h-ct2},
	year={2023}
	}
	```