[AUTOMATED] Model Memory Requirements
#53 opened 5 months ago
by
model-sizer-bot
File upload not working: it stays on the LibriSpeech sample. I wanted to test it on the demo page.
#52 opened 7 months ago
by
BladedSupernova
Difference in Transcription Quality Between Local Whisper Large V2 and Model Card Inference API
#51 opened 9 months ago
by
nkanaka1
add link for whisper large v3 to the readme
#49 opened 10 months ago
by
iitsg
Correct long-form generation config parameters 'max_initial_timestamp_index' and 'prev_sot_token_id'.
#47 opened 11 months ago
by
patrickvonplaten
How to Generate a .mlmodel File for Apple's CoreML Framework
#45 opened 11 months ago
by
Garry1234
Upload tokenizer
1
#43 opened 12 months ago
by
ArthurZ
OpenAI Whisper offline use for production and roadmap
#42 opened about 1 year ago
by
bahadyr
How can whisper return the language type?
2
#41 opened about 1 year ago
by
polaris16
Correct added token ids
#40 opened about 1 year ago
by
sanchit-gandhi
Fine-tuning Whisper models for shorter audio segments
#34 opened over 1 year ago
by
Malishevsky
About finetuning whisper
#33 opened over 1 year ago
by
lypspeech
Sagemaker endpoint deployment (image_uri)?
#32 opened over 1 year ago
by
MLLife
Link of model download
3
#25 opened almost 2 years ago
by
eashanchawla
Should large still exist? Or should it link to large-v2?
4
#22 opened almost 2 years ago
by
altryne
Update config for automatic language detection
2
#19 opened almost 2 years ago
by
ArthurZ
prerequisites for fine-tuning whisper model
1
#18 opened almost 2 years ago
by
Achitha
ONNX implementation
1
#17 opened almost 2 years ago
by
kirankumaram
Audio file is not transcribed after 30 second mark.
1
#16 opened almost 2 years ago
by
kirankumaram
Source of audio used to train Whisper
2
#15 opened about 2 years ago
by
mahelona
Not transcribing the audio into text (for some audios)
7
#13 opened about 2 years ago
by
uriii3
Transcription
6
#11 opened about 2 years ago
by
Spotex93
forced_decoder_ids not applied properly during generation
1
#10 opened about 2 years ago
by
minseong-ringle
Only the logits for the decoder_input_ids are returned, not for the actual input_features
4
#8 opened about 2 years ago
by
joeyontour
Decoding of 'mp3' failed
3
#6 opened about 2 years ago
by
tyatabe
Input error
4
#3 opened about 2 years ago
by
mrJezy
WhisperProcessor class import doesn't work
6
#1 opened about 2 years ago
by
mrJezy