Resources

[AUTOMATED] Model Memory Requirements

#53 opened 5 months ago by model-sizer-bot

File upload not working: it stays on the LibriSpeech sample. I wanted to test it on the demo page.

#52 opened 7 months ago by BladedSupernova

Difference in Transcription Quality Between Local Whisper Large V2 and Model Card Inference API

#51 opened 9 months ago by nkanaka1

add link for whisper large v3 to the readme

#49 opened 10 months ago by iitsg

Correct long-form generation config parameters 'max_initial_timestamp_index' and 'prev_sot_token_id'.

#47 opened 11 months ago by patrickvonplaten

How to Generate a .mlmodel File for Apple's CoreML Framework

#45 opened 11 months ago by Garry1234

Upload tokenizer

#43 opened 12 months ago by ArthurZ

OpenAI Whisper offline use for production and roadmap

#42 opened about 1 year ago by bahadyr

How can whisper return the language type?

#41 opened about 1 year ago by polaris16

Correct added token ids

#40 opened about 1 year ago by sanchit-gandhi

Fine-tuning Whisper models for shorter audio segments

#34 opened over 1 year ago by Malishevsky

About finetuning whisper

#33 opened over 1 year ago by lypspeech

Sagemaker endpoint deployment (image_uri)?

#32 opened over 1 year ago by MLLife

Link of model download

#25 opened almost 2 years ago by eashanchawla

Should large still exist? Or should it link to large-v2?

#22 opened almost 2 years ago by altryne

Update config for automatic language detection

#19 opened almost 2 years ago by ArthurZ

Prerequisites for fine-tuning Whisper model

#18 opened almost 2 years ago by Achitha

ONNX implementation

#17 opened almost 2 years ago by kirankumaram

Audio file is not transcribed after 30 second mark.

#16 opened almost 2 years ago by kirankumaram

Source of audio used to train Whisper

#15 opened about 2 years ago by mahelona

Not transcribing the audio into text (for some audios)

#13 opened about 2 years ago by uriii3

Transcription

#11 opened about 2 years ago by Spotex93

forced_decoder_ids not applied properly during generation

#10 opened about 2 years ago by minseong-ringle

Only the logits for the decoder_input_ids are returned, not for the actual input_features

#8 opened about 2 years ago by joeyontour

Decoding of 'mp3' failed

#6 opened about 2 years ago by tyatabe

Input error

#3 opened about 2 years ago by mrJezy

WhisperProcessor class import doesn't work

#1 opened about 2 years ago by mrJezy