How to use MU-NLPC/whisper-large-v2-audio-captioning with Transformers:
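# Load model directly
# Note: WhisperForAudioCaptioning may not be exported by a stock transformers
# install; if this import fails, check the model card for where the class is defined.
from transformers import AutoProcessor, WhisperForAudioCaptioning

processor = AutoProcessor.from_pretrained("MU-NLPC/whisper-large-v2-audio-captioning")
model = WhisperForAudioCaptioning.from_pretrained("MU-NLPC/whisper-large-v2-audio-captioning")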
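
Whisper-style processors work on 16 kHz mono audio, so a clip usually needs some preparation before it is handed to the processor loaded above. The following is a minimal sketch of that step using only NumPy. The helper name `to_mono_16k`, the `(channels, samples)` layout, and the linear-interpolation resampling are assumptions made for illustration and are not part of the model's documented API; a dedicated audio library would resample with proper anti-aliasing.

```python
import numpy as np

TARGET_SR = 16_000  # Whisper feature extractors operate on 16 kHz audio


def to_mono_16k(waveform, sample_rate):
    """Return a mono float32 waveform at 16 kHz (hypothetical helper)."""
    audio = np.asarray(waveform, dtype=np.float32)
    if audio.ndim == 2:
        # Assumed layout: (channels, samples); average the channels to get mono.
        audio = audio.mean(axis=0)
    if audio.ndim != 1:
        raise ValueError("expected a 1-D or (channels, samples) array")
    if sample_rate == TARGET_SR or audio.size == 0:
        return audio
    # Crude linear-interpolation resampling with no anti-aliasing filter.
    n_out = int(round(audio.size / sample_rate * TARGET_SR))
    old_t = np.arange(audio.size) / sample_rate
    new_t = np.arange(n_out) / TARGET_SR
    return np.interp(new_t, old_t, audio).astype(np.float32)
```

The resulting array could then be passed to the processor, for example as `processor(audio, sampling_rate=16000, return_tensors="pt")`, assuming the loaded processor follows the usual Whisper processor call signature.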