Model

This repository contains the first version of our Automatic Speech Recognition (ASR) and Subtitle Generation model, trained on 2000 hours of subtitled Flemish broadcast speech. It generates both a verbatim transcription and a subtitle transcription, each produced by a separate decoder.

Version: September 2023

Usage

This repository hosts only the pre-trained model and its configuration files. To download this model, see the instructions here.

Support for this model, as well as for our other ASR models, is integrated in our GitHub codebase. Please refer to the GitHub repository for installation instructions.

Webservice

This model can also be accessed through the webservice of the NeLF Project. After requesting access, you can upload audio or video files and have them transcribed according to your chosen settings.

Citation

If you use this model, please cite the research paper: (Will be added shortly).

Contact

Jakob Poncelet: jakob.poncelet@kuleuven.be