Add Diarization to README
Browse files
README.md
CHANGED
@@ -52,6 +52,13 @@ You can upload multiple files either through the "Upload files" option, or as a
|
|
52 |
Each audio file will then be processed in turn, and the resulting SRT/VTT/Transcript will be made available in the "Download" section.
|
53 |
When more than one file is processed, the UI will also generate a "All_Output" zip file containing all the text output files.
|
54 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
55 |
## Whisper Implementation
|
56 |
|
57 |
You can choose between using `whisper` or `faster-whisper`. [Faster Whisper](https://github.com/guillaumekln/faster-whisper) as a drop-in replacement for the
|
|
|
52 |
Each audio file will then be processed in turn, and the resulting SRT/VTT/Transcript will be made available in the "Download" section.
|
53 |
When more than one file is processed, the UI will also generate a "All_Output" zip file containing all the text output files.
|
54 |
|
55 |
+
## Diarization
|
56 |
+
|
57 |
+
To detect different speakers in the audio, you can use the [whisper-diarization](https://gitlab.com/aadnk/whisper-diarization) application.
|
58 |
+
|
59 |
+
Download the JSON file after running Whisper on an audio file, and then run app.py in the
|
60 |
+
whisper-diarization repository with the audio file and the JSON file as arguments.
|
61 |
+
|
62 |
## Whisper Implementation
|
63 |
|
64 |
You can choose between using `whisper` or `faster-whisper`. [Faster Whisper](https://github.com/guillaumekln/faster-whisper) as a drop-in replacement for the
|