This Colab notebook transcribe audio using Whisper by OpenAI. It uses another python package for identifying and separating speakers. In addition, it creates a TXT and SRT files do download.
In order to use:
- Run all cells and follow the order
- Upload an audio file
- Select a language (English / Hebrew / Auto mode)
- Select number of speakers
- Run rest of cells
- The transcription (including the diarization - separation by speakers) will be shown in the last cell
- The SRT and TXT files will be available for download in the sidebar (under main folder)
Please note: GPU units may be neccessary for running this! If you don't have any GPU units, you may get error of CUDA: Out of memory! Make sure to select GPU Runtime, and if possible - turn on "high RAM".
If using please make sure to citate Yuval Avidani @HACKIT.CO.IL