Speech-to-Text Converter using Python
The Speech-to-Text Converter tool in Python converts spoken audio into readable text, supporting multilingual speech such as Arabic, Russian, Spanish, French, English, Chinese, and Japanese. You can also download a text file of converted text. The libraries I have used are: whisper, tempfile, pydub, etc.
Audio file types supported are MP3, WAV, M4A, FLAC, and OGG.
Validations used in the Speech-to-Text Converter tool are
- Audio Duration Validation
- Supported Audio File Format Validation
- Audio File Size Validation, etc.
You will get a browse files button, with which you can upload an audio file, after uploading you can see a transcribe audio button. After the audio has been transcribed, you will see a text interface where the transcribed text is displayed, and you will also have a download as TXT button to download the transcribed text as a text file.