Speech Transcription

Model by Open Source

This model is an open-source speech-to-text engine. It uses a machine-learning model based on Baidu’s Deep Speech research paper. It takes a WAV file as input and outputs the transcribed speech as text.
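Because the model accepts only WAV input, it can help to inspect a file before submitting it. The following is a minimal sketch using Python's standard `wave` module; the helper name `describe_wav` is hypothetical and is not part of the model or of any Modzy API. The description above does not state a required sample rate or channel layout, so the sketch only reports the file's properties and does not enforce any.

```python
import wave


def describe_wav(path_or_file):
    """Return basic header properties of a WAV file.

    `describe_wav` is an illustrative helper, not part of the model's API.
    Raises wave.Error if the input is not a readable PCM WAV file.
    """
    with wave.open(path_or_file, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": rate,
            # Duration follows from frame count divided by sample rate.
            "duration_s": frames / rate if rate else 0.0,
        }
```

Calling `describe_wav("call_recording.wav")` (a placeholder file name) returns a dictionary of header fields that can be compared against whatever input requirements the deployed model documents.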

This model can be used to transcribe recorded speech, for example to analyze customer service phone calls or to convert spoken messages into email or text messages.


Many models are available for limited use in the free Modzy Basic account.