Automatic Speech Recognition – English 8 kHz

Model by AppTek

This model transcribes English speech from 8 kHz audio, such as telephony recordings, into text. It accepts WAV, MP3, PCM, and other popular audio formats and outputs text in JSON, XML, TEXT, or SRT format. The output includes punctuation, capitalization, timecodes, word confidence scores, and speaker diarization. Typical uses include transcribing telephone calls, meetings, interviews, and other content recorded at 8 kHz.
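
Because the model targets 8 kHz audio, it can help to confirm a recording's sample rate before submitting it. The following is a minimal sketch using only Python's standard `wave` module, so it covers WAV input only. The helper names (`wav_sample_rate`, `is_8khz_wav`, `make_silent_wav`) are hypothetical and are not part of any AppTek or Modzy API.

```python
import io
import wave

# 8 kHz, taken from the model description; everything else here is illustrative.
EXPECTED_RATE_HZ = 8000


def wav_sample_rate(source):
    """Return the sample rate in Hz of a WAV file (a path or a binary file object)."""
    with wave.open(source, "rb") as wav_file:
        return wav_file.getframerate()


def is_8khz_wav(source):
    """Return True if the WAV file is sampled at exactly 8000 Hz."""
    return wav_sample_rate(source) == EXPECTED_RATE_HZ


def make_silent_wav(rate_hz, seconds=1):
    """Build an in-memory mono 16-bit silent WAV; used only to demonstrate the check."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)      # mono
        wav_file.setsampwidth(2)      # 16-bit samples
        wav_file.setframerate(rate_hz)
        wav_file.writeframes(b"\x00\x00" * rate_hz * seconds)
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    for rate in (8000, 16000):
        verdict = "matches" if is_8khz_wav(make_silent_wav(rate)) else "does not match"
        print(f"{rate} Hz WAV {verdict} the expected 8 kHz rate")
```

A recording that fails this check would likely need resampling, or a model trained for a different sample rate; which option applies depends on what the hosting platform offers.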


Many models are available for limited use in the free Modzy Basic account.