Back to model community

Automatic Speech Recognition - Russian 16 KHZ - powered by Modzy MLOps platform for Enterprise and Edge AI

Automatic Speech Recognition – Russian 16 KHZ

Model by AppTek

This model converts speech from Russian language 16Khz audio telephony files into text. It accepts audio files including MP4, WMV, WAV, MP3, and other popular formats and outputs text in JSON, XML, TEXT or SRT formats. The model includes punctuation, capitalization, timecodes, word confidence score, and speaker diarization. This model can be used to transcribe news channels along with media and entertainment, create rich metadata from media archives, and to generate captions from audio and video sources.


See the model in action with a Modzy MLOps platform demo or start a trial