Back to model community

Video Captioning - powered by Modzy MLOps platform for Enterprise and Edge AI

Video Captioning

Model by Open Source

This model gives a one sentence description of a short video clip. It accepts a short video in MP4 or MPEG format as input. The output is a text description of the contents of the video. This model can be used in to identify content of unseen videos by appropriately describing it. This makes it possible to quickly search a large collection of videos using established text search and retrieval tools.


See the model in action with a Modzy MLOps platform demo or start a trial