Identifying and tracking people and vehicles in video is valuable for forensic analysis and real-time alerting applications. However, thousands of hours of recorded footage are time-consuming and difficult to sift through for potentially threatening entities. To address this, AI models with an additional activity classification component are currently being researched. These models, known as activity detectors, are trained to detect and track objects of interest and to classify their activities. The task is inherently complex, not only because each object is unique, but also because activities can involve interactions between multiple objects (e.g., a person opening a car door). This model tackles that complexity by detecting and tracking people and vehicles in videos while classifying both their individual and interactive activities, which fall into three categories: Person only, Vehicle only, and Person-Vehicle interactions.

