Video annotation is a key aspect in converting unstructured visual data into structured information, which AI systems can understand, learn and utilize. Annotating video data is an important part of training machine learning (ML) models to perform tasks such as object detection, facial recognition, and activity recognition.