As video comes to dominate AI applications, from autonomous vehicles and surveillance to healthcare and entertainment, demand for video annotation services has grown sharply. Annotation helps machines not only see video content but also recognize motion, identify patterns, and infer the intent behind interactions within it.