In the contemporary digital landscape, the volume of visual data is experiencing rapid growth, becoming a fundamental component across various sectors, including healthcare and autonomous vehicles. To fully harness the potential of this data, it is essential to go beyond mere images; it necessitates the inclusion of context, categorization, and labeling.