Overview
An AI task that generates natural language answers to questions about the content of images.
More in Computer Vision
Medical Imaging AI
Recognition & DetectionApplication of computer vision and deep learning to analyse medical images for diagnosis, screening, and treatment planning.
Feature Extraction
Segmentation & AnalysisThe process of identifying and extracting relevant visual features from images for downstream analysis.
Image Segmentation
Segmentation & AnalysisPartitioning an image into multiple segments or regions, assigning each pixel to a specific class or object.
Bounding Box
Recognition & DetectionA rectangular region drawn around an object in an image to indicate its location for object detection tasks.
Semantic Segmentation
Segmentation & AnalysisClassifying every pixel in an image into a predefined category without distinguishing between individual object instances.
Image Registration
Recognition & DetectionThe process of aligning two or more images of the same scene taken at different times, viewpoints, or by different sensors.
Visual SLAM
3D & SpatialSimultaneous Localisation and Mapping using visual sensors to build a map while tracking position within it.
Point Cloud
3D & SpatialA set of data points in 3D space, typically generated by LiDAR or depth sensors, representing surface geometry.