Overview
Automatically generating natural language descriptions of the content depicted in images.
More in Computer Vision
Visual SLAM
3D & SpatialSimultaneous Localisation and Mapping using visual sensors to build a map while tracking position within it.
Image Generation
Generation & EnhancementCreating new images from scratch using generative AI models like GANs, diffusion models, or VAEs.
Panoptic Segmentation
Segmentation & AnalysisA unified approach combining semantic and instance segmentation to provide complete scene understanding.
3D Reconstruction
3D & SpatialThe process of capturing and creating three-dimensional models of real-world objects or environments from visual data.
Medical Imaging AI
Recognition & DetectionApplication of computer vision and deep learning to analyse medical images for diagnosis, screening, and treatment planning.
Point Cloud
3D & SpatialA set of data points in 3D space, typically generated by LiDAR or depth sensors, representing surface geometry.
Image Registration
Recognition & DetectionThe process of aligning two or more images of the same scene taken at different times, viewpoints, or by different sensors.
Instance Segmentation
Segmentation & AnalysisDetecting and delineating each distinct object instance in an image at the pixel level.