Overview
The process of identifying and extracting relevant visual features from images for downstream analysis.
More in Computer Vision
Visual Question Answering
Recognition & DetectionAn AI task that generates natural language answers to questions about the content of images.
Action Recognition
Recognition & DetectionIdentifying and classifying human actions or activities from video sequences.
Image Classification
Recognition & DetectionThe task of assigning a label or category to an entire image based on its visual content.
Image Captioning
Recognition & DetectionAutomatically generating natural language descriptions of the content depicted in images.
3D Reconstruction
3D & SpatialThe process of capturing and creating three-dimensional models of real-world objects or environments from visual data.
Image Generation
Generation & EnhancementCreating new images from scratch using generative AI models like GANs, diffusion models, or VAEs.
Optical Character Recognition
Recognition & DetectionTechnology that converts images of text into machine-readable text data.
Computer Vision
Recognition & DetectionThe field of AI that enables computers to interpret and understand visual information from images and video.