Deep LearningArchitectures

Vision Transformer

Overview

A transformer architecture adapted for image recognition that divides images into patches and processes them as sequences, rivalling convolutional networks in visual tasks.

Cross-References(1)

Deep Learning

More in Deep Learning