
Transformer

Overview

A neural network architecture based entirely on attention mechanisms. It eliminates recurrence, so every position in a sequence can be processed in parallel rather than one step at a time.
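As a rough illustration of why attention allows parallel processing, the sketch below implements scaled dot-product self-attention in plain Python. It is a minimal sketch under stated assumptions, not a full Transformer layer: the names `softmax`, `self_attention`, and `tokens` are hypothetical, and the learned query, key, and value projections used in real models are replaced by the identity for brevity.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Scaled dot-product self-attention over a list of equal-length vectors.

    Assumption for illustration: queries, keys, and values are all the
    input vectors themselves (identity projections).
    """
    d = len(x[0])
    scale = math.sqrt(d)
    output = []
    # Each position's output depends only on the inputs, never on another
    # position's output, so the iterations of this loop are independent.
    for q in x:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in x]
        weights = softmax(scores)
        output.append(
            [sum(w * v[j] for w, v in zip(weights, x)) for j in range(d)]
        )
    return output


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(tokens)
```

Because no iteration of the outer loop reads a result produced by another iteration, the whole computation can be expressed as a few matrix products and run in parallel. A recurrent network, by contrast, must finish step t before it can start step t+1.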

Cross-References

Deep Learning
