Multi-Head Attention

Overview

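An attention mechanism that runs several attention operations ("heads") in parallel, each with its own learned projections of the queries, keys, and values. The head outputs are concatenated and passed through a final linear projection, which lets the model capture different types of relationships between positions at the same time.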
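As a rough illustration of the parallel heads described above, the following is a minimal, dependency-free sketch of multi-head self-attention using scaled dot-product attention per head. Everything named here (`multi_head_attention`, `init_params`, the `params` layout, the random untrained weights, and the toy sizes) is an assumption made for the example. The per-head projections, concatenation, and output projection follow the common Transformer formulation rather than anything specified in this entry.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(q, k):
    """Scaled dot-product attention weights for a single head."""
    d_k = len(q[0])
    k_t = [list(col) for col in zip(*k)]  # transpose keys: d_k x seq_len
    scores = matmul(q, k_t)               # seq_len x seq_len
    return [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]


def random_matrix(rows, cols, rng):
    """Small random matrix standing in for learned weights (hypothetical)."""
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


def init_params(d_model, num_heads, rng):
    """One (W_q, W_k, W_v) triple per head plus a shared output projection."""
    if d_model % num_heads != 0:
        raise ValueError("d_model must be divisible by num_heads")
    d_head = d_model // num_heads
    heads = [
        tuple(random_matrix(d_model, d_head, rng) for _ in range(3))
        for _ in range(num_heads)
    ]
    return {"heads": heads, "w_o": random_matrix(d_model, d_model, rng)}


def multi_head_attention(x, params):
    """Self-attention over x (seq_len x d_model).

    Returns the projected output and the attention weights of every head.
    """
    head_outputs, head_weights = [], []
    for w_q, w_k, w_v in params["heads"]:
        # Each head projects the same input with its own weights.
        q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
        weights = attention_weights(q, k)
        head_outputs.append(matmul(weights, v))
        head_weights.append(weights)
    # Concatenate the head outputs along the feature axis, position by position.
    concat = [sum((h[i] for h in head_outputs), []) for i in range(len(x))]
    return matmul(concat, params["w_o"]), head_weights


rng = random.Random(0)
seq_len, d_model, num_heads = 4, 8, 2
x = random_matrix(seq_len, d_model, rng)
params = init_params(d_model, num_heads, rng)
output, weights = multi_head_attention(x, params)
print(len(output), len(output[0]))  # 4 8
```

Each head works in a smaller subspace (`d_model // num_heads` features), so the total cost stays close to that of a single full-width head while each head remains free to learn a different attention pattern. Practical implementations usually batch all heads into one tensor operation instead of looping over them.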

Cross-References

Deep Learning
