Artificial IntelligenceModels & Architecture

Speculative Decoding

Overview

An inference acceleration technique where a small draft model generates candidate token sequences that are verified in parallel by the larger target model.

Cross-References(1)

Blockchain & DLT

More in Artificial Intelligence

See Also