Overview
An inference acceleration technique in which a small draft model generates candidate token sequences that the larger target model then verifies in parallel, so that several tokens can be accepted per target-model forward pass.
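The draft-then-verify loop can be illustrated with a minimal sketch. It assumes greedy decoding and uses two hypothetical toy functions, `target_next` and `draft_next`, in place of real models; the names, the toy arithmetic, and the lookahead parameter `k` are all illustrative assumptions. Sampling-based variants use a probabilistic acceptance rule rather than the exact-match check shown here.

```python
from typing import Callable, List, Tuple

# A "model" here is a hypothetical stand-in: a function that maps a token
# prefix to the next token it would choose greedily.
NextToken = Callable[[List[int]], int]


def target_next(prefix: List[int]) -> int:
    # Toy target model (assumption): arbitrary deterministic rule.
    return (prefix[-1] * 3 + len(prefix)) % 7


def draft_next(prefix: List[int]) -> int:
    # Toy draft model (assumption): agrees with the target except when
    # the prefix length is a multiple of 4.
    t = target_next(prefix)
    return t if len(prefix) % 4 else (t + 1) % 7


def greedy_decode(model: NextToken, prompt: List[int], n_new: int) -> List[int]:
    # Baseline: one model call per generated token.
    out = list(prompt)
    for _ in range(n_new):
        out.append(model(out))
    return out


def speculative_decode(
    target: NextToken, draft: NextToken, prompt: List[int], n_new: int, k: int = 4
) -> Tuple[List[int], int]:
    out = list(prompt)
    goal = len(prompt) + n_new
    target_passes = 0
    while len(out) < goal:
        # 1. The draft model proposes k tokens, one after another.
        proposal: List[int] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))
        # 2. The target model scores all k + 1 positions. A real system would
        #    do this in a single batched forward pass; here it is a plain loop.
        verified = [target(out + proposal[:i]) for i in range(k + 1)]
        target_passes += 1
        # 3. Accept the longest prefix of the proposal the target agrees with.
        n_ok = 0
        while n_ok < k and proposal[n_ok] == verified[n_ok]:
            n_ok += 1
        # 4. Keep the accepted tokens plus the target's own token at the first
        #    disagreement (or one extra token if everything was accepted).
        out.extend(proposal[:n_ok])
        out.append(verified[n_ok])
    return out[:goal], target_passes


tokens, passes = speculative_decode(target_next, draft_next, [1, 2], n_new=12)
baseline = greedy_decode(target_next, [1, 2], n_new=12)
```

In this greedy setting the output matches what the target model would have produced alone, while the number of verification passes is lower than the number of generated tokens whenever the draft model agrees often enough.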
More in Artificial Intelligence
AI Orchestration Layer
Infrastructure & Operations: Middleware that manages routing, fallback, load balancing, and model selection across multiple AI providers to optimise cost, latency, and output quality.
Constraint Satisfaction
Reasoning & Planning: A computational approach where problems are defined as a set of variables, domains, and constraints that must all be simultaneously satisfied.
AI Governance
Safety & Governance: The frameworks, policies, and regulations that guide the responsible development and deployment of AI technologies.
Artificial General Intelligence
Foundations & Theory: A hypothetical form of AI that possesses the ability to understand, learn, and apply knowledge across any intellectual task a human can perform.
Artificial Superintelligence
Foundations & Theory: A theoretical level of AI that surpasses human cognitive abilities across all domains, including creativity and social intelligence.
AI Fairness
Safety & Governance: The principle of ensuring AI systems make equitable decisions without discriminating against any group based on protected attributes.
AI Explainability
Safety & Governance: The ability to describe AI decision-making processes in human-understandable terms, enabling trust and regulatory compliance.
AI Transparency
Safety & Governance: The practice of making AI systems' operations, data usage, and decision processes openly visible to stakeholders.