Overview
The economic model governing the pricing and allocation of computational resources for AI inference, including per-token billing, rate limiting, and credit systems.
Cross-References(3)
More in Artificial Intelligence
AI Red Teaming
Safety & GovernanceThe systematic adversarial testing of AI systems to identify vulnerabilities, failure modes, harmful outputs, and safety risks before deployment.
Model Pruning
Models & ArchitectureThe process of removing redundant or less important parameters from a neural network to reduce its size and computational cost.
Frame Problem
Foundations & TheoryThe challenge in AI of representing the effects of actions without having to explicitly state everything that remains unchanged.
AI Safety
Safety & GovernanceThe interdisciplinary field dedicated to making AI systems safe, robust, and beneficial while minimizing risks of unintended consequences.
In-Context Learning
Prompting & InteractionThe ability of large language models to learn new tasks from examples provided within the input prompt without parameter updates.
Direct Preference Optimisation
Training & InferenceA simplified alternative to RLHF that directly optimises language model policies using preference data without requiring a separate reward model.
Confusion Matrix
Evaluation & MetricsA table used to evaluate classification model performance by comparing predicted classifications against actual classifications.
Quantisation
Evaluation & MetricsReducing the precision of neural network weights and activations from floating-point to lower-bit representations for efficiency.