Overview
The systematic adversarial testing of AI systems to identify vulnerabilities, failure modes, harmful outputs, and safety risks before deployment.
More in Artificial Intelligence
Cognitive Computing
Foundations & TheoryComputing systems that simulate human thought processes using self-learning algorithms, data mining, pattern recognition, and natural language processing.
AI Tokenomics
Infrastructure & OperationsThe economic model governing the pricing and allocation of computational resources for AI inference, including per-token billing, rate limiting, and credit systems.
Reinforcement Learning from Human Feedback
Training & InferenceA training paradigm where AI models are refined using human preference signals, aligning model outputs with human values and quality expectations through reward modelling.
Turing Test
Foundations & TheoryA measure of machine intelligence proposed by Alan Turing, where a machine is deemed intelligent if it can exhibit conversation indistinguishable from a human.
Fuzzy Logic
Reasoning & PlanningA form of logic that handles approximate reasoning, allowing variables to have degrees of truth rather than strict binary true/false values.
Heuristic Search
Reasoning & PlanningProblem-solving techniques that use practical rules of thumb to find satisfactory solutions when exhaustive search is impractical.
Tensor Processing Unit
Models & ArchitectureGoogle's custom-designed application-specific integrated circuit for accelerating machine learning workloads.
Frame Problem
Foundations & TheoryThe challenge in AI of representing the effects of actions without having to explicitly state everything that remains unchanged.