Overview
The process of removing redundant or less important parameters from a neural network to reduce its size and computational cost.
Cross-References(1)
More in Artificial Intelligence
AI Agent Orchestration
Infrastructure & OperationsThe coordination and management of multiple AI agents working together to accomplish complex tasks, routing subtasks between specialised agents based on capability and context.
AI Inference
Training & InferenceThe process of using a trained AI model to make predictions or decisions on new, unseen data.
AI Memory Systems
Infrastructure & OperationsArchitectures that enable AI agents to store, retrieve, and reason over information from past interactions, providing continuity and personalisation across conversations.
AI Training
Training & InferenceThe process of teaching an AI model to recognise patterns by exposing it to large datasets and adjusting its parameters.
Tool Use in AI
Prompting & InteractionThe capability of AI agents to invoke external tools, APIs, databases, and software applications to accomplish tasks beyond the model's intrinsic knowledge and abilities.
Confusion Matrix
Evaluation & MetricsA table used to evaluate classification model performance by comparing predicted classifications against actual classifications.
Hyperparameter Tuning
Training & InferenceThe process of optimising the external configuration settings of a machine learning model that are not learned during training.
Quantisation
Evaluation & MetricsReducing the precision of neural network weights and activations from floating-point to lower-bit representations for efficiency.