Overview
A machine learning paradigm where agents learn optimal behaviour through trial and error, receiving rewards or penalties.
Cross-References(1)
More in Machine Learning
Learning Rate
Training TechniquesA hyperparameter that controls how much model parameters are adjusted with respect to the loss gradient during training.
Polynomial Regression
Supervised LearningA form of regression analysis where the relationship between variables is modelled as an nth degree polynomial.
Cross-Validation
Training TechniquesA resampling technique that partitions data into subsets, training on some and validating on others to assess model generalisation.
XGBoost
Supervised LearningAn optimised distributed gradient boosting library designed for speed and performance in machine learning competitions and production.
Bias-Variance Tradeoff
Training TechniquesThe balance between a model's ability to minimise bias (error from assumptions) and variance (sensitivity to training data fluctuations).
K-Means Clustering
Unsupervised LearningA partitioning algorithm that divides data into k clusters by minimising the distance between points and their cluster centroids.
t-SNE
Unsupervised Learningt-Distributed Stochastic Neighbour Embedding — a technique for visualising high-dimensional data in two or three dimensions.
Feature Engineering
Feature Engineering & SelectionThe process of using domain knowledge to create, select, and transform input variables to improve model performance.