Overview
Reinforcement Learning from Human Feedback — a technique for aligning language models with human preferences through reward modelling.
Cross-References(2)
More in Natural Language Processing
Named Entity Recognition
Parsing & StructureAn NLP task that identifies and classifies named entities in text into categories like person, organisation, and location.
Dialogue Management
Generation & TranslationThe component of conversational systems that tracks conversation state, determines the next system action, and maintains coherent multi-turn interactions with users.
Sentiment Analysis
Text AnalysisThe computational study of people's opinions, emotions, and attitudes expressed in text.
Cross-Lingual Transfer
Core NLPThe application of models trained in one language to perform tasks in another language, leveraging shared multilingual representations learned during pre-training.
Vector Database
Core NLPA database optimised for storing and querying high-dimensional vector embeddings for similarity search.
Latent Dirichlet Allocation
Core NLPA generative probabilistic model for discovering topics in a collection of documents.
Machine Translation
Generation & TranslationThe use of AI to automatically translate text or speech from one natural language to another.
Instruction Following
Semantics & RepresentationThe capability of language models to accurately interpret and execute natural language instructions, a core skill developed through instruction tuning and alignment training.
See Also
Reinforcement Learning
A machine learning paradigm where agents learn optimal behaviour through trial and error, receiving rewards or penalties.
Machine LearningReinforcement Learning from Human Feedback
A training paradigm where AI models are refined using human preference signals, aligning model outputs with human values and quality expectations through reward modelling.
Artificial Intelligence