Overview
The process of breaking text into smaller units (tokens) such as words, subwords, or characters for processing by language models.
More in Natural Language Processing
Machine Translation
Generation & TranslationThe use of AI to automatically translate text or speech from one natural language to another.
Coreference Resolution
Parsing & StructureThe task of identifying all expressions in text that refer to the same real-world entity.
Dialogue System
Generation & TranslationA computer system designed to converse with humans, encompassing task-oriented and open-domain conversation.
Abstractive Summarisation
Text AnalysisA text summarisation approach that generates novel sentences to capture the essential meaning of a document, rather than simply extracting and rearranging existing sentences.
Natural Language Understanding
Core NLPThe subfield of NLP focused on machine reading comprehension and extracting meaning from text.
Natural Language Processing
Core NLPThe field of AI focused on enabling computers to understand, interpret, and generate human language.
Dialogue Management
Generation & TranslationThe component of conversational systems that tracks conversation state, determines the next system action, and maintains coherent multi-turn interactions with users.
Multilingual Model
Semantics & RepresentationA language model trained on text from dozens or hundreds of languages simultaneously, enabling cross-lingual understanding and generation without language-specific fine-tuning.