Overview
The process of assigning grammatical categories (noun, verb, adjective) to each word in a text.
More in Natural Language Processing
Large Language Model
Semantics & RepresentationA neural network trained on massive text corpora that can generate, understand, and reason about natural language.
Structured Output
Semantics & RepresentationThe generation of machine-readable formatted responses such as JSON, XML, or code from language models, enabling reliable integration with downstream software systems.
Dialogue Management
Generation & TranslationThe component of conversational systems that tracks conversation state, determines the next system action, and maintains coherent multi-turn interactions with users.
Top-K Sampling
Generation & TranslationA text generation strategy that restricts the model to sampling from the K most probable next tokens.
Chunking Strategy
Core NLPThe method of dividing long documents into smaller segments for embedding and retrieval, balancing context preservation with optimal chunk sizes for vector search accuracy.
Tokenisation
Semantics & RepresentationThe process of breaking text into smaller units (tokens) such as words, subwords, or characters for processing by language models.
Latent Dirichlet Allocation
Core NLPA generative probabilistic model for discovering topics in a collection of documents.
Natural Language Processing
Core NLPThe field of AI focused on enabling computers to understand, interpret, and generate human language.