Overview
The process of automatically extracting structured information from unstructured or semi-structured text sources.
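As a rough illustration of the definition above, the sketch below pulls structured records out of semi-structured sentences with a regular expression. The invoice wording, the field names, and the `extract_invoices` helper are all assumptions made for this example. Practical systems often rely on trained models rather than hand-written patterns.

```python
import re

# Hypothetical semi-structured input; the sentence layout is assumed for illustration.
TEXT = """Invoice 1042 issued on 2024-03-15 to Acme Corp for $1,250.00.
Invoice 1043 issued on 2024-04-02 to Globex Inc for $310.50."""

# Named groups map each matched span to a field of the structured record.
PATTERN = re.compile(
    r"Invoice (?P<number>\d+) issued on (?P<date>\d{4}-\d{2}-\d{2}) "
    r"to (?P<customer>.+?) for \$(?P<amount>[\d,]+\.\d{2})"
)


def extract_invoices(text):
    """Return one dict per invoice sentence found in the text."""
    records = []
    for match in PATTERN.finditer(text):
        record = match.groupdict()
        # Convert the numeric fields from strings to typed values.
        record["number"] = int(record["number"])
        record["amount"] = float(record["amount"].replace(",", ""))
        records.append(record)
    return records


records = extract_invoices(TEXT)
for record in records:
    print(record)
```

A pattern like this only works while the input keeps the assumed layout, which is one reason learned extractors are usually preferred for free-form text.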
More in Natural Language Processing
Language Model
Semantics & Representation: A probabilistic model that assigns probabilities to sequences of words, enabling prediction of the next word in a sequence.
Text Embedding
Core NLP: Dense vector representations of text passages that capture semantic meaning for similarity comparison and retrieval.
Semantic Search
Core NLP: Search technology that understands the meaning and intent behind queries rather than just matching keywords.
Semantic Similarity
Semantics & Representation: A measure of how closely the meanings of two text passages align, computed through embedding comparison and used in duplicate detection, search, and recommendation systems.
Cross-Lingual Transfer
Core NLP: The application of models trained on one language to perform tasks in another language, leveraging shared multilingual representations learned during pre-training.
GloVe
Semantics & Representation: Global Vectors for Word Representation, an unsupervised learning algorithm for obtaining word vector representations from aggregated word co-occurrence statistics.
Text Classification
Text Analysis: The task of assigning predefined categories or labels to text documents based on their content.
Grounding
Semantics & Representation: Connecting language model outputs to real-world knowledge, facts, or data sources to improve factual accuracy.
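
Several of the entries above (Text Embedding, Semantic Search, Semantic Similarity) rest on comparing vectors. The sketch below shows one common way to do that: cosine similarity, followed by ranking passages against a query. The three-dimensional vectors are hand-made toy values, not output from any real embedding model, and the `search` helper is a hypothetical name introduced for this example.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # Convention chosen here: a zero vector matches nothing.
    return dot / (norm_a * norm_b)


# Toy vectors (assumption): a real system would obtain these from a trained model.
DOCS = {
    "how to reset a password": [0.9, 0.1, 0.0],
    "recover a forgotten login": [0.8, 0.2, 0.1],
    "best pizza in town": [0.0, 0.1, 0.9],
}


def search(query_vec, docs, top_k=2):
    """Rank passages by similarity to the query vector, highest first."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in docs.items()]
    scored.sort(reverse=True)
    return [text for _, text in scored[:top_k]]


# The query vector stands in for an embedded query about account access.
results = search([1.0, 0.0, 0.0], DOCS)
print(results)
```

The two account-related passages rank above the unrelated one even though none of them needs to share keywords with the query, which is the behavior the Semantic Search entry describes.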