Overview
A two-stage retrieval process where an initial set of candidate documents is rescored by a more powerful model to improve the relevance ordering of search results.
More in Natural Language Processing
Language Model
Semantics & RepresentationA probabilistic model that assigns probabilities to sequences of words, enabling prediction of the next word in a sequence.
Long-Context Modelling
Semantics & RepresentationTechniques and architectures that enable language models to process and reason over extremely long input sequences, from tens of thousands to millions of tokens.
Named Entity Recognition
Parsing & StructureAn NLP task that identifies and classifies named entities in text into categories like person, organisation, and location.
Speech Recognition
Speech & AudioThe technology that converts spoken language into text, also known as automatic speech recognition.
Instruction Tuning
Semantics & RepresentationTraining a language model to follow natural language instructions by fine-tuning on instruction-response pairs.
Chunking Strategy
Core NLPThe method of dividing long documents into smaller segments for embedding and retrieval, balancing context preservation with optimal chunk sizes for vector search accuracy.
Coreference Resolution
Parsing & StructureThe task of identifying all expressions in text that refer to the same real-world entity.
Semantic Similarity
Semantics & RepresentationA measure of how closely the meanings of two text passages align, computed through embedding comparison and used in duplicate detection, search, and recommendation systems.