algorithm to process data that is too large to fit into a computer's main memory at one time, optimized to efficiently fetch and access data stored in slow bulk memory (e.g. hard drives, tape drives, remote network)
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).