computing component that transparently stores data so that future requests for that data can be served faster
A cache is a computing component that stores data in a way that lets your computer or device quickly retrieve it again without having to fetch it from scratch. It matters because it makes programs and websites run faster by remembering information you've already accessed, so you don't have to wait as long the next time you need it.
AI-generated from the Wikipedia summary — may contain errors.
Diagram of a CPU memory cache operation
In computing, a cache (/kæʃ/ KASH) is a hardware or software component that stores data so that future requests for that data can be served faster; the data stored in a cache might be the result of an earlier computation or a copy of data stored elsewhere. A cache hit occurs when the requested data can be found in a cache, while a cache miss occurs when it cannot. Cache hits are served by reading data from the cache, which is faster than recomputing a result or reading from a slower data store; thus, the more requests that can be served from the cache, the faster the system performs.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).