Xapian is a free and open-source probabilistic information retrieval library, released under the GNU General Public License (GPL). It is a full-text search engine library for programmers.
It is written in C++, with bindings for Perl, Python (2 and 3), PHP (5 and 7), Java, Tcl, C#, Ruby, Lua, Erlang, Node.js and R. Xapian is highly portable and runs on Linux, OS X, FreeBSD, NetBSD, OpenBSD, Solaris, HP-UX, AIX, Tru64, Windows, OS/2 and Hurd. Xapian grew out of the Muscat search engine, written by Dr. Martin F. Porter at the University of Cambridge. The first official release of Xapian was version 0.5.0, on September 20, 2002.