Unicode block (U+FFFO-FFFF) containing a few interlinear annotation controls and replacement characters, as well as two special code points permanently reserved as non-characters at end of their code plane
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).