historical character encoding designed mainly for the simplified ideographic writing system used in the People's Republic of China, with a very basic support of a few other scripts
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).