character encoding standardized in the People's Republic of China and used mostly for Simplified Chinese; compatible with the universal character set defined in the Unicode® Standard and ISO/IEC 10646, and backward compatible with earlier GB encodings
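
The two compatibility claims can be checked with a short sketch. It assumes the encoding described here is the one Python's standard library exposes under the codec name `gb18030`, and that `gb2312` is a fair example of an earlier GB encoding; neither name appears in the description itself, and the variable names are illustrative only.

```python
# Assumption: "gb18030" is the Python codec for the encoding described above,
# and "gb2312" stands in for an earlier GB encoding.

text = "\u4e2d\u6587"  # the two Simplified Chinese characters for "Chinese (language)"

# Backward compatibility: text covered by the older encoding
# should produce the same bytes under the newer one.
legacy_bytes = text.encode("gb2312")
modern_bytes = text.encode("gb18030")
same_bytes = legacy_bytes == modern_bytes

# Universal character set coverage: a character outside the
# Basic Multilingual Plane should survive an encode/decode round trip.
emoji = "\U0001F600"
emoji_bytes = emoji.encode("gb18030")
round_trip_ok = emoji_bytes.decode("gb18030") == emoji

print(same_bytes, round_trip_ok, len(emoji_bytes))
```

If both flags come out true, the codec behaves as the description says for these samples. This is a spot check on three characters, not a proof of full coverage.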