Big-5 or Big5 () is a Chinese character encoding method used in Taiwan, Hong Kong, and Macau for traditional Chinese characters.
via Wikipedia infobox
Big-5 or Big5 () is a Chinese character encoding method used in Taiwan, Hong Kong, and Macau for traditional Chinese characters.
The People's Republic of China (PRC), which uses simplified Chinese characters, uses the GB 18030 character set instead (though it can also substitute Big-5 or UTF-8).
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).