ordered sequence of Unicode characters encoded in canonical form and with an assigned standardized name, used to represent some composite character which is not encodable individually
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).