Also known as U+034F, CGJ
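Unicode combining mark (general category Mn, combining class 0, default ignorable), not a true control character; name is a misnomer, since it does not join graphemes; has 2 functions: ① indicates that a pair of characters is not to be treated as a digraph for the purposes of collation and searching (e.g. c + CGJ + h in a language that sorts "ch" as a single letter), with no effect on ligation or cursive joining; ② prevents canonical reordering of combining marks during normalization, e.g. for Hebrew points and cantillation marks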
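Function ② can be checked with Python's standard `unicodedata` module. This is a minimal sketch, not a definitive test: the choice of lamed with patah and hiriq is an illustrative assumption (any two marks with different non-zero combining classes would do), and the variable names are hypothetical.

```python
import unicodedata

CGJ = "\u034f"      # COMBINING GRAPHEME JOINER
LAMED = "\u05dc"    # HEBREW LETTER LAMED (base letter)
PATAH = "\u05b7"    # HEBREW POINT PATAH, combining class 17
HIRIQ = "\u05b4"    # HEBREW POINT HIRIQ, combining class 14

# Illustrative sequences (assumed example): patah typed before hiriq.
plain = LAMED + PATAH + HIRIQ
guarded = LAMED + PATAH + CGJ + HIRIQ

# Without CGJ, normalization sorts the marks by combining class,
# so hiriq (14) moves in front of patah (17).
plain_nfd = unicodedata.normalize("NFD", plain)

# CGJ has combining class 0, so it acts as a barrier and the
# typed order survives normalization.
guarded_nfd = unicodedata.normalize("NFD", guarded)

print(unicodedata.category(CGJ), unicodedata.combining(CGJ))
print([f"U+{ord(c):04X}" for c in plain_nfd])
print([f"U+{ord(c):04X}" for c in guarded_nfd])
```

The same barrier effect applies under NFC, since composition runs only after canonical reordering.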
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).