soft hyphen (U+00AD): a format control character, normally invisible, that marks a permitted break position within a word; if the line is broken at that position, the character is displayed as a hyphen at the end of the line, before the break
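
The behaviour described in this entry can be illustrated with a minimal sketch. The function name `break_at_soft_hyphens` and the greedy fitting strategy are assumptions made for illustration, not part of any standard or library; the sketch also assumes a monospaced display where each code point occupies one column.

```python
SHY = "\u00ad"  # U+00AD SOFT HYPHEN

def break_at_soft_hyphens(word, width):
    """Split `word` into lines of at most `width` columns.

    Breaks are taken only at soft hyphens (hypothetical greedy strategy).
    A soft hyphen where a break is taken becomes a visible "-" at the end
    of the line; every other soft hyphen is dropped and stays invisible.
    A single segment longer than `width` is left to overflow.
    """
    segments = word.split(SHY)
    lines = []
    current = segments[0]
    for i, segment in enumerate(segments[1:], start=1):
        is_last = i == len(segments) - 1
        # A line that is not the final one must leave room for the hyphen.
        needed = len(current) + len(segment) + (0 if is_last else 1)
        if needed <= width:
            current += segment              # no break: soft hyphen stays invisible
        else:
            lines.append(current + "-")     # break: soft hyphen shown as a hyphen
            current = segment
    lines.append(current)
    return lines

word = "hy\u00adphen\u00ada\u00adtion"
print(break_at_soft_hyphens(word, 20))  # fits on one line, no hyphen shown
print(break_at_soft_hyphens(word, 7))   # broken at a soft hyphen
```

With enough room the word comes back whole and no hyphen appears; with a narrow width the break falls at one of the marked positions and a hyphen is shown only there.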