special characters that are part of the spelling of words rather than standing between words or sentences, e.g. the hyphen (“high-five”), the apostrophe (“o’clock”), and the full point, or period, in abbreviations (“Dr.”, “U.F.O.”)