Also known as Unicode Transformation Format – 8-bit, UCS Transformation Format – 8-bit, UTF-2, FSS-UTF, filesystem safe UTF, UTF 8
UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode Transformation Format 8-bit. As of 2026, almost every webpage (99%) is transmitted as UTF-8.
via Wikipedia infobox
via Wikidata sitelinks · CC0
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).