abugida script for the Cambodian (Khmer) language
via Wikipedia infobox
Khmer script (Khmer: អក្សរខ្មែរ, Âksâr Khmêr [ʔaksɑː kʰmae]) is an abugida (alphasyllabary) script used to write the Khmer language, the official language of Cambodia. It is also used to write Pali in the Buddhist liturgy of Cambodia and Thailand.
Khmer is written from left to right. Words within the same sentence or phrase are generally run together with no spaces between them. Consonant clusters within a word are "stacked", with the second (and occasionally third) consonant being written in reduced form under the main consonant. Originally there were 35 consonant characters, but modern Khmer uses only 33. Each character represents a consonant sound together with an inherent vowel, either â or ô; in many cases, in the absence of another vowel mark, the inherent vowel is to be pronounced after the consonant.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).