language of the Romani people belonging to the Indo-Aryan branch of the Indo-European language family
Romani is the language spoken by the Romani people and belongs to the Indo-Aryan branch of the Indo-European language family, making it distantly related to languages like Hindi and English. Understanding Romani matters because it is the native language of a distinct ethnic group whose history, culture, and linguistic heritage have often been overlooked or marginalized in mainstream society.
AI-generated from the Wikipedia summary — may contain errors.
Romani (/ˈrɒməni, ˈroʊ-/ ROM-ə-nee, ROH-; also Romanes /ˈrɒmənɪs/ ROM-ən-iss, Romany; Romani: rromani ćhib) is an Indo-Aryan macrolanguage of the Romani people. The largest Romani dialects are Vlax Romani (about 500,000 speakers), Balkan Romani (600,000), and Sinte Romani (300,000). Some Romani communities speak mixed languages based on the surrounding language with retained Romani-derived vocabulary – these are known by linguists as Para-Romani varieties, rather than dialects of the Romani language itself.
The differences between the various varieties can be as large as, for example, the differences between the Slavic languages.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).