Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use the Latin script. It is primarily used by Czech. It is also used for Polish (as can Windows-1257), Slovak, Hungarian, Slovene (as can Windows-1257), Serbo-Croatian (Latin script), Romanian (before a 1993 spelling reform) and Albanian (as can Windows-1252). It may also be used with the German language, though it is missing uppercase ẞ. German-language texts encoded with Windows-1250 and Windows-1252 are identical.
Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use the Latin script. It is primarily used by Czech. It is also used for Polish (as can Windows-1257), Slovak, Hungarian, Slovene (as can Windows-1257), Serbo-Croatian (Latin script), Romanian (before a 1993 spelling reform) and Albanian (as can Windows-1252). It may also be used with the German language, though it is missing uppercase ẞ. German-language texts encoded with Windows-1250 and Windows-1252 are identical.
This has been replaced by UTF-8 far more than Windows-1252 has. , less than 0.05% of all web pages use Windows-1250.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).