Category: Unicode
International Phonetic Alphabet
alphabetic system of phonetic notation
Unicode
Unicode (also known as The Unicode Standard and TUS) is a character encoding standard maintained by the Unicode Consortium and designed to support the use of text in all of the world's writing systems that can be digitized. Version 17.0 defines 159,801 characters and 172 scripts used in various ordinary, literary, academic and technical contexts.
optical character recognition
computer recognition of visual text
Unicode Consortium
nonprofit organization that coordinates the development of the Unicode Standard
Universal Character Set
standard set of coded characters defined by the ISO/IEC 10646 international standard
arrow
direction symbol
XeTeX
XeTeX is a TeX typesetting engine using Unicode and supporting modern font technologies such as OpenType, Graphite and Apple Advanced Typography (AAT). It was originally written by Jonathan Kew and is distributed under the X11 free software license.
homoglyph
In orthography and typography, a homoglyph is one of two or more graphemes, characters, or glyphs with shapes that appear identical or very similar but may have differing meaning. The designation is also applied to sequences of characters sharing these properties.
CJKV stroke
basic calligraphic component needed to draw CJKV characters used in East Asia
Number Forms
Unicode block (U+2150–U+218F)
Unicode chess symbol
text character representing a chess piece or noting a movement
list of character entity references in XML and HTML
Wikimedia list article
Unicode plane
range of 65,536 code points for the universal character set defined by ISO/IEC 10646 and The Unicode® Standard
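Because every plane spans 65,536 (0x10000) code points, the plane of a code point can be read off its high bits. The following is a minimal sketch; the function name `plane_of` is hypothetical and chosen only for illustration.

```python
def plane_of(code_point: int) -> int:
    """Return the plane number (0 to 16) that contains a Unicode code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("code point outside the Unicode range")
    # Each plane holds 0x10000 = 65,536 code points, so shifting right
    # by 16 bits discards the offset within the plane.
    return code_point >> 16


print(plane_of(ord("A")))   # Basic Multilingual Plane
print(plane_of(0x1F600))    # a supplementary plane
```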
astrological symbol
signs and symbols denoting various astrological concepts
Common Locale Data Repository
project of the Unicode Consortium to provide locale data in XML format for use in computer applications
International Components for Unicode
software libraries for Unicode support
Uralic Phonetic Alphabet
phonetic transcription system for Uralic languages
box-drawing character
type of character used to draw frames and boxes
Uniscribe
Uniscribe is the Microsoft Windows set of services for rendering Unicode-encoded text, supporting complex text layout. It is implemented as a dynamic link library. Uniscribe was released with Windows 98 SE, Windows 2000 and Internet Explorer 5.0. In addition, the Windows CE platform has supported Uniscribe since version 5.0.
Han unification
effort by Unicode/ISO 10646 to map Han characters into a single set, ignoring regional variations
Omega
extension of the TeX typesetting system that uses the Basic Multilingual Plane of Unicode
Unicode typeface
typeface that maps glyphs to code points defined in the Unicode Standard
IDN homograph attack
using visually similar characters in domain names to deceive users
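One rough way to spot a possible homograph is to check whether a label mixes letters from more than one script. The sketch below is only a heuristic: Python's standard library does not expose the Unicode Script property, so it assumes the first word of each character's Unicode name (for example LATIN or CYRILLIC) is an acceptable stand-in. The helper names `scripts_used` and `is_mixed_script` are hypothetical.

```python
import unicodedata


def scripts_used(text: str) -> set:
    """Approximate the scripts in text from the first word of each letter's name."""
    scripts = set()
    for ch in text:
        if not ch.isalpha():
            continue  # ignore digits, hyphens, dots and other non-letters
        name = unicodedata.name(ch, "")  # empty string if the character is unnamed
        if name:
            scripts.add(name.split()[0])
    return scripts


def is_mixed_script(text: str) -> bool:
    """Flag text whose letters appear to come from more than one script."""
    return len(scripts_used(text)) > 1


genuine = "paypal"
spoofed = "p\u0430ypal"  # second letter is CYRILLIC SMALL LETTER A, not Latin "a"
print(is_mixed_script(genuine), is_mixed_script(spoofed))
```

A real check would rely on the Script property and the confusables data published by the Unicode Consortium rather than on character names.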
ghost characters
erroneous kanji included in the Japanese JIS X 0208 standard and later in Unicode
quad
metal spacer used in typography
Universal Character Set characters
Wikimedia list article
precomposed character
Unicode codepoint that represents a base character with one or more combining characters
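The relationship between a precomposed character and its base-plus-combining sequence can be seen with Unicode normalization. This is a small sketch using Python's standard `unicodedata` module; the example character é (U+00E9) is chosen here for illustration.

```python
import unicodedata

precomposed = "\u00e9"  # LATIN SMALL LETTER E WITH ACUTE, a single code point

# NFD splits the precomposed character into base letter + combining mark.
decomposed = unicodedata.normalize("NFD", precomposed)

# NFC recombines the sequence into the precomposed form.
recomposed = unicodedata.normalize("NFC", decomposed)

print([f"U+{ord(c):04X}" for c in decomposed])
print(recomposed == precomposed)
```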
Unicode input
methods of entering characters by their Unicode code point reference
CJK unified ideograph
ideographic character used in Chinese or Japanese languages and traditionally in Korean or Vietnamese, defined by Unicode under ISO/IEC 10646

ʻ
modifier letter turned comma (U+02BB): typographical alternate for U+02BD ‹ʽ› or U+02BF ‹ʿ›; used for glottal stop in some Polynesian orthographies ("ʻokina" in Hawaiian, "fakauʻa" in Tongan)
script in Unicode
subset of characters in Unicode
Zalgo text
distorted text containing an excess of unusual and non-meaningful combining characters, with no other goal than to obscure it with a glitchy layout, making it difficult to reproduce and read
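A common mitigation is to cap the number of consecutive combining marks instead of stripping them all, so that ordinary accented text survives. The sketch below assumes that the general categories Mn and Me are a reasonable definition of "combining mark" for this purpose; the function name `limit_combining` and the default limit of 2 are hypothetical choices.

```python
import unicodedata


def limit_combining(text: str, max_marks: int = 2) -> str:
    """Drop combining marks beyond max_marks in any consecutive run."""
    out = []
    run = 0  # length of the current run of combining marks
    for ch in text:
        if unicodedata.category(ch) in ("Mn", "Me"):
            run += 1
            if run > max_marks:
                continue  # excess mark: skip it
        else:
            run = 0  # a non-mark character starts a fresh run
        out.append(ch)
    return "".join(out)


zalgo = "a" + "\u0301" * 10 + "b"  # "a" with ten stacked acute accents
cleaned = limit_combining(zalgo)
print(len(zalgo), len(cleaned))
```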
International Ideographs Core
subset of Unicode CJK Unified Ideographs characters intended for use on less capable devices
Unicode and email
relationship between Unicode and email
numeric character reference
markup construct used in SGML, XML, and HTML to refer to a Unicode character by code point, either in decimal (&#198;) or in hexadecimal (&#xC6;)
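As an illustration, the two forms can be generated and decoded with Python's standard library. This is a minimal sketch: `to_decimal_ncr` and `to_hex_ncr` are hypothetical helper names, and Æ (U+00C6, decimal 198) is used as the example character.

```python
import html


def to_decimal_ncr(ch: str) -> str:
    """Format a single character as a decimal numeric character reference."""
    return f"&#{ord(ch)};"


def to_hex_ncr(ch: str) -> str:
    """Format a single character as a hexadecimal numeric character reference."""
    return f"&#x{ord(ch):X};"


ae = "\u00c6"  # LATIN CAPITAL LETTER AE
decimal_ref = to_decimal_ncr(ae)
hex_ref = to_hex_ncr(ae)

# html.unescape resolves both forms back to the same character.
print(decimal_ref, hex_ref, html.unescape(decimal_ref) == html.unescape(hex_ref))
```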