Unicode
Unicode
  • Unicode Consortium
  • ISO/IEC 10646 (Universal Character Set)
  • Versions
Code points
  • Block
    • List
  • Universal Character Set
  • Character charts
  • Character property
  • Plane
  • Private Use Area
Characters
Special purpose
  • BOM
  • Combining grapheme joiner
  • Left-to-right mark / Right-to-left mark
  • Soft hyphen
  • Variant form
  • Word joiner
  • Zero-width joiner
  • Zero-width non-joiner
  • Zero-width space
Lists
  • Characters
  • CJK Unified Ideographs
  • Combining character
  • Duplicate characters
  • Numerals
  • Scripts
  • Spaces
  • Symbols
  • Halfwidth and fullwidth
  • Alias names and abbreviations
  • Whitespace characters
Processing
Algorithms
  • Bidirectional text
  • Collation
    • ISO/IEC 14651
  • Equivalence
  • Variation sequences
  • International Ideographs Core
Comparison
  • BOCU-1
  • CESU-8
  • Punycode
  • SCSU
  • UTF-1
  • UTF-7
  • UTF-8
  • UTF-16/UCS-2
  • UTF-32/UCS-4
  • UTF-EBCDIC
On pairs of
code points
  • Combining character
  • Compatibility characters
  • Duplicate characters
  • Equivalence
  • Homoglyph
  • Precomposed character
    • list
  • Z-variant
  • Variation sequences
  • Regional indicator symbol
  • Emoji skin color
Usage
  • Domain names (IDN)
  • Email
  • Fonts
  • HTML
    • entity references
    • numeric references
  • Input
  • International Ideographs Core
Related standards
  • Common Locale Data Repository (CLDR)
  • GB 18030
  • ISO/IEC 8859
  • ISO 15924
Related topics
  • Anomalies
  • ConScript Unicode Registry
  • Ideographic Research Group
  • International Components for Unicode
  • People involved with Unicode
  • Han unification
Scripts and symbols in Unicode
Common and
inherited scripts
  • Combining marks
  • Diacritics
  • Punctuation marks
  • Spaces
  • Numbers
Modern scripts
  • Adlam
  • Arabic
  • Armenian
  • Balinese
  • Bamum
  • Batak
  • Bengali
  • Bopomofo
  • Braille
  • Buhid
  • Burmese
  • Canadian Aboriginal
  • Chakma
  • Cham
  • Cherokee
  • CJK Unified Ideographs (Han)
  • Cyrillic
  • Deseret
  • Devanagari
  • Geʽez
  • Georgian
  • Greek
  • Gujarati
  • Gunjala Gondi
  • Gurmukhi
  • Hangul
  • Hanifi Rohingya
  • Hanja
  • Hanunuoo
  • Hebrew
  • Hiragana
  • Javanese
  • Kanji
  • Kannada
  • Katakana
  • Kayah Li
  • Khmer
  • Lao
  • Latin
  • Lepcha
  • Limbu
  • Lisu (Fraser)
  • Lontara
  • Malayalam
  • Masaram Gondi
  • Mende Kikakui
  • Medefaidrin
  • Miao (Pollard)
  • Mongolian
  • Mru
  • N'Ko
  • New Tai Lue
  • Nüshu
  • Nyiakeng Puachue Hmong
  • Odia
  • Ol Chiki
  • Osage
  • Osmanya
  • Pahawh Hmong
  • Pau Cin Hau
  • Pracalit (Newa)
  • Ranjana
  • Rejang
  • Samaritan
  • Saurashtra
  • Shavian
  • Sinhala
  • Sorang Sompeng
  • Sundanese
  • Syriac
  • Tagbanwa
  • Tai Le
  • Tai Tham
  • Tai Viet
  • Tamil
  • Tangsa
  • Telugu
  • Thaana
  • Thai
  • Tibetan
  • Tifinagh
  • Tirhuta
  • Toto
  • Vai
  • Wancho
  • Warang Citi
  • Yi
Ancient and
historic scripts
  • Ahom
  • Anatolian hieroglyphs
  • Ancient North Arabian
  • Avestan
  • Bassa Vah
  • Bhaiksuki
  • Brāhmī
  • Carian
  • Caucasian Albanian
  • Coptic
  • Cuneiform
  • Cypriot
  • Cypro-Minoan
  • Dives Akuru
  • Dogra
  • Egyptian hieroglyphs
  • Elbasan
  • Elymaic
  • Glagolitic
  • Gothic
  • Grantha
  • Hatran
  • Imperial Aramaic
  • Inscriptional Pahlavi
  • Inscriptional Parthian
  • Kaithi
  • Kharosthi
  • Khitan small script
  • Khojki
  • Khudawadi
  • Khwarezmian (Chorasmian)
  • Linear A
  • Linear B
  • Lycian
  • Lydian
  • Mahajani
  • Makasar
  • Mandaic
  • Manichaean
  • Marchen
  • Meetei Mayek
  • Meroitic
  • Modi
  • Multani
  • Nabataean
  • Nandinagari
  • Ogham
  • Old Hungarian
  • Old Italic
  • Old Permic
  • Old Persian cuneiform
  • Old Sogdian
  • Old Turkic
  • Old Uyghur
  • Palmyrene
  • ʼPhags-pa
  • Phoenician
  • Psalter Pahlavi
  • Runic
  • Sharada
  • Siddham
  • Sogdian
  • South Arabian
  • Soyombo
  • Sylheti Nagri
  • Tagalog (Baybayin)
  • Takri
  • Tangut
  • Ugaritic
  • Vithkuqi
  • Yezidi
  • Zanabazar Square
Notational scripts
  • Duployan
  • SignWriting
Symbols, emojis
  • Cultural, political, and religious symbols
  • Currency
  • Control Pictures
  • Mathematical operators and symbols
    • List by subject
  • Phonetic symbols (including IPA)
  • Emoji
  • Category: Unicode
  • Category: Unicode blocks
Template documentation

Contents

  • 1 Usage
  • 2 Unicode terms used
  • 3 Unicode version
  • 4 See also

Usage

  • This navigation box is intended for Unicode-related pages. The navigation box guidelines recommend against creating a box with a very large number of entries, such as the names of all 165 Unicode blocks.

Initial visibility: currently defaults to collapsed

To set this template's initial visibility, the |state= parameter may be used:

  • |state=collapsed: ((Unicode navigation|state=collapsed)) to show the template collapsed, i.e., hidden apart from its title bar
  • |state=expanded: ((Unicode navigation|state=expanded)) to show the template expanded, i.e., fully visible
  • |state=autocollapse: ((Unicode navigation|state=autocollapse))
    • shows the template collapsed to the title bar if there is a ((navbar)), a ((sidebar)), or some other table on the page with the collapsible attribute
    • shows the template in its expanded state if there are no other collapsible items on the page

If the |state= parameter in the template on this page is not set, the template's initial visibility is taken from the |default= parameter in the Collapsible option template. For the template on this page, that currently evaluates to collapsed.
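The rules above can be read as a small decision procedure. The following Python sketch only illustrates that reading and is not the template's actual implementation; the function name `resolve_initial_state`, its parameters, and the handling of unrecognized values are all assumptions made for the example.

```python
from typing import Optional

VALID_STATES = ("collapsed", "expanded", "autocollapse")


def resolve_initial_state(
    state: Optional[str] = None,
    default: str = "collapsed",
    other_collapsible_items: int = 0,
) -> str:
    """Return "collapsed" or "expanded" for the navigation box.

    state: value of |state=, or None when the parameter is not set.
    default: value of |default= (described above as currently "collapsed").
    other_collapsible_items: hypothetical count of other collapsible
        items (navbar, sidebar, collapsible tables) on the page.
    """
    # An unset |state= falls back to the |default= value.
    chosen = state if state else default
    if chosen not in VALID_STATES:
        # Assumption: the real template may treat unknown values differently.
        raise ValueError(f"unknown state: {chosen!r}")
    if chosen == "autocollapse":
        # Collapse only when some other collapsible item is on the page.
        return "collapsed" if other_collapsible_items > 0 else "expanded"
    return chosen


print(resolve_initial_state())                   # |state= not set
print(resolve_initial_state("expanded"))         # explicit override
print(resolve_initial_state("autocollapse", other_collapsible_items=2))
```

Under these assumptions, a page that sets nothing gets the collapsed box, and `autocollapse` depends only on whether the page has other collapsible items.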

Unicode terms used

  • A block in Unicode is a named, contiguous range of code points, e.g. the Miscellaneous Symbols block, with range U+2600–U+26FF. Each character belongs to exactly one block. A block can include both assigned and unassigned code points, and both character and noncharacter code points.
  • A script relates to speech (like an alphabet); symbols relate to their meaning (like chess symbols). Symbols also include control characters and Unicode specials such as the byte order mark (BOM). Every defined Unicode character is either in a script or a symbol.
  • Charts show example glyphs; a glyph is a rendered character (the "A" you read).
  • Because of legacy character sets such as ASCII, and because of how Unicode is intended to be used, several issues arise from pairs of characters with overlapping meaning.
  • Special characters have a Unicode-defined behavior.
  • Miscellaneous lists are lists of characters that are not confined to one block. For example, cultural symbols such as religious crosses are spread over different blocks.
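The terms "block" and "pairs of characters" can be made concrete with Python's standard `unicodedata` module. This is a minimal sketch, not part of the template: the helper `block_of` and the small `BLOCKS` table are hypothetical, and the second block (Arabic Presentation Forms-A, U+FB50–U+FDFF) is added here only as an illustration of a block that contains noncharacter code points.

```python
import unicodedata
from typing import Optional

# Inclusive block ranges. The first is the example from the list above;
# the second is an assumption added for illustration.
BLOCKS = {
    "Miscellaneous Symbols": (0x2600, 0x26FF),
    "Arabic Presentation Forms-A": (0xFB50, 0xFDFF),
}


def block_of(char: str) -> Optional[str]:
    """Return the name of the block in BLOCKS containing char, if any."""
    code_point = ord(char)
    for name, (first, last) in BLOCKS.items():
        if first <= code_point <= last:
            return name
    return None  # not in this small table


# A chess symbol lives in the Miscellaneous Symbols block.
king = "\u2654"
print(unicodedata.name(king), "is in", block_of(king))

# U+FDD0 is a noncharacter, yet it still falls inside a block.
# Its General Category is Cn (unassigned).
print(block_of("\uFDD0"), unicodedata.category("\uFDD0"))

# A pair with overlapping meaning: "e" plus a combining acute accent
# is canonically equivalent to the precomposed U+00E9.
combined = "e\u0301"
precomposed = "\u00e9"
print(unicodedata.normalize("NFC", combined) == precomposed)
```

A complete block lookup would need the full block list from the Unicode Character Database; the two-entry table here is only enough to show the idea.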

Unicode version

  • Scripts: listed as of Unicode version 5.2; Batak, Brāhmī, and Mandaic were added in version 6.0.

See also

  • ((Unicode navigation/colors)): background colors derived from the Wikipedia standard
Unicode templates
General
  • ((Unicode navigation))
Inline
  • ((Unichar))
  • ((U+))
  • ((GB18030))
  • ((#invoke:Unicode convert))
Character
properties
  • Bidi Class
  • General Category
    • Diacritics in Unicode
    • Punctuation marks in Unicode
  • Hexadecimal digit
  • Numeric Type
  • Whitespace
  • Alias names and abbreviations
Code points
  • Planes
  • Unicode blocks
  • Private Use Area
Scripts
  • ((ISO 15924 script codes and related Unicode data))
CJK-specific
  • ((CJK ideographs in Unicode))
  • ((CJKV))
  • ((Unihan))
  • ((Lang-zh))
Wikipedia related
  • ((Contains special characters)) (with 'Uncommon Unicode' or more specifically)
  • ((PUA)) (MOS:PUA)
  • Unicode blocks
  • Unicode charts
  • Unicode templates
The above documentation is transcluded from Template:Unicode navigation/doc.
Categories:
  • Software navigational boxes
  • Unicode templates