
Tangut
The Art of War written in Tangut
Script type	Logographic
Creator	Yeli Renrong
Time period	1036–1502
Direction	Vertical right-to-left, left-to-right
Languages	Tangut language
Related scripts
Parent systems	Constructed script inspired by Chinese characters Tangut
ISO 15924
ISO 15924	Tang (520), Tangut
Unicode
Unicode alias	Tangut
Unicode range	U+17000–U+187FF Tangut; U+18D00–U+18D7F Tangut Supplement; U+18800–U+18AFF Tangut Components; U+16FE0–U+16FFF Ideographic Symbols and Punctuation


The Tangut script (Tangut: 𗼇𘝞; Chinese: 西夏文; pinyin: Xī Xià Wén; lit. 'Western Xia script') was a logographic writing system used for writing the extinct Tangut language of the Western Xia dynasty. According to the latest count, 5,863 Tangut characters are known, excluding variants.^[1] Tangut characters are similar in appearance to Chinese characters,^[2] with the same types of strokes, but the methods of forming characters in the Tangut writing system differ significantly from those used for Chinese characters. As in Chinese calligraphy, regular, running, cursive and seal scripts were used in Tangut writing.

History

According to the History of Song (1346), the script was designed by the high-ranking official Yeli Renrong in 1036.^[3]^[4] The script was devised in a short time and put into use quickly. Government schools were founded to teach it, and official documents were written in it (with diplomatic ones written bilingually). A great number of Buddhist scriptures were translated from Tibetan and Chinese and block-printed in the script.^[5] Although the dynasty collapsed in 1227, the script continued to be used for another few centuries. The last known example of the script occurs on a pair of Tangut dharani pillars found at Baoding in present-day Hebei province, which were erected in 1502.^[6]

Structure

[Tangut] is remarkable for being written in one of the most inconvenient of all scripts, a collection of nearly 5,800 characters of the same kind as Chinese characters but rather more complicated; very few are made up of as few as four strokes and most are made up of a good many more, in some cases nearly twenty... There are few recognizable indications of sound and meaning in the constituent parts of a character, and in some cases characters which differ from one another only in minor details of shape or by one or two strokes have completely different sounds and meanings.^[7]
— Gerard Clauson

Tangut characters can be divided into two classes: simple and composite. The latter are much more numerous. The simple characters can be either semantic or phonetic. None of the Tangut characters are pictographic, whereas Chinese characters were at the time of their creation; this is one of the major differences between Tangut and Chinese characters.

The Tangut character "mud" is made with part of the character "water" (far left) and the whole of the character "soil"

Most composite characters comprise two components. A few comprise three or four. A component can be a simple character, or part of a composite character. The composite characters include semantic-semantic ones and semantic-phonetic ones. A few special composite characters were made for transliterating Chinese and Sanskrit.

A number of pairs of special composite characters are worth noting. The two members of such a pair have the same components and differ only in where those components are placed (e.g. AB vs. BA, ABC vs. ACB). The members of such a pair have very similar meanings.
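The pairing described above can be illustrated with a minimal sketch. It assumes a hypothetical table that maps each character to its components listed by position; the function name, the table, and the labels (`char1`, `A`, `B`, ...) are placeholders invented for illustration and are not real Tangut decomposition data.

```python
from collections import defaultdict
from itertools import combinations


def find_reordered_pairs(decompositions):
    """Return pairs of characters that share the same components
    but arrange them in different positions (e.g. AB vs. BA).

    `decompositions` is a hypothetical mapping from a character label
    to a tuple of component labels, listed by position.
    """
    # Group characters by their components, ignoring position.
    groups = defaultdict(list)
    for char, components in decompositions.items():
        groups[tuple(sorted(components))].append(char)

    pairs = []
    for members in groups.values():
        for a, b in combinations(sorted(members), 2):
            # Same components, different arrangement.
            if decompositions[a] != decompositions[b]:
                pairs.append((a, b))
    return sorted(pairs)


# Placeholder labels only; not real Tangut characters or components.
example = {
    "char1": ("A", "B"),
    "char2": ("B", "A"),
    "char3": ("A", "B", "C"),
    "char4": ("A", "C", "B"),
    "char5": ("A", "D"),
}

print(find_reordered_pairs(example))
```

With the placeholder table, the sketch reports `char1`/`char2` and `char3`/`char4` as pairs, mirroring the AB vs. BA and ABC vs. ACB patterns. Whether two such characters actually have similar meanings would have to be checked against a dictionary; the code only detects the shared components.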

The Sea of Characters (Tangut: 𘝞𗗚; Chinese: 文海; pinyin: wén hǎi), a 12th-century monolingual Tangut rhyming dictionary, analyzes which other characters each character is derived from. Its analyses illustrate another difference between Tangut and Chinese characters. In Chinese, typically, each semantic component has its own meaning, and each phonetic component its own sound; they contribute this meaning or sound to any complex character they appear in. By contrast, in the Sea of Characters analysis of Tangut, a component contributes the meaning or sound of some other character that contains it, potentially a different one in every appearance. For example, the component 𘤊 can have the meaning of "bird" (𗿼 *dźjwow, of which it is the left side), as in 𗿝 *dze "wild goose" = 𗿼 *dźjwow "bird" + 𗨜 *dze "longevity". But the same component is also used to convey the meanings of bone, smoke, food, and time, among others.^[8]

Some components take different shapes depending on which part of the character they appear in (e.g., left side, right side, middle, bottom).^[8]

Reconstruction

Main articles: Tangutology and Tangut language


Unicode

Main articles: Tangut (Unicode block), Tangut Components (Unicode block), and Tangut Supplement (Unicode block)

6,125 characters of the Tangut script were included in the Tangut block in Unicode version 9.0 in June 2016. In the same version, 755 radicals and components used in the modern study of Tangut were added to the Tangut Components block, and an iteration mark, U+16FE0 𖿠 TANGUT ITERATION MARK, was included in the Ideographic Symbols and Punctuation block.^[9] Five additional characters were added in June 2018 with the release of Unicode version 11.0, and six more in March 2019 with the release of Unicode version 12.0. A further nine Tangut ideographs were added to the Tangut Supplement block and 13 Tangut components were added to the Tangut Components block in March 2020 with the release of Unicode version 13.0. The end point of the Tangut Supplement block was corrected in Unicode version 14.0, from the erroneous U+18D8F in version 13.0 to U+18D7F.^[10]
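As a minimal sketch of how these blocks might be used in practice, the following Python function reports which Tangut-related block a character falls in. The ranges are the ones listed in this article (with the Unicode 14.0 end point for Tangut Supplement); the function name and the table layout are illustrative assumptions rather than part of any standard library. Membership in a block range does not mean the code point is assigned to a character.

```python
from typing import Optional

# Block ranges as listed in this article. Tangut Supplement uses the
# corrected Unicode 14.0 end point (U+18D7F).
TANGUT_RELATED_BLOCKS = [
    (0x16FE0, 0x16FFF, "Ideographic Symbols and Punctuation"),
    (0x17000, 0x187FF, "Tangut"),
    (0x18800, 0x18AFF, "Tangut Components"),
    (0x18D00, 0x18D7F, "Tangut Supplement"),
]


def tangut_block(char: str) -> Optional[str]:
    """Return the name of the Tangut-related block containing `char`,
    or None if the character lies outside all of them."""
    if len(char) != 1:
        raise ValueError("expected a single character")
    code_point = ord(char)
    for start, end, name in TANGUT_RELATED_BLOCKS:
        if start <= code_point <= end:
            return name
    return None


print(tangut_block(chr(0x17000)))  # first code point of the Tangut block
print(tangut_block(chr(0x16FE0)))  # TANGUT ITERATION MARK
print(tangut_block("A"))           # outside all listed blocks
```

Because U+18D8F lies beyond the corrected end point, `tangut_block(chr(0x18D8F))` returns `None` under these ranges.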
