𐼑𐼇𐼄𐼌𐼊𐼋 [*𐼀𐼈𐼂𐼀𐼋]swγδyk [*ʾzβʾk]
𐼼𐼴𐼶𐼹𐼷𐼸 (𐼰𐼵𐼱𐼰𐼸)swγδyk (ʾzβʾk)
Script type
Time period
Late Antiquity
DirectionHorizontal and vertical writing in East Asian scripts, top-to-bottom Edit this on Wikidata
Vertical (left-to-right);
Horizontal (right-to-left)
Related scripts
Parent systems
Child systems
Manichaean alphabet
Old Turkic alphabet

Old Uyghur alphabet

ISO 15924
ISO 15924Sogd (141), ​Sogdian (Sogdian)
Sogo, 142(Old Sogdian)
Unicode alias
 This article contains phonetic transcriptions in the International Phonetic Alphabet (IPA). For an introductory guide on IPA symbols, see Help:IPA. For the distinction between [ ], / / and  , see IPA § Brackets and transcription delimiters.

The Sogdian alphabet was originally used for the Sogdian language, a language in the Iranian family used by the people of Sogdia.[1] The alphabet is derived from Syriac, a descendant script of the Aramaic alphabet. The Sogdian alphabet is one of three scripts used to write the Sogdian language, the others being the Manichaean alphabet and the Syriac alphabet.[1] It was used throughout Central Asia, from the edge of Iran in the west, to China in the east, from approximately 100–1200 A.D.[1]


Like the writing systems from which it is descended, the Sogdian writing system can be described as an abjad, but it also displays tendencies towards an alphabet. The script consists of 17 consonants, many of which have alternative forms for initial, middle, and final position.[2] As in the Aramaic alphabet, long vowels were commonly written with matres lectionis, the consonants aleph, yodh and waw.[2] However, unlike Aramaic and most abjads, these consonant signs would also sometimes serve to express the short vowels (which could also sometimes be left unexpressed as in the parent systems).[3] To disambiguate long vowels from short ones, an additional aleph could be written before the sign denoting the long vowel.[3] The alphabet also includes several diacritics, which were used inconsistently. It is written from right to left, but by the time it had evolved into its child system, the Old Uyghur alphabet, it had been rotated 90 degrees, written vertically in columns from left to right.[2] Voiced and voiceless fricatives are consistently not distinguished in the script.[1]

Aramaic logograms also appear in the script, remnants of adapting the Aramaic alphabet to the Sogdian language.[4] These logograms are used mainly for functional words such as pronouns, articles, prepositions, and conjunctions.[1]


This article should specify the language of its non-English content, using ((lang)), ((transliteration)) for transliterated languages, and ((IPA)) for phonetic transcriptions, with an appropriate ISO 639 code. Wikipedia's multilingual support templates may also be used. See why. (June 2022)

The Sogdian alphabet was found inscribed in Panjakent, so we can suppose alphabetisation rules – they are the same as in the Aramaic alphabet, but the letter Lāmadh is repeated at the end of the alphabet for values ð, θ.

Transcription ˀ β γ d h w z x y k l m n s ˁ p c q r š t ð
Value ā̆, ə, ɨ β, f γ, x * Ø, ā̆ w, ʷ, ū̆, ō̆, ü, ȫ z, ž, ẓ̌/δʳ x * y, ī̆, ē̆, ɨ, ə, ǟ k, g l m, ṁ n, ṁ s * p, b, f č, ǰ, ts * r, ʳ, l š, ṣ̌/θʳ t, d ð, θ
Aramaic values ʔ B G D H W Z Y K L M N S ʕ P Q R Š T (L)

* those letters are not used in Sogdian words.


Three main varieties of the Sogdian alphabet developed over time: Early Sogdian, a Hindu-Buddhist archaic non-cursive type; the sutra script, a calligraphic script used in Sogdian Buddhist scriptures; and the so-called "Uyghur" cursive script (not to be confused with the Old Uyghur alphabet).[1] Early Sogdian dates to the early fourth century C.E., and is characterized by distinct, separated graphemes.[4] The sutra script appears around 500 C.E., while the cursive script develops approximately a century later. The cursive script is thus named because its letters are connected with a base line. Since many letters in the cursive script are extremely similar in form, to the point of being indistinguishable, it is the most difficult to read of the three varieties.[2] As the Sogdian alphabet became more cursive and more stylized, some letters became more difficult to distinguish, or were distinguished only in final position, e.g. n and z.[4]

Source materials

The Sogdian script is known from religious texts of Buddhism, Manichaeism, and Christianity, as well as from secular sources such as letters, coins, and legal documents. The oldest known Sogdian documents are 5 letters known as the Ancient Letters, found in 1907 by Sir Aurel Stein in a watchtower near Dunhuang, China.[2][5] These letters date to approximately 312–313 C.E. and are written in Early Sogdian.[1]

The Sogdian Buddhist texts, written in the sutra script, are younger, dating to approximately the sixth to eighth or ninth century. They were found during the first two decades of the twentieth century in one of the caves of the Thousand Buddhas in the Chinese province of Gansu. The bulk of these manuscripts reside in the British Library, the Bibliothèque nationale de France, and the Russian Academy of Sciences.[1]

Another important discovery was of the Mug Documents in 1933 by Soviet scholars. These documents were found in the remains of a fortress on Mount Mug in northern Tajikistan. The documents, numbering over 76, were written on many different types of materials, such as paper, silk, wood, and skin. According to the dates on the documents, they date to the eighth century C.E. The majority of them were written using the Sogdian cursive script.[1]

Child writing systems

Sogdian script on the Bugut Inscription (585), central Mongolia. Sogdian is the distant ancestor of the Mongolian script.

The "Uyghur" cursive script eventually developed into the Old Uyghur alphabet, which was used to write the Old Uyghur language.[2] This child script was, however, rotated 90 degrees, written in a vertical direction from top to bottom, but with the first vertical line starting from the left side, not from the right as in Chinese, most probably because the right-to-left direction was used in horizontal writing. The Traditional Mongolian alphabet, being an adaptation of the Old Uyghur alphabet, still uses this kind of vertical writing, as does its more remote descendant Manchu.[6]


Main articles: Old Sogdian (Unicode block) and Sogdian (Unicode block)

In Syriac script, three additional characters were used to represent sounds of Sogdian that were not present in the Syriac language. These were included in Unicode in 2002.[7][8][9][10]

Old Sogdian and Sogdian were added to the Unicode Standard in June, 2018 with the release of version 11.0.[11]


The Unicode block for Old Sogdian is U+10F00–U+10F2F and contains 40 characters:

Old Sogdian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+10F0x 𐼀 𐼁 𐼂 𐼃 𐼄 𐼅 𐼆 𐼇 𐼈 𐼉 𐼊 𐼋 𐼌 𐼍 𐼎 𐼏
U+10F1x 𐼐 𐼑 𐼒 𐼓 𐼔 𐼕 𐼖 𐼗 𐼘 𐼙 𐼚 𐼛 𐼜 𐼝 𐼞 𐼟
U+10F2x 𐼠 𐼡 𐼢 𐼣 𐼤 𐼥 𐼦 𐼧
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

The Unicode block for Sogdian is U+10F30–U+10F6F and contains 42 characters:

Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+10F3x 𐼰 𐼱 𐼲 𐼳 𐼴 𐼵 𐼶 𐼷 𐼸 𐼹 𐼺 𐼻 𐼼 𐼽 𐼾 𐼿
U+10F4x 𐽀 𐽁 𐽂 𐽃 𐽄 𐽅 𐽆 𐽇 𐽈 𐽉 𐽊 𐽋 𐽌 𐽍 𐽎 𐽏
U+10F5x 𐽐 𐽑 𐽒 𐽓 𐽔 𐽕 𐽖 𐽗 𐽘 𐽙
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Comparison of Sogdian alphabets

Sogdian alphabets in Unicode
Aramaic Ancient letters Translit. Sogdian cursive Translit. Manichaean Translit. Syriac ("Christian") Translit.
𐡀 𐼀 (𐼁) ʼ 𐼰 ʼ 𐫀 ʼ ܐ ʼ
𐡁 𐫁 b
𐼂 (𐼃) β (B) 𐼱 β (B) 𐫂 β ܒ b
𐼱𐽎 β̣
𐽃 f
𐡂 𐫃 g ܓ g
𐼄 γ (G) 𐼲 γ (G) 𐫄 γ
𐡃 𐼘 D 𐽀 D 𐫅 d ܕ ܖ d
𐼧 ʽD
𐡄 𐼅 (𐼆) h 𐼳 h 𐫆 ܗ h
𐡅 𐼇 w 𐼴 w 𐫇 w ܘ w
𐡆 𐼈 z 𐼵 z 𐫉 z ܙ z
𐼵𐽆 𐼵𐽇 𐼵𐽍 ẓ z̤ ž 𐫊 ž ݍ ž
𐫋 j
𐡇 𐼉 x () 𐼶 x () 𐫍 h ܚ
𐼲 γ (Γ)
𐡈 𐫎 ܛ t
𐡉 𐼊 y 𐼷 y 𐫏 y ܝ y
𐡊 𐼋 k 𐼸 k 𐫐 k ܟ k
𐫑 x ݎ x
𐡋 𐫓 l ܠ l
𐼌 δ (L, D) 𐼹 δ (L, D) 𐫔 δ
𐫕 δδ
𐡌 𐼍 m 𐼺 m 𐫖 m ܡ m
𐡍 𐼎 (𐼏 𐼐) n 𐼻 n 𐫗 n ܢ n
𐡎 𐼑 s 𐼼 s 𐫘 s ܣ s
𐡏 𐼒 𐼓 𐼘 ʽ 𐼽 𐽀 ʽ 𐫙 ʽ ܥ γ
𐡐 𐼔 p 𐼾 p 𐫛 p ܦ p
𐼾𐽉 𐫜 f ݏ f
𐡑 𐼕 (𐼖 𐼗) c () 𐼿 c () 𐫝 c ܨ c
𐡒 𐫞 q ܩ q
𐡓 𐼘 r (D, ʽ) 𐽀 r (D, ʽ) 𐫡 r ܪ r
𐽄 𐽀𐽏 l ṛ
𐡔 𐼙 š 𐽁 š 𐫢 š ܫ š
𐡕 𐼚 (𐼛 𐼜) t 𐽂 t 𐫤 t ܬ θ
𐡘 𐼝 I 𐽑 I 𐫫 I
𐡙 𐼞 II 𐽑𐽑 II
𐡚 𐼟 III 𐽑𐽑𐽑 III
𐡚𐡘 𐼠 IV 𐽑𐽑𐽑𐽑 IV
𐡚𐡙 𐼡 V 𐽑𐽑𐽑 𐽑𐽑 V 𐫬 V
𐡛 𐼢 X 𐽒 X 𐫭 X
𐡜 𐼣 XX 𐽓 XX 𐫮 XX
𐡝 𐼥 C 𐽔 C 𐫯 C
𐼦 12


There are fonts encoding Sogdian alphabets in Unicode – Noto Sans Old Sogdian and Noto Sans Sogdian.

See also


  1. ^ a b c d e f g h i Gharib, B. (1995), Sogdian Dictionary: Sogdian-Persian-English, Tehran, Iran: Farhangan Publications, xiii–xxxvi, ISBN 964-5558-06-9
  2. ^ a b c d e f Coulmas, Florian (10 June 1996), The Blackwell Encyclopedia of Writing Systems, Cambridge, MA: Blackwell Publishers (published 1996), pp. 471–474, 512, ISBN 0-631-19446-0
  3. ^ a b Clauson, Gerard. 2002. Studies in Turkic and Mongolic linguistics. P.103-104.
  4. ^ a b c Daniels, Peter T.; Bright, William (1996), The World's Writing Systems, New York: Oxford University Press, pp. 515–536, ISBN 0-19-507993-0
  5. ^ "The Sogdian Ancient Letters 1, 2, 3, and 5". Silk Road Seattle – University of Washington. translated by Prof. Nicholas Sims-Williams.((cite web)): CS1 maint: others (link)
  6. ^ F.W. Mote (1999), Imperial China, 900–1800, Harvard University Press, pp. 42–43, ISBN 0-674-01212-7
  7. ^ "none". Retrieved 9 April 2018.
  8. ^ Proposal adopted "WG2 resolves to encode the six additional Syriac characters the BMP for use in Sogdian and Persian languages"
  9. ^ Bunz, Carl-Martin (2002) Meeting report: 2nd Iranian Unicode Meeting "For the Sogdian script (as well as the Uyghur script), two possible encoding strategies were discussed. While the soundest solution would consist in providing a code block of its own, a mapping onto the existing Unicode block of Mongolian (U+1800-18af) would be historically also adequate, given that the latter script developed from it."
  10. ^ Michael Everson and Nicholas Sims-Williams (2002-11-04) Shaping behaviour of six Syriac letters for Sogdian and Persian
  11. ^ "Unicode 11.0.0". Unicode Consortium. June 5, 2018. Retrieved June 5, 2018.