|Native to||Serbia, Croatia, Bosnia and Herzegovina, Montenegro, and Kosovo[a]|
|19 million (2022)|
Official language in
Areas where Serbo-Croatian is spoken by a plurality of inhabitants (as of 2005)[needs update]
Note: a Kosovo independence disputed, see 2008 Kosovo declaration of independence
|South Slavic languages and dialects|
Serbo-Croatian (/ˌsɜːrboʊkroʊˈeɪʃən/ (listen)) – also called Serbo-Croat (/ˌsɜːrboʊˈkroʊæt/), Serbo-Croat-Bosnian (SCB), Bosnian-Croatian-Serbian (BCS), and Bosnian-Croatian-Montenegrin-Serbian (BCMS) – is a South Slavic language and the primary language of Serbia, Croatia, Bosnia and Herzegovina, and Montenegro. It is a pluricentric language with four mutually intelligible standard varieties, namely Serbian, Croatian, Bosnian, and Montenegrin.
South Slavic languages historically formed a continuum. The turbulent history of the area, particularly due to expansion of the Ottoman Empire, resulted in a patchwork of dialectal and religious differences. Due to population migrations, Shtokavian became the most widespread dialect in the western Balkans, intruding westwards into the area previously occupied by Chakavian and Kajkavian (which further blend into Slovenian in the northwest). Bosniaks, Croats and Serbs differ in religion and were historically often part of different cultural circles, although a large part of the nations have lived side by side under foreign overlords. During that period, the language was referred to under a variety of names, such as "Slavic" in general or "Serbian", "Croatian" or "Bosnian" in particular. In a classicizing manner, it was also referred to as "Illyrian".
The process of linguistic standardization of Serbo-Croatian was originally initiated in the mid-19th-century Vienna Literary Agreement by Croatian and Serbian writers and philologists, decades before a Yugoslav state was established. From the very beginning, there were slightly different literary Serbian and Croatian standards, although both were based on the same dialect of Shtokavian, Eastern Herzegovinian. In the 20th century, Serbo-Croatian served as the official language of the Kingdom of Yugoslavia (when it was called "Serbo-Croato-Slovenian"), and later as one of the official languages of the Socialist Federal Republic of Yugoslavia. The breakup of Yugoslavia affected language attitudes, so that social conceptions of the language separated along ethnic and political lines. Since the breakup of Yugoslavia, Bosnian has likewise been established as an official standard in Bosnia and Herzegovina, and there is an ongoing movement to codify a separate Montenegrin standard.
Like other South Slavic languages, Serbo-Croatian has a simple phonology, with the common five-vowel system and twenty-five consonants. Its grammar evolved from Common Slavic, with complex inflection, preserving seven grammatical cases in nouns, pronouns, and adjectives. Verbs exhibit imperfective or perfective aspect, with a moderately complex tense system. Serbo-Croatian is a pro-drop language with flexible word order, subject–verb–object being the default. It can be written in either localized variants of Latin (Gaj's Latin alphabet, Montenegrin Latin) or Cyrillic (Serbian Cyrillic, Montenegrin Cyrillic), and the orthography is highly phonemic in all standards.
Serbo-Croatian is typically referred to by names of its standardized varieties: Serbian, Croatian, Bosnian and Montenegrin; it is rarely referred to by names of its sub-dialects, such as Bunjevac.
In the language itself, it is typically known as srpskohrvatski/српскохрватски "Serbo-Croatian", hrvatskosrpski/хрватскoсрпски "Croato-Serbian", or informally naški/нашки "ours".
Throughout the history of the South Slavs, the vernacular, literary, and written languages (e.g. Chakavian, Kajkavian, Shtokavian) of the various regions and ethnicities developed and diverged independently. Prior to the 19th century, they were collectively called "Illyric", "Slavic", "Slavonian", "Bosnian", "Dalmatian", "Serbian" or "Croatian". Since the nineteenth century the term Illyrian or Illyric was used quite often (thus creating confusion with the Illyrian language). Although the word Illyrian was used on a few occasions before, its widespread usage began after Ljudevit Gaj and several other prominent linguists met at Ljudevit Vukotinović's house to discuss the issue in 1832. The term Serbo-Croatian was first used by Jacob Grimm in 1824, popularized by the Viennese philologist Jernej Kopitar in the following decades, and accepted by Croatian Zagreb grammarians in 1854 and 1859. At that time, Serb and Croat lands were still part of the Ottoman and Austrian Empires. Officially, the language was called variously Serbo-Croat, Croato-Serbian, Serbian and Croatian, Croatian and Serbian, Serbian or Croatian, Croatian or Serbian. Unofficially, Serbs and Croats typically called the language "Serbian" or "Croatian", respectively, without implying a distinction between the two, and again in independent Bosnia and Herzegovina, "Bosnian", "Croatian", and "Serbian" were considered to be three names of a single official language. Croatian linguist Dalibor Brozović advocated the term Serbo-Croatian as late as 1988, claiming that in an analogy with Indo-European, Serbo-Croatian does not only name the two components of the same language, but simply charts the limits of the region in which it is spoken and includes everything between the limits ('Bosnian' and 'Montenegrin'). Today, use of the term "Serbo-Croatian" is controversial due to the prejudice that nation and language must match. It is still used for lack of a succinct alternative, though alternative names have emerged, such as Bosnian/Croatian/Serbian (BCS), which is often seen in political contexts such as the International Criminal Tribunal for the former Yugoslavia.
In the 9th century, Old Church Slavonic was adopted as the language of the liturgy in churches serving various Slavic nations. This language was gradually adapted to non-liturgical purposes and became known as the Croatian version of Old Slavonic. The two variants of the language, liturgical and non-liturgical, continued to be a part of the Glagolitic service as late as the middle of the 19th century. The earliest known Croatian Church Slavonic Glagolitic manuscripts are the Glagolita Clozianus and the Vienna Folia from the 11th century.
The beginning of written Serbo-Croatian can be traced from the tenth century and on when Serbo-Croatian medieval texts were written in five scripts: Latin, Glagolitic, Early Cyrillic, Bosnian Cyrillic (bosančica/bosanica), and Arebica, the last principally by Bosniak nobility. Serbo-Croatian competed with the more established literary languages of Latin and Old Slavonic in the west and Persian and Arabic in the east.
Old Slavonic developed into the Serbo-Croatian variant of Church Slavonic between the 12th and 16th centuries.
Among the earliest attestations of Serbo-Croatian are: the Humac tablet, dating from the 10th or 11th century, written in Bosnian Cyrillic and Glagolitic; the Plomin tablet, dating from the same era, written in Glagolitic; the Valun tablet, dated to the 11th century, written in Glagolitic and Latin; and the Inscription of Župa Dubrovačka, a Glagolitic tablet dated to the 11th century.
The Baška tablet from the late 11th century was written in Glagolitic. It is a large stone tablet found in the small Church of St. Lucy, Jurandvor on the Croatian island of Krk that contains text written mostly in Chakavian in the Croatian angular Glagolitic script.
The Charter of Ban Kulin of 1189, written by Ban Kulin of Bosnia, was an early Shtokavian text, written in Bosnian Cyrillic.
The luxurious and ornate representative texts of Serbo-Croatian Church Slavonic belong to the later era, when they coexisted with the Serbo-Croatian vernacular literature. The most notable are the "Missal of Duke Novak" from the Lika region in northwestern Croatia (1368), "Evangel from Reims" (1395, named after the town of its final destination), Hrvoje's Missal from Bosnia and Split in Dalmatia (1404), and the first printed book in Serbo-Croatian, the Glagolitic Missale Romanum Glagolitice (1483).
During the 13th century Serbo-Croatian vernacular texts began to appear, the most important among them being the "Istrian land survey" of 1275 and the "Vinodol Codex" of 1288, both written in the Chakavian dialect.
The Shtokavian dialect literature, based almost exclusively on Chakavian original texts of religious provenance (missals, breviaries, prayer books) appeared almost a century later. The most important purely Shtokavian vernacular text is the Vatican Croatian Prayer Book (c. 1400).
Both the language used in legal texts and that used in Glagolitic literature gradually came under the influence of the vernacular, which considerably affected its phonological, morphological, and lexical systems. From the 14th and the 15th centuries, both secular and religious songs at church festivals were composed in the vernacular.
Writers of early Serbo-Croatian religious poetry (začinjavci) gradually introduced the vernacular into their works. These začinjavci were the forerunners of the rich literary production of the 16th-century literature, which, depending on the area, was Chakavian-, Kajkavian-, or Shtokavian-based. The language of religious poems, translations, miracle and morality plays contributed to the popular character of medieval Serbo-Croatian literature.
One of the earliest dictionaries, also in the Slavic languages as a whole, was the Bosnian–Turkish Dictionary of 1631 authored by Muhamed Hevaji Uskufi and was written in the Arebica script.
In the mid-19th century, Serbian (led by self-taught writer and folklorist Vuk Stefanović Karadžić) and most Croatian writers and linguists (represented by the Illyrian movement and led by Ljudevit Gaj and Đuro Daničić), proposed the use of the most widespread dialect, Shtokavian, as the base for their common standard language. Karadžić standardised the Serbian Cyrillic alphabet, and Gaj and Daničić standardized the Croatian Latin alphabet, on the basis of vernacular speech phonemes and the principle of phonological spelling. In 1850 Serbian and Croatian writers and linguists signed the Vienna Literary Agreement, declaring their intention to create a unified standard. Thus a complex bi-variant language appeared, which the Serbs officially called "Serbo-Croatian" or "Serbian or Croatian" and the Croats "Croato-Serbian", or "Croatian or Serbian". Yet, in practice, the variants of the conceived common literary language served as different literary variants, chiefly differing in lexical inventory and stylistic devices. The common phrase describing this situation was that Serbo-Croatian or "Croatian or Serbian" was a single language. In 1861, after a long debate, the Croatian Sabor put up several proposed names to a vote of the members of the parliament; "Yugoslavian" was opted for by the majority and legislated as the official language of the Triune Kingdom. The Austrian Empire, suppressing Pan-Slavism at the time, did not confirm this decision and legally rejected the legislation, but in 1867 finally settled on "Croatian or Serbian" instead. During the Austro-Hungarian occupation of Bosnia and Herzegovina, the language of all three nations in this territory was declared "Bosnian" until the death of administrator von Kállay in 1907, at which point the name was changed to "Serbo-Croatian".
With unification of the first the Kingdom of the Serbs, Croats, and Slovenes – the approach of Karadžić and the Illyrians became dominant. The official language was called "Serbo-Croato-Slovenian" (srpsko-hrvatsko-slovenački) in the 1921 constitution. In 1929, the constitution was suspended, and the country was renamed the Kingdom of Yugoslavia, while the official language of Serbo-Croato-Slovene was reinstated in the 1931 constitution.
In June 1941, the Nazi puppet Independent State of Croatia began to rid the language of "Eastern" (Serbian) words, and shut down Serbian schools. The totalitarian dictatorship introduced a language law that promulgated Croatian linguistic purism as a policy that tried to implement a complete elimination of Serbisms and internationalisms.
On January 15, 1944, the Anti-Fascist Council of the People's Liberation of Yugoslavia (AVNOJ) declared Croatian, Serbian, Slovene, and Macedonian to be equal in the entire territory of Yugoslavia. In 1945 the decision to recognize Croatian and Serbian as separate languages was reversed in favor of a single Serbo-Croatian or Croato-Serbian language. In the Communist-dominated second Yugoslavia, ethnic issues eased to an extent, but the matter of language remained blurred and unresolved.
In 1954, major Serbian and Croatian writers, linguists and literary critics, backed by Matica srpska and Matica hrvatska signed the Novi Sad Agreement, which in its first conclusion stated: "Serbs, Croats and Montenegrins share a single language with two equal variants that have developed around Zagreb (western) and Belgrade (eastern)". The agreement insisted on the equal status of Cyrillic and Latin scripts, and of Ekavian and Ijekavian pronunciations. It also specified that Serbo-Croatian should be the name of the language in official contexts, while in unofficial use the traditional Serbian and Croatian were to be retained. Matica hrvatska and Matica srpska were to work together on a dictionary, and a committee of Serbian and Croatian linguists was asked to prepare a pravopis. During the sixties both books were published simultaneously in Ijekavian Latin in Zagreb and Ekavian Cyrillic in Novi Sad. Yet Croatian linguists claim that it was an act of unitarianism. The evidence supporting this claim is patchy: Croatian linguist Stjepan Babić complained that the television transmission from Belgrade always used the Latin alphabet— which was true, but was not proof of unequal rights, but of frequency of use and prestige. Babić further complained that the Novi Sad Dictionary (1967) listed side by side words from both the Croatian and Serbian variants wherever they differed, which one can view as proof of careful respect for both variants, and not of unitarism. Moreover, Croatian linguists criticized those parts of the Dictionary for being unitaristic that were written by Croatian linguists. And finally, Croatian linguists ignored the fact that the material for the Pravopisni rječnik came from the Croatian Philological Society. Regardless of these facts, Croatian intellectuals brought the Declaration on the Status and Name of the Croatian Literary Language in 1967. On occasion of the publication's 45th anniversary, the Croatian weekly journal Forum published the Declaration again in 2012, accompanied by a critical analysis.
West European scientists judge the Yugoslav language policy as an exemplary one: although three-quarters of the population spoke one language, no single language was official on a federal level. Official languages were declared only at the level of constituent republics and provinces, and very generously: Vojvodina had five (among them Slovak and Romanian, spoken by 0.5 per cent of the population), and Kosovo four (Albanian, Turkish, Romany and Serbo-Croatian). Newspapers, radio and television studios used sixteen languages, fourteen were used as languages of tuition in schools, and nine at universities. Only the Yugoslav Army used Serbo-Croatian as the sole language of command, with all other languages represented in the army's other activities—however, this is not different from other armies of multilingual states, or in other specific institutions, such as international air traffic control where English is used worldwide. All variants of Serbo-Croatian were used in state administration and republican and federal institutions. Both Serbian and Croatian variants were represented in respectively different grammar books, dictionaries, school textbooks and in books known as pravopis (which detail spelling rules). Serbo-Croatian was a kind of soft standardisation. However, legal equality could not dampen the prestige Serbo-Croatian had: since it was the language of three quarters of the population, it functioned as an unofficial lingua franca. And within Serbo-Croatian, the Serbian variant, with twice as many speakers as the Croatian, enjoyed greater prestige, reinforced by the fact that Slovene and Macedonian speakers preferred it to the Croatian variant because their languages are also Ekavian. This is a common situation in other pluricentric languages, e.g. the variants of German differ according to their prestige, the variants of Portuguese too. Moreover, all languages differ in terms of prestige: "the fact is that languages (in terms of prestige, learnability etc.) are not equal, and the law cannot make them equal".
In 2017, the "Declaration on the Common Language" (Deklaracija o zajedničkom jeziku) was signed by a group of NGOs and linguists from former Yugoslavia. It states that all standardized variants belong to a common polycentric language with equal status.
About 19 million people declare their native language as either 'Bosnian', 'Croatian', 'Serbian', 'Montenegrin', or 'Serbo-Croatian'.
Serbian is spoken by 10 million people around the world, mostly in Serbia (7.8 million), Bosnia and Herzegovina (1.2 million), and Montenegro (300,000). Serbian minorities are found in Kosovo, North Macedonia and in Romania. In Serbia, there are about 760,000 second-language speakers of Serbian, including Hungarians in Vojvodina and the 400,000 estimated Roma. In Kosovo, Serbian is spoken by the members of the Serbian minority which approximates between 70,000 and 100,000. Familiarity of Kosovo Albanians with Serbian varies depending on age and education, and exact numbers are not available.
Croatian is spoken by 6.8 million people in the world, including 4.1 million in Croatia and 600,000 in Bosnia and Herzegovina. A small Croatian minority that lives in Italy, known as Molise Croats, have somewhat preserved traces of Croatian. In Croatia, 170,000, mostly Italians and Hungarians, use it as a second language.
Bosnian is spoken by 2.7 million people worldwide, chiefly Bosniaks, including 2.0 million in Bosnia and Herzegovina, 200,000 in Serbia and 40,000 in Montenegro.
Montenegrin is spoken by 300,000 people globally. The notion of Montenegrin as a separate standard from Serbian is relatively recent. In the 2011 census, around 229,251 Montenegrins, of the country's 620,000, declared Montenegrin as their native language. That figure is likely to increase, due to the country's independence and strong institutional backing of the Montenegrin language.
Serbo-Croatian is also a second language of many Slovenians and Macedonians, especially those born during the time of Yugoslavia. According to the 2002 Census, Serbo-Croatian and its variants have the largest number of speakers of the minority languages in Slovenia.
Outside the Balkans, there are over two million native speakers of the language(s), especially in countries which are frequent targets of immigration, such as Australia, Austria, Brazil, Canada, Chile, Germany, Hungary, Italy, Sweden, and the United States.
Further information: Serbo-Croatian grammar
Serbo-Croatian is a highly inflected language. Traditional grammars list seven cases for nouns and adjectives: nominative, genitive, dative, accusative, vocative, locative, and instrumental, reflecting the original seven cases of Proto-Slavic, and indeed older forms of Serbo-Croatian itself. However, in modern Shtokavian the locative has almost merged into dative (the only difference is based on accent in some cases), and the other cases can be shown declining; namely:
Like most Slavic languages, there are mostly three genders for nouns: masculine, feminine, and neuter, a distinction which is still present even in the plural (unlike Russian and, in part, the Čakavian dialect). They also have two numbers: singular and plural. However, some consider there to be three numbers (paucal or dual, too), since (still preserved in closely related Slovene) after two (dva, dvije/dve), three (tri) and four (četiri), and all numbers ending in them (e.g. twenty-two, ninety-three, one hundred four, but not twelve through fourteen) the genitive singular is used, and after all other numbers five (pet) and up, the genitive plural is used. (The number one [jedan] is treated as an adjective.) Adjectives are placed in front of the noun they modify and must agree in both case and number with it.
There are seven tenses for verbs: past, present, future, exact future, aorist, imperfect, and pluperfect; and three moods: indicative, imperative, and conditional. However, the latter three tenses are typically used only in Shtokavian writing, and the time sequence of the exact future is more commonly formed through an alternative construction.
In addition, like most Slavic languages, the Shtokavian verb also has one of two aspects: perfective or imperfective. Most verbs come in pairs, with the perfective verb being created out of the imperfective by adding a prefix or making a stem change. The imperfective aspect typically indicates that the action is unfinished, in progress, or repetitive; while the perfective aspect typically denotes that the action was completed, instantaneous, or of limited duration. Some Štokavian tenses (namely, aorist and imperfect) favor a particular aspect (but they are rarer or absent in Čakavian and Kajkavian). Actually, aspects "compensate" for the relative lack of tenses, because aspect of the verb determines whether the act is completed or in progress in the referred time.
Main article: Serbo-Croatian phonology
The Serbo-Croatian vowel system is simple, with only five vowels in Shtokavian. All vowels are monophthongs. The oral vowels are as follows:
|Latin script||Cyrillic script||IPA||Description||English approximation|
|a||а||/a/||open central unrounded||father|
|e||е||/e/||mid front unrounded||den|
|i||и||/i/||close front unrounded||seek|
|o||о||/o/||mid back rounded||lord|
|u||у||/u/||close back rounded||pool|
The vowels can be short or long, but the phonetic quality does not change depending on the length. In a word, vowels can be long in the stressed syllable and the syllables following it, never in the ones preceding it.
The consonant system is more complicated, and its characteristic features are series of affricate and palatal consonants. As in English, voice is phonemic, but aspiration is not.
|Latin script||Cyrillic script||IPA||Description||English approximation|
|r||р||/r/||alveolar trill||rolled (vibrating) r as in carramba|
|v||в||/ʋ/||labiodental approximant||roughly between vortex and war|
|l||л||/l/||alveolar lateral approximant||light|
|lj||љ||/ʎ/||palatal lateral approximant||roughly battalion|
|nj||њ||/ɲ/||palatal nasal||British news or American canyon|
|f||ф||/f/||voiceless labiodental fricative||five|
|z||з||/z/||voiced dental sibilant||zero|
|s||с||/s/||voiceless dental sibilant||some|
|ž||ж||/ʒ/||voiced postalveolar fricative||television|
|š||ш||/ʃ/||voiceless postalveolar fricative||sharp|
|h||х||/x/||voiceless velar fricative||loch|
|c||ц||/t͡s/||voiceless dental affricate||pots|
|dž||џ||/d͡ʒ/||voiced postalveolar affricate||as English jam|
|č||ч||/t͡ʃ/||voiceless postalveolar affricate||as English check|
|đ||ђ||/d͡ʑ/||voiced alveolo-palatal affricate||roughly jeans|
|ć||ћ||/t͡ɕ/||voiceless alveolo-palatal affricate||roughly cheese|
|b||б||/b/||voiced bilabial plosive||book|
|p||п||/p/||voiceless bilabial plosive||top|
|d||д||/d/||voiced dental plosive||dog|
|t||т||/t/||voiceless dental plosive||stop|
|g||г||/ɡ/||voiced velar plosive||good|
|k||к||/k/||voiceless velar plosive||duck|
In consonant clusters all consonants are either voiced or voiceless. All the consonants are voiced if the last consonant is normally voiced or voiceless if the last consonant is normally voiceless. This rule does not apply to approximants – a consonant cluster may contain voiced approximants and voiceless consonants; as well as to foreign words (Washington would be transcribed as VašinGton), personal names and when consonants are not inside of one syllable.
/r/ can be syllabic, playing the role of the syllable nucleus in certain words (occasionally, it can even have a long accent). For example, the tongue-twister navrh brda vrba mrda involves four words with syllabic /r/. A similar feature exists in Czech, Slovak, and Macedonian. Very rarely other sonorants can be syllabic, like /l/ (in bicikl), /ʎ/ (surname Štarklj), /n/ (unit njutn), as well as /m/ and /ɲ/ in slang.
Further information: Pitch accent § Serbo-Croatian, and Serbo-Croatian phonology § Pitch accent
Apart from Slovene, Serbo-Croatian is the only Slavic language with a pitch accent (simple tone) system. This feature is present in some other Indo-European languages, such as Norwegian, Ancient Greek, and Punjabi. Neo-Shtokavian Serbo-Croatian, which is used as the basis for standard Bosnian, Croatian, Montenegrin, and Serbian, has four "accents", which involve either a rising or falling tone on either long or short vowels, with optional post-tonic lengths:
|e||[e]||non-tonic short vowel|
|ē||[eː]||non-tonic long vowel|
|è||[ě]||short vowel with rising tone|
|é||[ěː]||long vowel with rising tone|
|ȅ||[ê]||short vowel with falling tone|
|ȇ||[êː]||long vowel with falling tone|
The tone stressed vowels can be approximated in English with set vs. setting? said in isolation for a short tonic e, or leave vs. leaving? for a long tonic i, due to the prosody of final stressed syllables in English.
General accent rules in the standard language:
There are no other rules for accent placement, thus the accent of every word must be learned individually; furthermore, in inflection, accent shifts are common, both in type and position (the so-called "mobile paradigms"). The second rule is not strictly obeyed, especially in borrowed words.
Comparative and historical linguistics offers some clues for memorising the accent position: If one compares many standard Serbo-Croatian words to e.g. cognate Russian words, the accent in the Serbo-Croatian word will be one syllable before the one in the Russian word, with the rising tone. Historically, the rising tone appeared when the place of the accent shifted to the preceding syllable (the so-called "Neo-Shtokavian retraction"), but the quality of this new accent was different – its melody still "gravitated" towards the original syllable. Most Shtokavian (Neo-Shtokavian) dialects underwent this shift, but Chakavian, Kajkavian and the Old-Shtokavian dialects did not.
Accent diacritics are not used in the ordinary orthography, but only in the linguistic or language-learning literature (e.g. dictionaries, orthography and grammar books). However, there are very few minimal pairs where an error in accent can lead to misunderstanding.
Serbo-Croatian orthography is almost entirely phonetic. Thus, most words should be spelled as they are pronounced. In practice, the writing system does not take into account allophones which occur as a result of interaction between words:
Also, there are some exceptions, mostly applied to foreign words and compounds, that favor morphological/etymological over phonetic spelling:
One systemic exception is that the consonant clusters ds and dš are not respelled as ts and tš (although d tends to be unvoiced in normal speech in such clusters):
Only a few words are intentionally "misspelled", mostly in order to resolve ambiguity:
Main articles: Gaj's Latin alphabet, Serbian Cyrillic alphabet, and Yugoslav Braille
Through history, this language has been written in a number of writing systems:
The oldest texts since the 11th century are in Glagolitic, and the oldest preserved text written completely in the Latin alphabet is Red i zakon sestara reda Svetog Dominika, from 1345. The Arabic alphabet had been used by Bosniaks; Greek writing is out of use there, and Arabic and Glagolitic persisted so far partly in religious liturgies.
The Serbian Cyrillic alphabet was revised by Vuk Stefanović Karadžić in the 19th century.
The Croatian Latin alphabet (Gajica) followed suit shortly afterwards, when Ljudevit Gaj defined it as standard Latin with five extra letters that had diacritics, apparently borrowing much from Czech, but also from Polish, and inventing the unique digraphs ⟨lj⟩, ⟨nj⟩ and ⟨dž⟩. These digraphs are represented as ⟨ļ⟩, ⟨ń⟩ and ⟨ǵ⟩ respectively in the Rječnik hrvatskog ili srpskog jezika, published by the former Yugoslav Academy of Sciences and Arts in Zagreb. The latter digraphs, however, are unused in the literary standard of the language. All in all, this makes Serbo-Croatian the only Slavic language to officially use both the Latin and Cyrillic scripts, albeit the Latin version is more commonly used.
In both cases, spelling is phonetic and spellings in the two alphabets map to each other one-to-one:
|Latin collation order||Cyrillic|
The digraphs Lj, Nj and Dž represent distinct phonemes and are considered to be single letters. In crosswords, they are put into a single square, and in sorting, lj follows l and nj follows n, except in a few words where the individual letters are pronounced separately. For instance, nadživ(j)eti "to outlive" is composed of the prefix nad- "out, over" and the verb živ(j)eti "to live". The Cyrillic alphabet avoids such ambiguity by providing a single letter for each phoneme: наджив(ј)ети.
Đ used to be commonly written as Dj on typewriters, but that practice led to too many ambiguities. It is also used on car license plates. Today Dj is often used again in place of Đ on the Internet as a replacement due to the lack of installed Serbo-Croat keyboard layouts.
Serbian, Bosnian and Montenegrin standards officially use both alphabets, while Croatian uses the Latin only.
Latin script has been rising in popularity in Serbia with the advent of the digital age and Internet in Serbia, whether due to restraints (Cyrillic letters use up twice the space and therefore cost on SMS), accessibility (intention to be readable internationally, as Latin is taught in all four countries speaking the language) or ease of use. This has been perceived by Serbian government officials as a suppression and threat for existence of the national script that is Cyrillic, with the Ministry of Culture and Information of Serbia pushing for more tight language laws on top of those stipulated by the existing Constitution.
Montenegrin alphabet, adopted in 2009, provides replacements of sj and zj with digraphs ⟨ś⟩ and ⟨ź⟩ in both Latin and Cyrillic, but they remain largely unused, even by the Parliament of Montenegro which introduced them.
Unicode has separate characters for the digraphs lj (Ǉ, ǈ, ǉ), nj (Ǌ, ǋ, ǌ) and dž (Ǆ, ǅ, ǆ).
Main article: Dialects of Serbo-Croatian
See also: South Slavic dialect continuum
South Slavic historically formed a dialect continuum, i.e. each dialect has some similarities with the neighboring one, and differences grow with distance. However, migrations from the 16th to 18th centuries resulting from the spread of Ottoman Empire on the Balkans have caused large-scale population displacement that broke the dialect continuum into many geographical pockets. Migrations in the 20th century, primarily caused by urbanization and wars, also contributed to the reduction of dialectal differences.
The primary dialects are named after the most common question word for what: Shtokavian uses the pronoun što or šta, Chakavian uses ča or ca, Kajkavian (kajkavski), kaj or kej. In native terminology they are referred to as nar(j)ečje, which would be equivalent of "group of dialects", whereas their many subdialects are referred to as dijalekti "dialects" or govori "speeches".
The pluricentric Serbo-Croatian standard language and all four contemporary standard variants are based on the Eastern Herzegovinian subdialect of Neo-Shtokavian. Other dialects are not taught in schools or used by the state media. The Torlakian dialect is often added to the list, though sources usually note that it is a transitional dialect between Shtokavian and the Bulgaro-Macedonian dialects.
The Serbo-Croatian dialects differ not only in the question word they are named after, but also heavily in phonology, accentuation and intonation, case endings and tense system (morphology) and basic vocabulary. In the past, Chakavian and Kajkavian dialects were spoken on a much larger territory, but have been replaced by Štokavian during the period of migrations caused by Ottoman Turkish conquest of the Balkans in the 15th and the 16th centuries. These migrations caused the koinéisation of the Shtokavian dialects, that used to form the West Shtokavian (more closer and transitional towards the neighbouring Chakavian and Kajkavian dialects) and East Shtokavian (transitional towards the Torlakian and the whole Bulgaro-Macedonian area) dialect bundles, and their subsequent spread at the expense of Chakavian and Kajkavian. As a result, Štokavian now covers an area larger than all the other dialects combined, and continues to make its progress in the enclaves where non-literary dialects are still being spoken.
The differences among the dialects can be illustrated on the example of Schleicher's fable. Diacritic signs are used to show the difference in accents and prosody, which are often quite significant, but which are not reflected in the usual orthography.
Main article: yat
A series of isoglosses crosscuts the main dialects. The modern reflexes of the long Common Slavic vowel jat, usually transcribed *ě, vary by location as /i/, /e/, and /ije/ or /je/. Local varieties of the dialects are labeled Ikavian, Ekavian, and Ijekavian, respectively, depending on the reflex. The long and short jat is reflected as long or short */i/ and /e/ in Ikavian and Ekavian, but Ijekavian dialects introduce a ije/je alternation to retain a distinction.
Standard Croatian and Bosnian are based on Ijekavian, whereas Serbian uses both Ekavian and Ijekavian forms (Ijekavian for Bosnian Serbs, Ekavian for most of Serbia). Influence of standard language through state media and education has caused non-standard varieties to lose ground to the literary forms.
The jat-reflex rules are not without exception. For example, when short jat is preceded by r, in most Ijekavian dialects developed into /re/ or, occasionally, /ri/. The prefix prě- ("trans-, over-") when long became pre- in eastern Ijekavian dialects but to prije- in western dialects; in Ikavian pronunciation, it also evolved into pre- or prije- due to potential ambiguity with pri- ("approach, come close to"). For verbs that had -ěti in their infinitive, the past participle ending -ěl evolved into -io in Ijekavian Neo-Štokavian.
The following are some examples:
|beautiful||*lěp||lep||lip||lijep||long ě → ije|
|faith||*věra||vera||vira||vjera||short ě → je|
|pr + long ě → prije|
|times||*vrěmena||vremena||vrimena||vremena||r + short ě → re|
|heat||*grějati||grejati||grijati||grijati||r + short ě → ri|
|saw||*viděl||video||vidio||vidio||ěl → io|
|village||*selo||selo||selo||selo||e in root, not ě|
The nature and classification of Serbo-Croatian has been the subject of long-standing sociolinguistic debate. The question is whether Serbo-Croatian should be called a single language or a cluster of closely related languages.
See also: Declaration on the Common Language
Enisa Kafadar argues that there is only one Serbo-Croatian language with several varieties. This has made it possible to include all four varieties in new grammars of the language. Daniel Bunčić concludes that it is a pluricentric language, with four standard variants spoken in Serbia, Croatia, Montenegro, and Bosnia-Herzegovina. The mutual intelligibility between their speakers "exceeds that between the standard variants of English, French, German, or Spanish". "There is no doubt of the near 100% mutual intelligibility of (standard) Croatian and (standard) Serbian, as is obvious from the ability of all groups to enjoy each others' films, TV and sports broadcasts, newspapers, rock lyrics etc." Other linguists have argued that the differences between the variants of Serbo-Croatian are less significant than those between the variants of English, German, Dutch, and Hindustani.
Among pluricentric languages, Serbo-Croatian was the only one with a pluricentric standardisation within one state. The dissolution of Yugoslavia has made Serbo-Croatian even more of a typical pluricentric language, since the variants of other pluricentric languages are also spoken in different states.
As in other pluricentric languages, all Serbo-Croatian standard varieties are based on the same dialect (the Eastern Herzegovinian subdialect of the Shtokavian dialect) and consequently, according to the sociolinguistic definitions, constitute a single pluricentric language (and not, for example, several Ausbau languages). According to linguist John Bailyn, "An examination of all the major 'levels' of language shows that BCS is clearly a single language with a single grammatical system."
In 2017, numerous prominent writers, scientists, journalists, activists and other public figures from Croatia, Bosnia-Herzegovina, Montenegro and Serbia signed the Declaration on the Common Language, which states that in Croatia, Serbia, Bosnia-Herzegovina and Montenegro a common polycentric standard language is used, consisting of several standard varieties, such as German, English or Spanish.
The use of Serbo-Croatian as a linguistic label has been the subject of long-standing controversy. Wayles Browne calls it a "term of convenience" and notes the difference of opinion as to whether it comprises a single language or a cluster of languages. Ronelle Alexander refers to the national standards as three separate languages, but also notes that the reasons for this are complex and generally non-linguistic. She calls BCS (her term for Serbo-Croatian) a single language for communicative linguistic purposes, but three separate languages for symbolic non-linguistic purposes.
The current Serbian constitution of 2006 refers to the official language as Serbian, while the Montenegrin constitution of 2007 proclaimed Montenegrin as the primary official language, but also grants other languages and dialects the right of official use.
The International Organization for Standardization (ISO) has specified different Universal Decimal Classification (UDC) numbers for Croatian (UDC 862, abbreviation hr) and Serbian (UDC 861, abbreviation sr), while the cover term Serbo-Croatian is used to refer to the combination of original signs (UDC 861/862, abbreviation sh). Furthermore, the ISO 639 standard designates the Bosnian language with the abbreviations bos and bs.
While it operated, the International Criminal Tribunal for the former Yugoslavia, which had English and French as official languages, translated court proceedings and documents into what it referred to as "Bosnian/Croatian/Serbian", usually abbreviated as BCS. Translators were employed from all regions of the former Yugoslavia and all national and regional variations were accepted, regardless of the nationality of the person on trial (sometimes against a defendant's objections), on the grounds of mutual intelligibility.
For utilitarian purposes, Serbo-Croatian is often called "naš jezik" ("our language") or "naški" (sic. "ourish" or "ourian") by native speakers. This term is frequently used to describe Serbo-Croatian by those who wish to avoid nationalistic and linguistic discussions. Native speakers traditionally describe their language as "jedan ali ne jedinstven"—"one but not uniform".
In 2021, the Board for Standardization of the Serbian Language issued an opinion that Serbo-Croatian is one language, and that it should be referred to as "Serbian language", while "Croatian", "Bosnian" and "Montenegrin" are to be considered merely local names for Serbian language. This opinion was widely criticized by Croatian government and representatives of the Croatian minority in Serbia. Serbian linguist Ranko Bugarski called this opinion "absurd" and "legacy of the 19th century linguistics". He said that Serbo-Croatian should be considered one language in a scientific sense under the "Serbo-Croatian" label, but four different languages in an administrative sense. Legally, Croatian, Bosnian and Montenegrin are all officially recognized minority languages in Serbia. the Serbian Government also officially recognized Bunjevac language as a standard minority language in 2018 and was approved by the Serbian Ministry of Education for learning in schools.
The opinion of the majority of Croatian linguists is that there has never been a Serbo-Croatian language, but two different standard languages that overlapped sometime in the course of history. However, Croatian linguist Snježana Kordić has been leading an academic discussion on this issue in the Croatian journal Književna republika from 2001 to 2010. In the discussion, she shows that linguistic criteria such as mutual intelligibility, the huge overlap in the linguistic system, and the same dialect basis of the standard language are evidence that Croatian, Serbian, Bosnian and Montenegrin are four national variants of the pluricentric Serbo-Croatian language. Igor Mandić states: "During the last ten years, it has been the longest, the most serious and most acrid discussion (…) in 21st-century Croatian culture". Inspired by that discussion, a monograph on language and nationalism has been published.
The view of the majority of Croatian linguists that there is no single Serbo-Croatian language but several different standard languages has been sharply criticized by German linguist Bernhard Gröschel in his monograph Serbo-Croatian Between Linguistics and Politics.
A more detailed overview, incorporating arguments from Croatian philology and contemporary linguistics, would be as follows:
The linguistic debate in this region is more about politics than about linguistics per se.
The topic of language for writers from Dalmatia and Dubrovnik prior to the 19th century made a distinction only between speakers of Italian or Slavic, since those were the two main groups that inhabited Dalmatian city-states at that time. Whether someone spoke Croatian or Serbian was not an important distinction then, as the two languages were not distinguished by most speakers.
However, most intellectuals and writers from Dalmatia who used the Štokavian dialect and practiced the Catholic faith saw themselves as part of a Croatian nation as far back as the mid-16th to 17th centuries, some 300 years before Serbo-Croatian ideology appeared. Their loyalty was first and foremost to Catholic Christendom, but when they professed an ethnic identity, they referred to themselves as "Slovin" and "Illyrian" (a sort of forerunner of Catholic baroque pan-Slavism) and Croat – these 30-odd writers over the span of c. 350 years always saw themselves as Croats first and never as part of a Serbian nation. It should also be noted that, in the pre-national era, Catholic religious orientation did not necessarily equate with Croat ethnic identity in Dalmatia. A Croatian follower of Vuk Karadžić, Ivan Broz, noted that for a Dalmatian to identify oneself as a Serb was seen as foreign as identifying oneself as Macedonian or Greek. Vatroslav Jagić pointed out in 1864:
As I have mentioned in the preface, history knows only two national names in these parts—Croatian and Serbian. As far as Dubrovnik is concerned, the Serbian name was never in use; on the contrary, the Croatian name was frequently used and gladly referred to ...
At the end of the 15th century [in Dubrovnik and Dalmatia], sermons and poems were exquisitely crafted in Croatian by those men whose names are widely renowned by deep learning and piety.— The History of the Croatian Language, Zagreb, 1864.
On the other hand, the opinion of Jagić from 1864 is argued not to have firm grounds. When Jagić says "Croatian", he refers to a few cases referring to the Dubrovnik vernacular as ilirski (Illyrian). This was a common name for all Slavic vernaculars in Dalmatian cities among the Roman inhabitants. In the meantime, other written monuments are found that mention srpski, lingua serviana (= Serbian), and some that mention Croatian. By far the most competent Serbian scientist[editorializing] on the Dubrovnik language issue, Milan Rešetar, who was born in Dubrovnik himself, wrote behalf of language characteristics: "The one who thinks that Croatian and Serbian are two separate languages must confess that Dubrovnik always (linguistically) used to be Serbian."
Finally, the former medieval texts from Dubrovnik and Montenegro dating before the 16th century were neither true Štokavian nor Serbian, but mostly specific a Jekavian-Čakavian that was nearer to actual Adriatic islanders in Croatia.
Nationalists have conflicting views about the language(s). The nationalists among the Croats conflictingly claim either that they speak an entirely separate language from Serbs and Bosniaks or that these two peoples have, due to the longer lexicographic tradition among Croats, somehow "borrowed" their standard languages from them. Bosniak nationalists claim that both Croats and Serbs have "appropriated" the Bosnian language, since Ljudevit Gaj and Vuk Karadžić preferred the Neo-Štokavian Ijekavian dialect, widely spoken in Bosnia and Herzegovina, as the basis for language standardization, whereas the nationalists among the Serbs claim either that any divergence in the language is artificial, or claim that the Štokavian dialect is theirs and the Čakavian Croats'— in more extreme formulations Croats have "taken" or "stolen" their language from the Serbs.
Proponents of unity among Southern Slavs claim that there is a single language with normal dialectal variations. The term "Serbo-Croatian" (or synonyms) is not officially used in any of the successor countries of former Yugoslavia.
In Serbia, the Serbian standard has an official status countrywide, while both Serbian and Croatian are official in the province of Vojvodina. A large Bosniak minority is present in the southwest region of Sandžak, but the "official recognition" of Bosnian is moot. Bosnian is an optional course in first and second grade of the elementary school, while it is also in official use in the municipality of Novi Pazar. However, its nomenclature is controversial, as there is incentive that it is referred to as "Bosniak" (bošnjački) rather than "Bosnian" (bosanski) (see Bosnian language#Controversy and recognition for details).
Croatian is the official language of Croatia, while Serbian is also official in municipalities with significant Serb population.
In Bosnia and Herzegovina, all three standard languages are recorded as official. Confrontations have on occasion been absurd. The academic Muhamed Filipović, in an interview to Slovenian television, told of a local court in a Croatian district requesting a paid translator to translate from Bosnian to Croatian before the trial could proceed.
The International Criminal Tribunal for the former Yugoslavia referred to the language as "Bosnian/Croatian/Serbian", usually abbreviated as BCS. Translators were employed from all regions of the former Yugoslavia and all national and regional variations were accepted, regardless of the nationality of the person on trial (sometimes against a defendant's objections), on the grounds of mutual intelligibility.
Since the year 2000, ISO 639 classification recognizes Serbo-Croatian only as a 'macrolanguage', having removed its original codes from ISO 639-1 and ISO 639-2 standards. That left the ISO 639-3 'macrolanguage' (a book-keeping device in the ISO 639-3 standard to keep track of which ISO 639-3 codes correspond with which ISO 639-2 codes) stranded without a corresponding ISO 639-2 code.
Article 1 of the Universal Declaration of Human Rights in Serbo-Croatian, written in the Latin alphabet:
Article 1 of the Universal Declaration of Human Rights in Serbo-Croatian, written in the Cyrillic script:
Article 1 of the Universal Declaration of Human Rights in English:
Serbo-Croatian, which features four ethnic variants: Serbian, Croatian, Bosnian, and Montenegrin
Slova ś i ź odnedavno ne koristi ni Skupština Crne Gore, prva i jedina državna institucija koja ih je nakon reforme crnogorskog jezika koristila u zvaničnoj komunikaciji.[The letters ś and ź have recently not been used even by the Parliament of Montenegro, the first and only state institution that used them in official communication after the reform of the Montenegrin language.]
The debate about the status of the Serbo-Croatian language and its varieties has recently shifted (again) towards a position which looks at the internal variation within Serbo-Croatian through the prism of linguistic pluricentricity
Obwohl das Kroatische sich in den letzten Jahren in einigen Gebieten, vor allem jedoch auf lexikalischer Ebene, verändert hat, sind diese Änderungen noch nicht bedeutend genug, dass der Terminus Ausbausprache gerechtfertigt wäre. Ausserdem können sich Serben, Kroaten, Bosnier und Montenegriner immer noch auf ihren jeweiligen Nationalsprachen unterhalten und problemlos verständigen. Nur schon diese Tatsache zeigt, dass es sich immer noch um eine polyzentrische Sprache mit verschiedenen Varietäten handelt.
The official language in Montenegro shall be Montenegrin.[…]Serbian, Bosniac, Albanian and Croatian shall also be in the official use.