This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: "Italian phonology" – news · newspapers · books · scholar · JSTOR (July 2012) (Learn how and when to remove this template message)

The phonology of Italian describes the sound system—the phonology and phonetics—of Standard Italian and its geographical variants.


Consonant phonemes
Labial Dental/
Nasal m n   ɲ
Stop p b t d k ɡ
Affricate t͡s d͡z t͡ʃ d͡ʒ
Fricative f v s z ʃ (ʒ)
Approximant   j w
Lateral l   ʎ
Trill r



Vowels of Italian. From Rogers & d'Arcangeli (2004:119).
Vowels of Italian. From Rogers & d'Arcangeli (2004:119).
Vowel phonemes
Front Central Back
Close i u
Close-mid e o
Open-mid ɛ ɔ
Open a

In Italian there is no phonemic distinction between long and short vowels, but vowels in stressed open syllables, unless word-final, are long at the end of the intonational phrase (including isolated words) or when emphasized.[23][24] Adjacent identical vowels found at morpheme boundaries are not resyllabified, but pronounced separately ("quickly rearticulated"), and they might be reduced to a single short vowel in rapid speech.[25]

Although Italian contrasts close-mid (/e, o/) and open-mid (/ɛ, ɔ/) vowels in stressed syllables, the distinction is neutralised in unstressed position[23] in which only the close-mid vowels occur. The height of such vowels in unstressed position is context-sensitive; they are somewhat lowered ([, ]) in the vicinity of more open vowels.[26] The distinction between close-mid and open-mid vowels is lost entirely in a few Southern varieties of Regional Italian, especially in Northern Sicily (e.g. Palermo), where they are realized as open-mid [ɛ, ɔ], as well as in some Northern varieties (in particular in Piedmont), where they are realized as mid [, ].

Word-final stressed /ɔ/ is found in a small number of words: però, ciò, paltò.[27] However, as a productive morpheme, it marks the first person singular of all future tense verbs (e.g. dormirò 'I will sleep') and the third person singular preterite of first conjugation verbs (parlò 's/he spoke', but credé 's/he believed', dormì 's/he slept'). Word-final unstressed /u/ is rare, [28] found in onomatopoeic terms (babau),[29] loanwords (guru),[30] and place or family names derived from the Sardinian language (Gennargentu,[31] Porcu).[32]

When the last phoneme of a word is an unstressed vowel and the first phoneme of the following word is any vowel, the former vowel tends to become non-syllabic. This phenomenon is called synalepha and should be taken into account when counting syllables, e.g. in poetry.

In addition to monophthongs, Italian has diphthongs, which, however, are both phonemically and phonetically simply combinations of the other vowels. Some are very common (e.g. /ai, au/), others are rarer (e.g. /ɛi/) and some never occur within native Italian words (e.g. /ou/). None of the diphthongs is, however, considered to have distinct phonemic status since their constituents do not behave differently from how they occur in isolation, unlike the diphthongs in other languages like English and German. Grammatical tradition distinguishes 'falling' from 'rising' diphthongs, but since rising diphthongs are composed of one semiconsonantal sound [j] or [w] and one vowel sound, they are not actually diphthongs. The practice of referring to them as 'diphthongs' has been criticised by phoneticians like Luciano Canepari.[19]


A visual representation of Italian phonotactics.
A visual representation of Italian phonotactics.


Italian allows up to three consonants in syllable-initial position, though there are limitations:[33]


As an onset, the cluster /s/ + voiceless consonant is inherently unstable. Phonetically, word-internal s+C normally syllabifies as [s.C]: [ˈrɔs.po] rospo 'toad', [tras.ˈteː] Trastevere (neighborhood of Rome).[34][35] Phonetic syllabification of the cluster also occurs at word boundaries if a vowel precedes it without pause, e.g. [las.ˈtɔː.rja] la storia 'the history', implying the same syllable break at the structural level, /sˈtɔrja/,[36] thus always latent due to the extrasyllabic /s/, but unrealized phonetically unless a vowel precedes.[37] A competing analysis accepts that while the syllabification /s.C/ is accurate historically, modern retreat of i-prosthesis before word initial /s/+C (e.g. erstwhile con isforzo 'with effort' has generally given way to con sforzo) suggests that the structure is now underdetermined, with occurrence of /s.C/ or /.sC/ variable "according to the context and the idiosyncratic behaviour of the speakers."[38]


The last combination is however rare and one of the approximants is often vocalised, e.g. quieto /kwiˈɛto,ˈkwjɛto/, continuiamo /kontinuˈjamo, kontinwiˈamo, ((kontiˈnwjamo))/


The nucleus is the only mandatory part of a syllable (for instance, a 'to, at' is a word) and must be a vowel or a diphthong. In a falling diphthong the most common second elements are /i̯/ or /u̯/ but other combinations such as idea /iˈdɛa̯/, trae /ˈtrae̯/ may also be interpreted as diphthongs.[19] Combinations of /j w/ with vowels are often labelled diphthongs, allowing for combinations of /j w/ with falling diphthongs to be called triphthongs. One view holds that it is more accurate to label /j w/ as consonants and /jV wV/ as consonant-vowel sequences rather than rising diphthongs. In that interpretation, Italian has only falling diphthongs (phonemically at least, cf. Synaeresis) and no triphthongs.[19]

j a j a a a
ɛ ɛ ɛ -
i̯ɛ -
ɔ ɔ ɔ ɔ
u̯ɔ u̯ɔ u̯ɔ
(k/g)w a (k/g)w a a -
ɛ ɛ ɛ ɛ
i̯ɛ i̯ɛ
ɔ ɔ ɔ -
- -
j e e o
o o (u̯o)
u u u
(k/g)w e e e
o o (i̯e)
i i i
(k/g)w j a -
ɛ -
- -
- -
o -


Italian permits a small number of coda consonants. Outside of loanwords,[39] the permitted consonants are:

There are also restrictions in the types of syllables that permit consonants in the syllable coda. Krämer (2009) explains that neither geminates, nor coda consonants with "rising sonority" can follow falling diphthongs. However, "rising diphthongs" (or sequences of an approximant and a following vowel) may precede clusters with falling sonority, particularly those that stem historically from an obstruent+liquid onset.[42] For example:[43]

Syntactic gemination

Main article: Syntactic gemination

Word-initial consonants are geminated after certain vowel-final words in the same prosodic unit. There are two types of triggers of initial gemination: some unstressed particles, prepositions, and other monosyllabic words, and any oxytonic polysyllabic word.[35] As an example of the first type, casa ('house') is pronounced [ˈkaːza] but a casa ('homeward') is pronounced [akˈkaːza]. This is not a purely phonological process, as no gemination is cued by the la in la casa 'the house' [laˈkaːza], and there is nothing detectable in the structure of the preposition a to account for the gemination. This type normally originates in language history: modern a, for example, derives from Latin AD, and today's geminate in [akˈkaːza] is a continuation of what was once a simple assimilation. Gemination cued by final stressed vowels, however, is transparently phonological. Final stressed vowels are short by nature, if a consonant follows a short stressed vowel the syllable must be closed, thus the consonant following the final stressed vowel is drawn to lengthen: parlò portoghese [parˈlɔpportoˈɡeːze] 's/he spoke Portuguese' vs. parla portoghese [ˈparlaportoˈɡeːze] 's/he speaks Portuguese'.

To summarize, syntactic gemination occurs in standard Italian mainly in the following two cases:[44]

Syntactic gemination is the normal native pronunciation in Central Italy (both "stress-induced" and "lexical") and Southern Italy (only "lexical"), including Sicily and Corsica (France).

In Northern Italy and Sardinia, speakers use it inconsistently because the feature is not present in the dialectal substratum and is not usually shown in the written language unless a new word is produced by the fusion of the two: "chi sa"-> chissà ("who knows" in the sense of goodness knows).

Regional variation

Main article: Regional Italian

The above IPA symbols and description refer to standard Italian, based on a somewhat idealized version of the Tuscan-derived national language. As is common in many cultures, this single version of the language was pushed as neutral, proper, and eventually superior, leading to some stigmatization of varying accents. Television news anchors and other high-profile figures had to put aside their regional Italian when in the public sphere. However, in more recent years the enforcement of this standard has fallen out of favor in Italy, and news reporters, actors, and the like are now more free to deliver their words in their native regional variety of Italian, which appeals to the Italian population's range of linguistic diversity. The variety is still not represented in its wholeness and accents from the South are maybe to be considered less popular, except in shows set in the South and in comedy, a field in which Naples, Sicily and the South in general have always been present. Though it still represents the basics for the standard variety, the loosened restrictions have led to Tuscan being seen for what it is, just one dialect among many with its own regional peculiarities and qualities, many of which are shared with Umbria, Southern Marche and Northern Lazio.

The various Tuscan, Corsican and Central Italian dialects are, to some extent, the closest ones to Standard Italian in terms of linguistic features, since the latter is based on a somewhat polished form of Florentine.

Phonological development

Main article: Phonological development

Very little research has been done on the earliest stages of phonological development in Italian.[47] This article primarily describes phonological development after the first year of life. See the main article on phonological development for a description of first year stages. Many of the earliest stages are thought to be universal to all infants.

Phoneme inventory

Word-final consonants are rarely produced during the early stages of word production. Consonants are usually found in word-initial position, or in intervocalic position.[48]

17 months

Most consonants are word-initial: They are the stops /p/, /b/, /t/, and /k/ and the nasal /m/. A preference for a front place of articulation is present.

21 months

More phones now appear in intervocalic contexts. The additions to the phonetic inventory are the voiced stop /d/, the nasal /n/, the voiceless affricate /t͡ʃ/, and the liquid /l/.[48]

24 months

The fricatives /f/, /v/, and /s/ are added, primarily at the intervocalic position.[48]

27 months

Approximately equal numbers of phones are now produced in word-initial and intervocalic position. Additions to the phonetic inventory are the voiced stop /ɡ/ and the consonant cluster /kw/. While the word-initial inventory now tends to have all the phones of the adult targets (adult production of the child's words), the intervocalic inventory tends to still be missing four consonants or consonant clusters of the adult targets: /f/, /d͡ʒ/, /r/, and /st/.[48]

Stops are the most common manner of articulation at all stages and are produced more often than they are present in the target words at around 18 months. Gradually this frequency decreases to almost target-like frequency by around 27 months. The opposite process happens with fricatives, affricates, laterals and trills. Initially, the production of these phonemes is significantly less than what is found in the target words and the production continues to increases to target-like frequency. Alveolars and bilabials are the two most common places of articulation, with alveolar production steadily increasing after the first stage and bilabial production gently decreasing. Labiodental and postalveolar production increases throughout development, while velar production decreases.[49]


Syllable structures

6–10 months

Babbling becomes distinct from previous, less structured vocal play. Initially, syllable structure is limited to CVCV, called reduplicated babbling. At this stage, children’s vocalizations have a weak relation to adult Italian and the Italian lexicon.[50]

11–14 months

The most-used syllable type changes as children age, and the distribution of syllables takes on increasingly Italian characteristics. This ability significantly increases between the ages of 11 and 12 months, 12 and 13 months, and 13 and 14 months.[50] Consonant clusters are still absent. Children's first ten words appear around month 12, and take CVCV format (e.g. mamma 'mom', papà 'dad').[51]

18–24 months

Reduplicated babbling is replaced by variegated babbling, producing syllable structures such as C1VC2V (e.g. cane 'dog', topo 'mouse'). Production of trisyllabic words begins (e.g. pecora 'sheep', matita 'pencil').[51] Consonant clusters are now present (e.g. bimba 'female child', venti 'twenty'). Ambient language plays an increasingly significant role as children begin to solidify early syllable structure. Syllable combinations that are infrequent in the Italian lexicon, such as velar-labial sequences (e.g. capra 'goat' or gamba 'leg') are infrequently produced correctly by children, and are often subject to consonant harmony.[52]

Stress patterns

In Italian, stress is lexical, meaning it is word-specific and partly unpredictable. Penultimate stress (primary stress on the second-to-last syllable) is also generally preferred.[53][54] This goal, acting simultaneously with the child's initial inability to produce polysyllabic words, often results in weak-syllable deletion. The primary environment for weak-syllable deletion in polysyllabic words is word-initial, as deleting word-final or word-medial syllables would interfere with the penultimate stress pattern heard in ambient language.[55]

Phonological awareness

Children develop syllabic segmentation awareness earlier than phonemic segmentation awareness. In earlier stages, syllables are perceived as a separate phonetic unit, while phonemes are perceived as assimilated units by coarticulation in spoken language. By first grade, Italian children are nearing full development of segmentation awareness on both syllables and phonemes. Compared to those children whose mother tongue exhibits closed syllable structure (CVC,CCVC, CVCC, etc.), Italian-speaking children develop this segmentation awareness earlier, possibly due to its open syllable structure (CVCV, CVCVCV, etc.).[56] Rigidity in Italian (shallow orthography and open syllable structure) makes it easier for Italian-speaking children to be aware of those segments.[57]

Sample texts

Provided here is a rendition of the Bible, Luke 2, 1–7, as read by a native Italian speaker from Milan. As a northerner, his pronunciation lacks syntactic doubling ([ˈfu ˈfatto] instead of [ˈfu fˈfatto]) and intervocalic [s] ([ˈkaːza] instead of [ˈkaːsa]). The speaker realises /r/ as [ʋ] in some positions.

2:1 In quei giorni, un decreto di Cesare Augusto ordinava che si facesse un censimento di tutta la terra.
2 Questo primo censimento fu fatto quando Quirino era governatore della Siria.
3 Tutti andavano a farsi registrare, ciascuno nella propria città.
4 Anche Giuseppe, che era della casa e della famiglia di Davide, dalla città di Nazaret e dalla Galilea si recò in Giudea nella città di Davide, chiamata Betlemme,
5 per farsi registrare insieme a Maria, sua sposa, che era incinta.
6 Proprio mentre si trovavano lì, venne il tempo per lei di partorire.
7 Mise al mondo il suo primogenito, lo avvolse in fasce e lo depose in una mangiatoia, poiché non c'era posto per loro nella locanda.

The differences in pronunciation are underlined in the following transcriptions; the velar [ŋ] is an allophone of /n/ and the long vowels are allophones of the short vowels, but are shown for clarity.

A rough transcription of the audio sample is:

2:1 [iŋ ˈkwɛi ˈdʒorni un deˈkreːto di ˈtʃeːzare auˈɡusto ordiˈnaːva ke si faˈtʃɛsse un tʃensiˈmento di ˈtutta la ˈtɛrra
2 ˈkwɛsto ˈpriːmo tʃensiˈmento fu ˈfatto ˈkwando kwiˈriːno ˈeːra ɡovernaˈtoːre dɛlla ˈsiːrja
3 ˈtutti anˈdaːvano a ˈfarsi redʒiˈstraːre tʃaˈskuːno nɛlla ˈprɔːprja tʃitˈta
4 ˈaŋke dʒuˈzɛppe ke ˈeːra dɛlla ˈkaːza e dɛlla faˈmiʎʎa di ˈdaːvide dalla tʃitˈta di ˈnaddzaret e dalla ɡaliˈleːa si reˈkɔ in dʒuˈdeːa nɛlla tʃitˈta di ˈdaːvide kjaˈmaːta beˈtlɛmme
5 per ˈfarsi redʒiˈstraːre inˈsjeːme a maˈriːa swa ˈspoːza ke ˈeːra inˈtʃinta
6 ˈprɔːprjo ˈmentre si troˈvaːvano ˈli ˈvɛnne il ˈtempo per ˈlɛi di partoˈriːre
7 ˈmiːze al ˈmondo il swo primoˈdʒeːnito, lo avˈvɔlse iɱ ˈfaːʃe e lo deˈpoːze in ˈuːna mandʒaˈtɔːja poiˈke non ˈtʃeːra ˈpɔsto per ˈloːro nɛlla loˈkanda]

The Standard Italian pronunciation of the text is:

2:1 [iŋ ˈkwei ˈdʒorni un deˈkreːto di ˈtʃeːzare auˈɡusto ordiˈnaːva ke ssi faˈtʃesse un tʃensiˈmento di ˈtutta la ˈtɛrra
2 ˈkwesto ˈpriːmo tʃensiˈmento fu fˈfatto ˈkwando kwiˈriːno ˈɛːra ɡovernaˈtoːre della ˈsiːrja
3 ˈtutti anˈdaːvano a fˈfarsi redʒiˈstraːre tʃaˈskuːno nella ˈprɔːprja tʃitˈta
4 ˈaŋke dʒuˈzɛppe ke ˈɛːra della ˈkaːsa e ddella faˈmiʎʎa di ˈdaːvide dalla tʃitˈta ddi ˈnaddzaret e ddalla ɡaliˈlɛːa si reˈkɔ in dʒuˈdɛːa nella tʃitˈta ddi ˈdaːvide kjaˈmaːta beˈtlɛmme
5 per ˈfarsi redʒiˈstraːre inˈsjɛːme a mmaˈriːa ˈsa ˈspɔːza ke ˈɛːra inˈtʃinta
6 ˈprɔːprjo ˈmentre si troˈvaːvano ˈli ˈvenne il ˈtɛmpo per ˈlɛi di partoˈriːre
7 ˈmiːse al ˈmondo il ˈso primoˈdʒɛːnito, lo avˈvɔlse iɱ ˈfaʃʃe e llo deˈpoːse in ˈuːna mandʒaˈtoːja poiˈke nnon ˈtʃɛːra ˈposto per ˈloːro nella loˈkanda]

See also


  1. ^ Hall (1944), pp. 77–78.
  2. ^ a b c d Rogers & d'Arcangeli (2004), p. 117.
  3. ^ Hall (1944), p. 78.
  4. ^ a b c Bertinetto & Loporcaro (2005), p. 132.
  5. ^ a b Canepari (1992), p. 62.
  6. ^ a b c d Canepari (1992), pp. 68, 75–76.
  7. ^ Canepari (1992), pp. 57, 84, 88–89.
  8. ^ Bertinetto & Loporcaro (2005), p. 133.
  9. ^ Canepari (1992), pp. 58, 88–89.
  10. ^ Bertinetto & Loporcaro (2005), p. 134.
  11. ^ Canepari (1992), pp. 57–59, 88–89.
  12. ^ a b Bertinetto & Loporcaro (2005), pp. 134–135.
  13. ^ Canepari (1992), p. 59.
  14. ^ Canepari (1992), p. 58.
  15. ^ Recasens (2013), p. 13.
  16. ^ "(...) in a large number of Italian accents, there is considerable friction involved in the pronunciation of [ʎ], creating a voiced palatal lateral fricative (for which there is no established IPA symbol)" Ashby (2011:64).
  17. ^ a b Ladefoged & Maddieson (1996), p. 221.
  18. ^ Rogers & d'Arcangeli (2004), p. 118.
  19. ^ a b c d e Luciano Canepari, A Handbook of Pronunciation, chapter 3: «Italian».
  20. ^ Romano, Antonio. "A preliminary contribution to the study of phonetic variation of /r/ in Italian and Italo-Romance." Rhotics. New data and perspectives (Proc. of’r-atics-3, Libera Università di Bolzano (2011): 209–226, pp. 213–214.
  21. ^ "Dizionario d'ortografia e di pronunzia".
  22. ^ "Dizionario d'ortografia e di pronunzia".
  23. ^ a b Rogers & d'Arcangeli (2004), p. 119.
  24. ^ Bertinetto & Loporcaro (2005), p. 136.
  25. ^ Bertinetto & Loporcaro (2005), p. 137.
  26. ^ Bertinetto & Loporcaro (2005), pp. 137–138.
  27. ^ "paltò". (in Italian). Vocabolario – Treccani.
  28. ^ Bertinetto & Loporcaro (2005), p. 138.
  29. ^ "Dizionario d'ortografia e di pronunzia".
  30. ^ "Dizionario d'ortografia e di pronunzia".
  31. ^ "Dizionario d'ortografia e di pronunzia".
  32. ^ "Dizionario d'ortografia e di pronunzia".
  33. ^ Hall (1944), p. 79.
  34. ^ "Sibilanti in "Enciclopedia dell'Italiano"".
  35. ^ a b Hall (1944), p. 80.
  36. ^ Luciano Canepari, A Handbook of Pronunciation, Chapter 3: "Italian", pp. 135–36
  37. ^ "acoustic data confirm the fact that [|sˈtV] /|sˈtV/ (after a pause, or 'silence') is part of the same syllable (a little particular, possibly, on the scale of syllabicity, but nothing really surprising) whereas, obviously, [VsˈtV] /VsˈtV/ constitute two phono-syllables bordering between two C" Luciano Canepari, A Handbook of Pronunciation, Chapter 3: "Italian", p. 136.
  38. ^ Bertinetto & Loporcaro (2005), p. 141.
  39. ^ Krämer (2009), pp. 138, 139.
  40. ^ a b c Krämer (2009), p. 138.
  41. ^ Krämer (2009), pp. 138, 141.
  42. ^ Krämer (2009), p. 135.
  43. ^ Examples come from Krämer (2009:136)
  44. ^ thebigbook-2ed, p. 111
  45. ^ Hall (1944), p. 75.
  46. ^ Hall (1944), p. 76.
  47. ^ Keren-Portnoy, Majorano & Vihman (2009), p. 240.
  48. ^ a b c d Zmarich & Bonifacio (2005), p. 759.
  49. ^ Zmarich & Bonifacio (2005), p. 760.
  50. ^ a b Majorano & D'Odorico (2011), p. 53.
  51. ^ a b Fasolo, Majorano & D'Odorico (2006), p. 86.
  52. ^ Majorano & D'Odorico (2011), p. 58.
  53. ^ "Stress in Italian occurs most often on the penultimate syllable (paroxytones); it also occurs on the antepenultimate syllable (proparoxytones) ...Borrelli (2002:8).
  54. ^ D'imperio & Rosenthall (1999), p. 5.
  55. ^ Majorano & D'Odorico (2011), p. 61.
  56. ^ Cossu et al. (1988), p. 10.
  57. ^ Cossu et al. (1988), p. 11.