While the spelling of German is officially standardised by an international organisation (the Council for German Orthography) the pronunciation has no official standard and relies on a de facto standard documented in reference works such as Deutsches Aussprachewörterbuch (German Pronunciation Dictionary) by Eva-Maria Krech et al.,Duden 6 Das Aussprachewörterbuch (Duden volume 6, The Pronunciation Dictionary) by Max Mangold and the training materials of radio and television stations such as Westdeutscher Rundfunk, Deutschlandfunk, or Schweizer Radio und Fernsehen. This standardised pronunciation was invented, rather than coming from any particular German-speaking city. But the pronunciation that Germans usually consider to be closest to the standard is that of Hanover. Standard German is sometimes referred to as Bühnendeutsch (stage German), but the latter has its own definition and is slightly different.
Some scholars treat /ə/ as an unstressed allophone of /ɛ/. Likewise, some scholars treat /ɐ/ as an allophone of the sequence /ər/ or as a vocalized variant of /r/. The phonemic status of /ɛː/ is also debated – see below.
In non-standard accents of the Low German speaking area, as well as in some Austrian accents it may be pronounced as a narrow closing diphthong [oʊ].
/ə/ has been variously described as mid central unrounded [ə]. and close-mid central unrounded [ɘ]. It occurs only in unstressed syllables, for instance in besetzen[bəˈzɛtsən] ('occupy'). It is often considered a complementary allophone together with [ɛ], which only rarely occurs in unstressed syllables (e.g. entsetzt'). If a sonorant follows in the syllable coda, the schwa often disappears so that the sonorant becomes syllabic, for instance Kissen[ˈkɪsn̩] ('pillow'), Esel[ˈeːzl̩] ('donkey'). However, Standard German spoken in Luxembourg often lacks syllabic sonorants under the influence of Luxembourgish, so that e.g. setzen ('put') is pronounced [ˈzɛtsən], rather than [ˈzɛtsn̩].
/ɛ/ has been variously described as mid near-front unrounded [ɛ̽] and open-mid front unrounded [ɛ].
/ɛː/ has been variously described as mid front unrounded [ɛ̝ː] and open-mid front unrounded [ɛː].
/œ/ has been variously described as open-mid near-front rounded [œ̠] and somewhat lowered open-mid near-front rounded [œ̠˕].
/ɔ/ has been variously described as somewhat fronted open-mid back rounded [ɔ̟] and open-mid back rounded [ɔ].
/ɐ/ is near-open central unrounded [ɐ]. It is a common allophone of the sequence /ər/ common to all German-speaking areas but Switzerland. As schwa /ə/ is never pronounced here, it may be more appropriate to interpret [ɐ] as the vocalised allophone of the consonant /r/.
/a/ has been variously described as open front unrounded [a] and open central unrounded [ä]. Some scholars differentiate two short /a/, namely front /a/ and back /ɑ/. The latter occurs only in unstressed open syllables, exactly as /i, y, u, e, ø, o/.
Standard Austrian pronunciation of this vowel is back [ɑ].
Front [a] or even [æ] is a common realization of /a/ in northern German varieties influenced by Low German.
/aː/ has been variously described as open central unrounded [äː] and open back unrounded [ɑː]. Because of this, it is sometimes transcribed /ɑː/.
Back [ɑː] is the Standard Austrian pronunciation. It is also a common realization of /aː/ in northern German varieties influenced by Low German (in which it may even be rounded [ɒː]).
Wiese (1996) notes that "there is a tendency to neutralize the distinction between [a(ː)], [aɐ̯], and [ɐ]. That is, Oda, Radar, and Oder have final syllables which are perceptually very similar, and are nearly or completely identical in some dialects." He also says that "outside of a word context, [ɐ] cannot be distinguished from [a]. (As early as 1847, Verdi's librettist found it natural, when adaptinga play by Schiller into the Italian language, to render the distinctly German name Roller as Rolla.)
Although there is also a length contrast, vowels are often analyzed according to a tenseness contrast, with long /iː, yː, uː, eː, øː, oː/ being the tense vowels and short /ɪ, ʏ, ʊ, ɛ, œ, ɔ/ their lax counterparts. Like the English checked vowels, the German lax vowels require a following consonant, with the notable exception of [ɛː] (which is absent in many varieties, as discussed below). /a/ is sometimes considered the lax counterpart of tense /aː/ in order to maintain this tense/lax division. Short /i, y, u, e, ø, o/ occur in unstressed syllables of loanwords, for instance in Psychometrie/psyçomeˈtʁiː/ ('psychometry'). They are usually considered allophones of tense vowels, which cannot occur in unstressed syllables (unless in compounds).
Northern German varieties influenced by Low German could be analyzed as lacking contrasting vowel quantity entirely:
/aː/ has a different quality than /a/ (see above).
These varieties also consistently lack /ɛː/, and use only /eː/ in its place.
Phonemic status of /ɛː/
The long open-mid front unrounded vowel[ɛː] does not exist in many varieties of Standard German and is rendered as the close-mid front unrounded vowel[eː], so that both Ähre ('ear of grain') and Ehre ('honor') are pronounced [ˈeːʁə] (instead of "Ähre" being [ˈɛːʁə]) and both Bären ('bears') and Beeren ('berries') are pronounced [ˈbeːʁən] (instead of Bären being [ˈbɛːʁən]). However, the disputed vowel [ɛː] seems much more stable in other contexts, i.e., not preceding /r/ as in the examples above. Other relevant minimal pairs include beten ('pray') – bäten ('bid, conjunctive'), dehnen ('stretch') – Dänen ('Danes'), Segen ('blessing') – Sägen ('saws, n.'). It has been debated whether [ɛː] is a distinct phoneme or even exists, except when consciously self-censoring speech, for several reasons:
The existence of a phoneme /ɛː/ is an irregularity in a vowel system that otherwise has pairs of long and tense vs. short and lax vowels such as [oː] vs. [ɔ]. On the other hand, such irregularities are not ruled out by any principle.
Although some dialects (e.g. Ripuarian and some Alemannic dialects) have an opposition of [eː] vs. [ɛː], there is little agreement across dialects as to whether individual lexical items should be pronounced with [eː] or with [ɛː].[example needed]
The use of [ɛː] is a spelling pronunciation rather than an original feature of the language. It is an attempt to "speak as printed" (sprechen wie gedruckt) and to differentiate the spellings ⟨e⟩ and ⟨ä⟩ (i.e. speakers attempt to justify the appearance of ⟨e⟩ and ⟨ä⟩ in writing by making them distinct in the spoken language).
Speakers with an otherwise fairly standard idiolect find it rather difficult to utter longer passages with [eː] and [ɛː] in the right places. Such persons apparently have to picture the spellings of the words in question, which impedes the flow of speech.[failed verification] However, the examples above with a non-rhotic context for the disputed vowel distinction speak against this view.
/aʊ̯/ has been variously described as [äʊ],[äʊ̞],[äo̟] and [aɔ].
/ɔʏ̯/ has been variously described as [ɔʏ],[ɔʏ̞],[ɔ̝e̠],[ɔɪ], and [ɔœ].
/ʊɪ̯/ is found only in a handful of interjections such as pfui[pfʊɪ̯] and hui[hʊɪ̯], and as an alternative to disyllabic [uː.ɪ] in words such as ruhig[ʁʊɪ̯ç].
The following usually are not counted among the German diphthongs as German speakers often feel they are distinct marks of "foreign words" (Fremdwörter). These appear only in loanwords:
[o̯a], as in Croissant[kʁ̥o̯aˈsɑ̃], colloquially: [kʁ̥o̯aˈsaŋ].
Many German speakers use [ɛɪ̯] and [ɔʊ̯] as adaptations of the English diphthongs /eɪ/ and /oʊ/ in English loanwords, according to Wiese (1996), or they replace them with the native German long vowels /eː/ and /oː/. Thus, the word okay may be pronounced [ɔʊ̯ˈkɛɪ̯] or /oːˈkeː/. However, Mangold (2005) and Krech et al. (2009) do not recognize these diphthongs as phonemes, and prescribe pronunciations with the long vowels /eː/ and /oː/ instead.
In the varieties where speakers vocalize /r/ to [ɐ] in the syllable coda, a diphthong ending in [ɐ̯] may be formed with every stressable vowel:
^1Wiese (1996) notes that the length contrast is not very stable before non-prevocalic /r/ and that "Meinhold & Stock (1980:180), following the pronouncing dictionaries (Mangold (1990), Krech & Stötzer (1982)) judge the vowel in Art, Schwert, Fahrt to be long, while the vowel in Ort, Furcht, hart is supposed to be short. The factual basis of this presumed distinction seems very questionable." He goes on stating that in his own dialect, there is no length difference in these words, and that judgements on vowel length in front of non-prevocalic /r/ which is itself vocalized are problematic, in particular if /a/ precedes.
According to the "lengthless" analysis, the aforementioned "long" diphthongs are analyzed as [iɐ̯], [yɐ̯], [uɐ̯], [ɛɐ̯], [eɐ̯], [øɐ̯], [oɐ̯] and [aɐ̯]. This makes non-prevocalic /ar/ and /aːr/ homophonous as [aɐ̯] or [aː]. Non-prevocalic /ɛr/ and /ɛːr/ may also merge, but the vowel chart in Kohler (1999) shows that they have somewhat different starting points – mid-centralized open-mid front [ɛ̽] for the former, open-mid front [ɛ] for the latter.
Wiese (1996) also states that "laxing of the vowel is predicted to take place in shortened vowels; it does indeed seem to go hand in hand with the vowel shortening in many cases." This leads to [iɐ̯], [yɐ̯], [uɐ̯], [eɐ̯], [øɐ̯], [oɐ̯] being pronounced the same as [ɪɐ̯], [ʏɐ̯], [ʊɐ̯], [ɛɐ̯], [œɐ̯], [ɔɐ̯]. This merger is usual in the Standard Austrian accent, in which e.g. Moor 'bog' is often pronounced [mɔɐ̯]; this, in contrast with the Standard Northern variety, also happens intervocalically, along with the diphthongization of the laxed vowel to [Vɐ̯], so that e.g. Lehrer 'teacher' is pronounced [ˈlɛɐ̯ʁɐ] (the corresponding Standard Northern pronunciation is [ˈleːʁɐ]). Another feature of the Standard Austrian accent is complete absorption of [ɐ̯] by the preceding /ɑ, ɑː/, so that e.g. rar 'scarce' is pronounced [ʁɑː].
With around 22 to 26 phonemes, the German consonant system has an average number of consonants in comparison with other languages. One of the more noteworthy ones is the unusual affricate/pf/.
In the Standard Austrian variety, /k/ may be affricated to [kx] before front vowels.
/ts, s, z/ can be laminal alveolar [t̻s̻, s̻, z̻], laminal post-dental [t̪s̪, s̪, z̪] (i.e. fronted alveolar, articulated with the blade of the tongue just behind upper front teeth), or even apical alveolar [t̺s̺, s̺, z̺]. Austrian German often uses the post-dental articulation. /s, z/ are always strongly fricated.
Laminal, articulated with the foremost part of the blade of the tongue approaching the foremost part of the hard palate, with the tip of the tongue resting behind either upper or lower front teeth.
Apico-laminal, articulated with the tip of the tongue approaching the gums and the foremost part of the blade approaching the foremost part of the hard palate. According to Morciniec & Prędota (2005), this variant is used more frequently.
/r/ has a number of possible realizations:
Voiced apical coronal trill [r̺], either alveolar (articulated with the tip of the tongue against the alveolar ridge), or dental (articulated with the tip of the tongue against the back of the upper front teeth).
Distribution: Common in the south (Bavaria and many parts of Switzerland and Austria), but it is also found in some speakers in central and northern Germany, especially the elderly. It is also one of possible realizations of /r/ in the Standard Austrian accent, but a more common alveolar realization is an approximant [ɹ]. Even more common are uvular realizations, fricatives [ʁ ~ χ] and a trill [ʀ].
Distribution: Occurs in some conservative varieties—most speakers with a uvular /r/ realize it as a fricative or an approximant. It is also one of possible realizations of /r/ in the Standard Austrian accent, but it is less common than a fricative [ʁ ~ χ].
Dorsal continuant, about the quality of which there is not a complete agreement:
Mangold (2005) states that "with educated professional radio and TV announcers, as with professional actors on the stage and in film, the [voiced uvular] fricative [realization of] /r/ clearly predominates."
In the Standard Austrian accent, the uvular fricative is also the most common realization, although its voicing is variable (that is, it can be either voiced [ʁ] or voiceless [χ]).
Kohler (1999) writes that "the place of articulation of the consonant varies from uvular in e.g. rot ('red') to velar in e.g. treten ('kick'), depending on back or front vowel contexts." He also notes that [ʁ] is devoiced after voiceless plosives and fricatives, especially those within the same word, giving the word treten as an example. According to this author, [ʁ] can be reduced to an approximant in an intervocalic position.
Distribution: Almost all areas apart from Bavaria and parts of Switzerland.
Near-open central unrounded vowel [ɐ] is a post-vocalic allophone of (mostly dorsal) varieties of /r/. The non-syllabic variant of it is not always near-open or central; it is similar to either [ɑ] or [ə], depending on the environment.
Distribution: Widespread, but less common in Switzerland.
The voiceless stops /p/, /t/, /k/ are aspirated except when preceded by a sibilant. Many southern dialects do not aspirate /p t k/, and some northern ones do so only in a stressed position. The voiceless affricates /pf/, /ts/, and /tʃ/ are never aspirated, and neither are any other consonants besides the aforementioned /p, t, k/.
The obstruents /b, d, ɡ, z, ʒ, dʒ/ are voiceless lenis consonants[b̥, d̥, ɡ̊, z̥, ʒ̊, d̥ʒ̊] in southern varieties. Voiceless lenis consonants [b̥, d̥, ɡ̊, z̥] continue to contrast with voiceless fortis consonants [p, t, k, s]. The section § Fortis–lenis pairs covers the issue in more detail.
Before and after front vowels (/ɪ, iː, ʏ, yː, ɛ, ɛː, eː, œ, øː/ and, in varieties that realize them as front, /a/ and/or /aː/), the velar consonants /ŋ, k, ɡ/ are realized as post-palatal [ŋ˖, k̟, ɡ˖]. According to Wiese (1996), in a parallel process, /k, ɡ/ before and after back vowels (/ʊ, uː, ɔ, oː/ and, in varieties that realize them as back, /a/ and/or /aː/) are retracted to post-velar [k̠, ɡ˗] or even uvular [q, ɢ].
There is no complete agreement about the nature of /j/; it has been variously described as:
a fricative which can be fricated less strongly than /ç/,
a sound variable between a weak fricative and an approximant, and
an approximant [j], which is the usual realization in the Standard Austrian variety.
In many varieties of standard German, the glottal stop, [ʔ], occurs in careful speech before word stems that begin with a vowel and before stressed vowels word-internally, as in Oase [ʔo.ʔaː.zə] (twice). It is much more frequent in northern varieties than in the south. It is not usually considered a phoneme. In colloquial and dialectal speech, [ʔ] is often omitted, especially when the word beginning with a vowel is unstressed.
The phonemic status of affricates is controversial. The majority view accepts /pf/ and /ts/, but not /tʃ/ or the non-native /dʒ/; some accept none, some accept all but /dʒ/, and some accept all.
Although [tʃ] occurs in native words, it only appears in historic clusters of /t/ + /ʃ/ (e.g. deutsch < OHGdiutisc) or in words with expressive quality (e.g. glitschen, hutschen). [tʃ] is, however, well-established in loanwords, including German toponyms of non-Germanic origin (e.g. Zschopau).
[dʒ] and [ʒ] occur only in words of foreign origin. In certain varieties, they are replaced by [tʃ] and [ʃ] altogether.
[ʋ] is occasionally considered to be an allophone of /v/, especially in southern varieties of German.
[ç] and [x] are traditionally regarded as allophones after front vowels and back vowels, respectively. For a more detailed analysis see below at ich-Laut and ach-Laut. According to some analyses, [χ] is an allophone of /x/ after /a, aː/ and according to some also after /ʊ, ɔ, aʊ̯/. However, according to Moosmüller, Schmid & Brandstätter (2015), the uvular allophone is used after /ɔ/ only in the Standard Austrian variety.
Some phonologists do not posit a separate phoneme /ŋ/ and use /nɡ/ instead, along with /nk/ instead of /ŋk/. The phoneme sequence /nɡ/ is realized as [ŋɡ] when /ɡ/ can start a valid onset of the next syllable whose nucleus is a vowel other than unstressed /ə/, /ɪ/, or /ʊ/. It becomes [ŋ] otherwise. For example:
Ganges/ˈɡanɡəs/ [ˈɡaŋəs] ~ /ˈɡanɡɛs/ [ˈɡaŋɡɛs]
Ich-Laut and ach-Laut
Ich-Laut is the voiceless palatal fricative[ç] (which is found in the word ich[ɪç] 'I'), and ach-Laut is the voiceless velar fricative[x] (which is found in the word ach[ax] the interjection 'oh', 'alas'). Laut[laʊ̯t] is the German word for 'sound, phone'. In German, these two sounds are allophones occurring in complementary distribution. The allophone [x] occurs after back vowels and /a aː/ (for instance in Buch[buːx] 'book'), the allophone [ç] after front vowels (for instance in mich[mɪç] 'me/myself') and consonants (for instance in Furcht[fʊʁçt] 'fear', manchmal[ˈmançmaːl] 'sometimes'). The allophone [ç] also appears after vocalized ⟨r⟩ in superregional variants, e.g. in Furcht[fʊɐ̯çt] 'fear'. In southeastern regiolects, the ach-Laut is commonly used here, yielding [fʊɐ̯xt].
In loanwords, the pronunciation of potential fricatives in onsets of stressed syllables varies: in the Northern varieties of standard German, it is [ç], while in Southern varieties, it is [k], and in Western varieties, it is [ʃ] (for instance in China: [ˈçiːna] vs. [ˈkiːna] vs. [ˈʃiːna]).
The diminutivesuffix-chen is always pronounced with an ich-Laut[-çən]. Usually, this ending triggers umlaut (compare for instance Hund[hʊnt] 'dog' to Hündchen[ˈhʏntçn̩] 'little dog'), so theoretically, it could only occur after front vowels. However, in some comparatively recent coinings, there is no longer an umlaut, for instance in the word Frauchen[ˈfʀaʊ̯çən] (a diminutive of Frau 'woman'), so that a back vowel is followed by a [ç], even though normally it would be followed by a [x], as in rauchen[ˈʀaʊ̯xən] ('to smoke'). This exception to the allophonic distribution may be an effect of the morphemic boundary or an example of phonemicization, where erstwhile allophones undergo a split into separate phonemes.
The allophonic distribution of [ç] after front vowels and [x] after other vowels is also found in other languages, such as Scots, e.g. licht[lɪçt] 'light', dochter[ˈdɔxtər] 'daughter', and the same distribution is reconstructed for Middle English. However, it is by no means inevitable: Dutch, Yiddish, and many Southern German dialects retain [x] (which can be realized as [χ] instead) in all positions. It is thus reasonable to assume that Old High Germanih, the ancestor of modern ich, was pronounced with [x] rather than [ç]. While it is impossible to know for certain whether Old English words such as niht (modern night) were pronounced with [x] or [ç], [ç] is likely (see Old English phonology).
Despite the phonetic history, the complementary distribution of [ç] and [x] in modern Standard German is better described as backing of /ç/ after a back vowel, rather than fronting of /x/ after a front vowel, because [ç] is used in onsets (Chemie[çeˈmiː] 'chemistry') and after consonants (Molch[mɔlç] 'newt'), and is thus the underlying form of the phoneme.
According to Kohler, the German ach-Laut is further differentiated into two allophones, [x] and [χ]: [x] occurs after /uː, oː/ (for instance in Buch[buːx] 'book') and [χ] after /a, aː/ (for instance in Bach[baχ] 'brook'), while either [x] or [χ] may occur after /ʊ, ɔ, aʊ̯/, with [χ] predominating.
In Western varieties, there is a strong tendency to realize /ç/ as unrounded [ʃ] or [ɕ], and the phoneme may be confused or merged with /ʃ/ altogether, secondarily leading to hypercorrection effects where /ʃ/ is replaced with /ç/, for instance in Fisch[fɪʃ], which may be realized as [fɪç].
Within German dialects, a large variation exists as to the environments which trigger or prevent one realization or the other.
Various German consonants occur in pairs at the same place of articulation and in the same manner of articulation, namely the pairs /p–b/, /t–d/, /k–ɡ/, /s–z/, /ʃ–ʒ/. These pairs are often called fortis–lenis pairs, since describing them as voiced–voiceless pairs is inadequate. With certain qualifications, /tʃ–dʒ/, /f–v/ and /θ–ð/ are also considered fortis–lenis pairs.
Fortis-lenis distinction for /ʔ, m, n, ŋ, l, r, h/ is unimportant.
The fortis stops /p, t, k/ are aspirated in many varieties. The aspiration is strongest in the onset of a stressed syllable (such as Taler[ˈtʰaːlɐ] 'thaler'), weaker in the onset of an unstressed syllable (such as Vater[ˈfaːtʰɐ] 'father'), and weakest in the syllable coda (such as in Saat[zaːtʰ] 'seed'). All fortis consonants, i.e. /p, t, k, f, θ, s, ʃ, ç, x, pf, ts, tʃ/ are fully voiceless.
The lenis consonants /b, d, ɡ, v, ð, z, ʒ, j, r, dʒ/ range from being weakly voiced to almost voiceless [b̥, d̥, ɡ̊, v̥, ð̥, z̥, ʒ̊, j̥, r̥, d̥ʒ̊] after voiceless consonants:Kasbah[ˈkasb̥a] ('kasbah'), abdanken[ˈapd̥aŋkn̩] ('to resign'), rotgelb[ˈʁoːtɡ̊ɛlp] ('red-yellow'), Abwurf[ˈapv̥ʊʁf] ('dropping'), Absicht[ˈapz̥ɪçt] ('intention'), Holzjalousie[ˈhɔltsʒ̊aluziː] ('wooden jalousie'), wegjagen[ˈvɛkj̥aːɡn̩] ('to chase away'), tropfen[ˈtʁ̥ɔpfn̩] ('to drop'), Obstjuice[ˈoːpstd̥ʒ̊uːs] ('fruit juice'). Mangold (2005) states that they are "to a large extent voiced" [b, d, ɡ, v, ð, z, ʒ, j, r, dʒ] in all other environments, but some studies have found the stops /b, d, ɡ/ to be voiceless word/utterance-initially in most dialects (while still contrasting with /p, t, k/ due to the aspiration of the latter).
/b, d, ɡ, z, ʒ/ are voiceless in most southern varieties of German. For clarity, they are often transcribed as [b̥, d̥, ɡ̊, z̥, ʒ̊].
The nature of the phonetic difference between the voiceless lenis consonants and the similarly voiceless fortis consonants is controversial. It is generally described as a difference in articulatory force, and occasionally as a difference in articulatory length; for the most part, it is assumed that one of these characteristics implies the other.
In various central and southern varieties, the opposition between fortis and lenis is neutralized in the syllable onset; sometimes just in the onset of stressed syllables, sometimes in all cases.
The pair /f–v/ is not considered a fortis–lenis pair, but a simple voiceless–voiced pair, as /v/ remains voiced in all varieties, including the Southern varieties that devoice the lenes (with however some exceptions). Generally, the southern /v/ is realized as the voiced approximant[ʋ]. However, there are southern varieties which differentiate between a fortis /f/ (such as in sträflich[ˈʃtrɛːflɪç] 'culpable' from Middle High German stræflich) and a lenis /f/ ([v̥], such as in höflich[ˈhøːv̥lɪç] 'polite' from Middle High German hovelîch); this is analogous to the opposition of fortis /s/ ([s]) and lenis [z̥].
In varieties from Northern Germany, lenis stops in the syllable coda are realized as fortis stops. This does not happen in varieties from Southern Germany, Austria or Switzerland.
Since the lenis stops /b, d, ɡ/ are unvoiced or at most variably voiced (as stated above), this cannot be called devoicing in the strict sense of the word because it does not involve the loss of phonetic voice. More accurately, it can be called coda fortition or a neutralization of fortis and lenis sounds in the coda. Fricatives are truly and contrastively voiced in Northern Germany. Therefore, the fricatives undergo coda devoicing in the strict sense of the word. It is disputed whether coda devoicing is due to a constraint which specifically operates on syllable codas or whether it arises from constraints which "protect voicing in privileged positions".
As against standard pronunciation rules, in western varieties including those of the Rhineland, coda fortis–lenis neutralization results in voicing rather than devoicing if the following word begins with a vowel. For example, mit uns becomes [mɪd‿ʊns] and darf ich becomes [daʁv‿ɪʃ]. The same sandhi phenomenon exists also as a general rule in the Luxembourgish language.
In German words there is always one syllable carrying main stress, with all other syllables either being unstressed or carrying a secondary stress. The position of the main stress syllable has been a matter of debate. Traditionally, word stress is seen as falling onto the first stem syllable. In recent analyses, there is agreement that main stress is placed onto one of the last three (stressable) syllables. Within this three-syllable window, word stress is put regularly onto the second-to-last syllable, the penultimate syllable. However, syllable quantity may modify this pattern: a heavy final or prefinal syllable, i.e., one with a long vowel or with one or more consonants in the syllable coda, will usually attract main stress.
A set of illustrative examples also stems from Japanese loan words, as these cannot be borrowed with their stress patterns (Japanese has a system of pitch accents, completely different from word stress in Germanic languages):
final stress: Shoˈgun, Samuˈrai
penultimate stress: Mitsuˈbishi, Ikeˈbana
antepenultimate stress: Hiˈroshima, ˈOsaka
A list of Japanese words in German reveals that none of the words with four syllables has initial stress, confirming the three-syllable-window analysis.
Secondary stresses precede the main stress if at least two syllables are present, as in ̩Bib-li- ̩o-the-'ka-rin.
Suffixes, if containing a stressable vowel, are either stressed (-ei, ion, -al, etc.) or unstressed (-ung, -heit, -isch, etc.)
In addition, German uses different stresses for separable prefixes and inseparable prefixes in verbs and words derived from such verbs:
Words beginning with be-, ge-, er-, ver-, zer-, ent-, emp- and a few other inseparable prefixes are stressed on the root.
Words beginning with the separable prefixes ab-, auf-, ein-, vor-, and most prepositional adverbs are stressed on the prefix.
Some prefixes, notably über-, unter-, um-, and durch-, can function as separable or inseparable prefixes and are stressed or not accordingly.
A few homographs with such prefixes exist. They are not perfect homophones. Consider the word umschreiben. As ˈum•schreiben (separable prefix), it means 'to rewrite' and is pronounced [ˈʊmʃʀaɪ̯bən], with stress on the first syllable. Its associated noun, die ˈUmschreibung is also stressed on the first syllable – [ˈʊmʃʀaɪ̯bʊŋ]. On the other hand, umˈschreiben (inseparable prefix) is pronounced [ʊmˈʃʀaɪ̯bən], with stress on the second syllable. This word means 'to paraphrase', and its associated noun, die Umˈschreibung is also stressed on the second syllable – [ʊmˈʃʀaɪ̯bʊŋ]. Another example is the word umˈfahren; with stress on the root ([ʊmˈfaːʀən]) it means 'to drive around (an obstacle in the street)', and with stress on the prefix ([ˈʊmfaːʀən]) it means 'to run down/over' or 'to knock down'.
Like all infants, German infants go through a babbling stage in the early phases of phonological acquisition, during which they produce the sounds they will later use in their first words.Phoneme inventories begin with stops, nasals, and vowels; (contrasting) short vowels and liquids appear next, followed by fricatives and affricates, and finally all other consonants and consonant clusters. Children begin to produce protowords near the end of their first year. These words do not approximate adult forms, yet have a specific and consistent meaning. Early word productions are phonetically simple and usually follow the syllable structure CV or CVC, although this generalization has been challenged. The first vowels produced are /ə/, /a/, and /aː/, followed by /e/, /i/, and /ɛ/, with rounded vowels emerging last. German children often use phonological processes to simplify their early word production. For example, they may delete an unstressed syllable (Schokolade 'chocolate' pronounced [ˈlaːdə]), or replace a fricative with a corresponding stop (Dach[dax] 'roof' pronounced [dak]). One case study found that a 17-month-old child acquiring German replaced the voiceless velar fricative[x] with the nearest available continuant[h], or deleted it altogether (Buch[buːx] 'book' pronounced [buh] or [buː]).
Prosodically, children prefer bisyllabic words with the pattern strong – weak over monosyllabic words.
Vowel space development
In 2009, Lintfert examined the development of vowel space of German speakers in their first three years of life. During the babbling stage, vowel distribution has no clear pattern. However, stressed and unstressed vowels already show different distributions in the vowel space. Once word production begins, stressed vowels expand in the vowel space, while the F1 – F2 vowel space of unstressed vowels becomes more centralized. The majority of infants are then capable of stable production of F1. The variability of formant frequencies among individuals decreases with age. After 24 months, infants expand their vowel space individually at different rates. However, if the parents' utterances possess a well-defined vowel space, their children produce clearly distinguished vowel classes earlier. By about three years old, children command the production of all vowels, and they attempt to produce the four cardinal vowels, /y/, /i/, /u/ and /a/, at the extreme limits of the F1–F2 vowel space (i.e., the height and backness of the vowels are made extreme by the infants).
The acquisition of nasals in German differs from that of Dutch, a phonologically closely related language. German children produce proportionately more nasals in onset position (sounds before a vowel in a syllable) than Dutch children do. German children, once they reached 16 months, also produced significantly more nasals in syllables containing schwas, when compared with Dutch-speaking children. This may reflect differences in the languages the children are being exposed to, although the researchers claim that the development of nasals likely cannot be seen apart from the more general phonological system the child is developing.
Phonotactic constraints and reading
A 2006 study examined the acquisition of German in phonologically delayed children (specifically, issues with fronting of velars and stopping of fricatives) and whether they applied phonotactic constraints to word-initial consonant clusters containing these modified consonants. In many cases, the subjects (mean age = 5.1) avoided making phonotactic violations, opting instead for other consonants or clusters in their speech. This suggests that phonotactic constraints do apply to the speech of German children with phonological delay, at least in the case of word-initial consonant clusters. Additional research has also shown that spelling consistencies seen in German raise children's phonemic awareness as they acquire reading skills.
Sound changes and mergers
A merger found mostly in Northern accents of German is that of /ɛː/ (spelled ⟨ä, äh⟩) with /eː/ (spelled ⟨e⟩, ⟨ee⟩, or ⟨eh⟩). Some speakers merge the two everywhere, some distinguish them everywhere, others keep /ɛː/ distinct only in conditional forms of strongverbs (for example ich gäbe[ˈɡɛːbə] 'I would give' vs. ich gebe[ˈɡeːbə] 'I give' are distinguished, but Bären[ˈbeːʁən] 'bears' vs. Beeren[ˈbeːʁən] 'berries' are not. The standard pronunciation of Bären is [ˈbɛːʁən]).
Another common merger is that of /ɡ/ at the end of a syllable with [ç] or [x], for instance Krieg[kʁ̥iːç] ('war'), but Kriege[ˈkʁ̥iːɡə] ('wars'); er lag[laːx] ('he lay'), but wir lagen[ˈlaːɡən] ('we lay'). This pronunciation is frequent all over central and northern Germany. It is characteristic of regional languages and dialects, particularly Low German in the North, where ⟨g⟩ represents a fricative, becoming voiceless in the syllable coda, as is common in German (final-obstruent devoicing). However common it is, this pronunciation is considered sub-standard. Only in one case, in the grammatical ending -ig (which corresponds to English -y), the fricative pronunciation of final ⟨g⟩ is prescribed by the Siebs standard, for instance wichtig[ˈvɪçtɪç] ('important'), Wichtigkeit[ˈvɪçtɪçkaɪt] 'importance'. The merger occurs neither in Austro-Bavarian and Alemannic German nor in the corresponding varieties of Standard German, and therefore in these regions -ig is pronounced [ɪɡ̊].
Many speakers do not distinguish the affricate/pf/ from the simple fricative/f/ in the beginning of a word, in which case the verb (er) fährt ('[he] travels') and the noun Pferd ('horse') are both pronounced [fɛɐ̯t]. This most commonly occurs in northern and western Germany, where the local dialects did not originally have the sound /pf/. Some speakers also have peculiar pronunciation for /pf/ in the middle or end of a word, replacing the [f] in /pf/ with a voiceless bilabial fricative, i.e. a consonant produced by pressing air flow through the tensed lips. Thereby Tropfen ('drop') becomes [ˈtʁ̥ɔpɸn̩], rather than [ˈtʁ̥ɔpfn̩].
Many speakers who have a vocalization of /r/ after /a/ merge this combination with long /aː/ (i.e. /ar/ > *[aɐ] or *[ɑɐ] > [aː] or [ɑː]). Hereby, Schaf ('sheep') and scharf ('sharp') can both be pronounced [ʃaːf] or [ʃɑːf]. This merger does not occur where /a/ is a front vowel while /aː/ is realised as a back vowel. Here the words are kept distinct as [ʃɑːf] ('sheep') and [ʃaːf] ('sharp').
In umlaut forms, the difference usually reoccurs: Schäfer[ˈʃɛːfɐ] or [ˈʃeːfɐ] vs. schärfer[ˈʃɛɐ̯fɐ]. Speakers with this merger also often use [aːç] (instead of formally normal /aːx/) where it stems from original /arç/. The word Archen ('arks') is thus pronounced [ˈaːçn̩], which makes a minimal pair with Aachen[ˈaːxn̩], arguably making the difference between [ç] and [x]phonemic, rather than just allophonic, for these speakers.
In the standard pronunciation, the vowel qualities /i/, /ɪ/, /e/, /ɛ/, as well as /u/, /ʊ/, /o/, /ɔ/, are all still distinguished even in unstressed syllables. In this latter case, however, many simplify the system in various degrees. For some speakers, this may go so far as to merge all four into one, hence misspellings by schoolchildren such as Bräutegam (instead of Bräutigam) or Portogal (instead of Portugal).
In everyday speech, more mergers occur, some of which are universal and some of which are typical for certain regions or dialect backgrounds. Overall, there is a strong tendency of reduction and contraction. For example, long vowels may be shortened, consonant clusters may be simplified, word-final [ə] may be dropped in some cases, and the suffix -en may be contracted with preceding consonants, e.g. [ham] for haben[ˈhaːbən] ('to have').
If the clusters [mp], [lt], [nt], or [ŋk] are followed by another consonant, the stops /p/, /t/ and /k/ usually lose their phonemic status. Thus while the standard pronunciation distinguishes ganz[ɡants] ('whole') from Gans[ɡans] ('goose'), as well as er sinkt[zɪŋkt] from er singt[zɪŋt], the two pairs are homophones for most speakers. The commonest practice is to drop the stop (thus [ɡans], [zɪŋt] for both words), but some speakers insert the stop where it is not etymological ([ɡants], [zɪŋkt] for both words), or they alternate between the two ways. Only a few speakers retain a phonemic distinction.
Middle High German
The Middle High German vowels [ei̯] and [iː] developed into the modern Standard German diphthong [aɪ̯], whereas [ou̯] and [uː] developed into [aʊ̯]. For example, Middle High German heiz/hei̯s/ and wîz/wiːs/ ('hot' and 'white') became Standard German heiß/haɪ̯s/ and weiß/vaɪ̯s/. In some dialects, the Middle High German vowels have not changed, e.g. Swiss Germanheiss/hei̯s/ and wiiss/viːs/, while in other dialects or languages, the vowels have changed but the distinction is kept, e.g. Bavarianhoaß/hɔɐ̯s/ and weiß/vaɪ̯s/, Ripuarianheeß/heːs/ and wieß/viːs/ (however the Colognian dialect has kept the original [ei] diphthong in heiß), Yiddishהײסheys/hɛɪ̯s/ and װײַסvays/vaɪ̯s/.
The Middle High German diphthongs[iə̯], [uə̯] and [yə̯] became the modern Standard German long vowels [iː], [uː] and [yː] after the Middle High German long vowels changed to diphthongs. Most Upper German dialects retain the diphthongs. A remnant of their former diphthong character is shown when [iː] continues to be written ie in German (as in Liebe 'love').
German incorporates a significant number of loanwords from other languages. Loanwords are often adapted to German phonology but to varying degrees, depending on the speaker and the commonness of the word. /ʒ/ and /dʒ/ do not occur in native German words but are common in a number of French and English loan words. Many speakers replace them with /ʃ/ and /tʃ/ respectively (especially in Southern Germany, Austria and Switzerland), so that Dschungel (from English jungle) can be pronounced [ˈdʒʊŋl̩] or [ˈtʃʊŋl̩]. Some speakers in Northern and Western Germany merge /ʒ/ with /dʒ/, so that Journalist (phonemically /dʒʊʁnaˈlɪst ~ ʒʊʁnaˈlɪst/) can be pronounced [ʒʊɐ̯naˈlɪst], [dʒʊɐ̯naˈlɪst] or [ʃʊɐ̯naˈlɪst]. The realization of /ʒ/ as [tʃ], however, is uncommon.
Loanwords from English
Many English words are used in German, especially in technology and pop culture. Some speakers pronounce them similarly to their native pronunciation, but many speakers change non-native phonemes to similar German phonemes (even if they pronounce them in a rather English manner in an English-language setting):
English /θ, ð/ are usually pronounced as in RP or General American; some speakers replace them with /s/ and /z/ respectively (th-alveolarization) e.g. Thriller[ˈθʁɪlɐ ~ ˈsʁɪlɐ].
English /ɹ/ can be pronounced the same as in English, i.e. [ɹ], or as the corresponding native German /r/ e.g. Rock[ʀɔk] or [rɔk]. German and Austrian speakers tend to be variably rhotic when using English loanwords.
English /w/ is often replaced with German /v/ e.g. Whisk(e)y[ˈvɪskiː].
word-initial /s/ is often retained (especially in the South, where word-initial /s/ is common), but many speakers replace it with /z/ e.g. Sound[zaʊ̯nt].
word-initial /st/ and /sp/ are usually retained, but some speakers (especially in South Western Germany and Western Austria) replace them with /ʃt/ and /ʃp/ e.g. Steak[ʃteɪk] or [ʃteːk], Spray[ʃpʁeɪ] or [ʃpʁeː].
English /tʃ/ is usually retained, but in Northern and Western Germany as well as Luxembourg, it is often replaced with /ʃ/ e.g. Chips[ʃɪps].
In Northern Standard German, final-obstruent devoicing is applied to English loan words just as to other words e.g. Airbag[ˈɛːɐ̯bɛk], Lord[lɔʁt] or [lɔɐ̯t], Backstage[ˈbɛksteːtʃ]. However, in Southern Standard German, in Swiss Standard German and Austrian Standard German, final-obstruent devoicing does not occur and so speakers are more likely to retain the original pronunciation of word-final lenes (although realizing them as fortes may occur because of confusing English spelling with pronunciation).
English /eɪ/ and /oʊ/ are often replaced with /eː/ and /oː/ respectively e.g. Homepage[ˈhoːmpeːtʃ].
English /æ/ and /ɛ/ are pronounced the same, as German /ɛ/ (met–mat merger) e.g. Backup[ˈbɛkap].
English /ɒ/ and /ɔː/ are pronounced the same, as German /ɔ/ (cot–caught merger) e.g. Box[bɔks].
English /ʌ/ is usually pronounced as German /a/ e.g. Cutter[ˈkatɐ].
English /ɜːr/ is usually pronounced as German /œʁ/ e.g. Shirt[ʃœʁt] or [ʃœɐ̯t].
English /i/ is pronounced as /iː/ (happy-tensing) e.g. Whisk(e)y[ˈvɪskiː].
Loanwords from French
French loanwords, once very numerous, have in part been replaced by native German coinages or more recently English loanwords. Besides /ʒ/, they can also contain the characteristic nasal vowels[ãː], [ɛ̃ː], [œ̃ː] and [õː] (always long). However, their status as phonemes is questionable and they are often resolved into sequences either of (short) oral vowel and [ŋ] (in the north), or of (long or short) oral vowel and [n] or sometimes [m] (in the south). For example, Ballon[baˈlõː] ('balloon') may be realized as [baˈlɔŋ] or [baˈloːn], Parfüm[paʁˈfœ̃ː] ('perfume') as [paʁˈfœŋ] or [paʁ'fyːm] and Orange[oˈʁãːʒə] ('orange') as [oˈʁaŋʒə] or [o'ʁanʒə].
The sample text is a reading of "The North Wind and the Sun". The phonemic transcription treats every instance of [ɐ] and [ɐ̯] as /ər/ and /r/, respectively. The phonetic transcription is a fairly narrow transcription of the educated northern accent. The speaker transcribed in the narrow transcription is 62 years old, and he is reading in a colloquial style. Aspiration, glottal stops and devoicing of the lenes after fortes are not transcribed.
The audio file contains the whole fable and was recorded by a much younger speaker.
Einst stritten sich Nordwind und Sonne, wer von ihnen beiden wohl der Stärkere wäre, als ein Wanderer, der in einen warmen Mantel gehüllt war, des Weges daherkam. Sie wurden einig, daß derjenige für den Stärkeren gelten sollte, der den Wanderer zwingen würde, seinen Mantel abzunehmen. Der Nordwind blies mit aller Macht, aber je mehr er blies, desto fester hüllte sich der Wanderer in seinen Mantel ein. Endlich gab der Nordwind den Kampf auf. Nun erwärmte die Sonne die Luft mit ihren freundlichen Strahlen, und schon nach wenigen Augenblicken zog der Wanderer seinen Mantel aus. Da mußte der Nordwind zugeben, daß die Sonne von ihnen beiden der Stärkere war.
^Pages 1-2 of the book (Deutsches Aussprachewörterbuch) discuss die Standardaussprache, die Gegenstand dieses Wörterbuches ist (the standard pronunciation which is the topic of this dictionary). It also mentions Da sich das Deutsche zu einer plurizentrischen Sprache entwickelt hat, bildeten sich jeweils eigene Standardvarietäten (und damit Standardaussprachen) (German has developed into a pluricentric language separate standard varieties (and hence standard pronunciations)), but refers to these standards as regionale und soziolektale Varianten (regional and sociolectal variants).
^"Reflections on Diglossia". In northern Germany, it appears that in Hanover – perhaps because of the presence of the electoral (later royal) court – a parastandard High German was spoken by the 18th century as well, at least among the educated, with the curious result that Hanover speech – though non-native – became the model of German pronunciation on the stage (Bühnendeutsch), since everywhere else in Germany dialects were still spoken by everyone. Other capitals (Berlin, Dresden, Munich, Vienna) eventually developed their own Umgangssprachen, but the Hanover model remained the ideal.
^"Reading Heinrich Heine"(PDF). He spoke the dialect of Hanover, where – as also in the vicinity to the south of this city – German is pronounced best.
^"Nicht das beste Hochdeutsch in Hannover". In Hannover wird zweifellos ein Deutsch gesprochen, das sehr nah an der nationalen Aussprachenorm liegt. Aber das gilt auch für andere norddeutsche Städte wie Kiel, Münster oder Rostock. Hannover hat da keine Sonderstellung.
^Differences include the pronunciation of the endings -er, -en, and -em.
^ abcSource: Wiese (1996:11, 14). On the page 14, the author states that /aɪ̯/, /aʊ̯/ and /ɔʏ̯/ are of the same quality as vowels of which they consist. On the page 8, he states that /a/ is low central.
^ abcSee vowel chart in Kohler (1999:87). Despite their true ending points, Kohler still transcribes them as /aɪ̯ aʊ̯ ɔɪ̯/, i.e. with higher offsets than those actually have.
^Source: Krech et al. (2009:72). Authors do not provide a vowel chart. Rather, they state rather vaguely that "the diphthong [aɛ̯] is a monosyllabic compound consisting of the unrounded open vowel [a] and the unrounded mid front vowel [ɛ]."
^Source: Krech et al. (2009:72–73). Authors do not provide a vowel chart. Rather, they state rather vaguely that "the diphthong [aɔ̯] is a monosyllabic compound consisting of the unrounded open vowel [a] and the rounded mid back vowel [ɔ]."
^Krech et al. (2009:73). Authors do not provide a vowel chart. Rather, they state rather vaguely that "the diphthong [ɔœ̯] is a monosyllabic compound consisting of the rounded mid back vowel [ɔ] and the rounded mid front vowel [œ]."
^ abcMoosmüller, Schmid & Brandstätter (2015:341–342): "SAG features a wide variety of realizations of the trill. In approximately the past 40 years, the pronunciation norm has changed from an alveolar to a uvular trill. The latter is mostly pronounced as a fricative, either voiced or voiceless. Alveolar trills are still in use, mostly pronounced as an approximant.
^[v] written v[clarify] can devoice in nearly every place once the word has become common; w is devoiced in Möwe, Löwe. On the other hand, the keeping to the variety is so standard that doof/do:f/ induced the writing "(der) doofe" even though the standard pronunciation of the latter word is /ˈdoːvə/
Altvater-Mackensen, N.; Fikkert, P. (2007), "On the acquisition of nasals in Dutch and German", Linguistics in the Netherlands, 24: 14–24, doi:10.1075/avt.24.04alt
Ammon, Ulrich; Bickel, Hans; Ebner, Jakob; Esterhammer, Ruth; Gasser, Markus; Hofer, Lorenz; Kellermeier-Rehbein, Birte; Löffler, Heinrich; Mangott, Doris; Moser, Hans; Schläpfer, Robert; Schloßmacher, Michael; Schmidlin, Regula; Vallaster, Günter (2004), Variantenwörterbuch des Deutschen. Die Standardsprache in Österreich, der Schweiz und Deutschland sowie in Liechtenstein, Luxemburg, Ostbelgien und Südtirol, Berlin, New York: Walter de Gruyter, ISBN3-11-016575-9
Goswami, U.; Ziegler, J.; Richardson, U. (2005), "The effects of spelling consistency on phonological awareness: A comparison of English and German", Journal of Experimental Child Psychology, 92 (4): 345–365, doi:10.1016/j.jecp.2005.06.002, PMID16087187
Kohler, Klaus J. (1999), "German", Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet, Cambridge: Cambridge University Press, pp. 86–89, ISBN0-521-65236-7
Krech, Eva Maria; Stötzer, Ursula (1982), Großes Wörterbuch der deutschen Aussprache, Leipzig: VEB Bibhographisches Institut, ISBN978-3323001404
Krech, Eva Maria; Stock, Eberhard; Hirschfeld, Ursula; Anders, Lutz-Christian (2009), Deutsches Aussprachewörterbuch, Berlin, New York: Walter de Gruyter, ISBN978-3-11-018202-6