|Writing system||Latin script|
|Language of origin||Latin language|
|Time period||~-300 to present|
|Other letters commonly used with||gh, g(x)|
G, or g, is the seventh letter in the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is gee (pronounced //), plural gees.
For earlier history, see C § History.
The evolution of the Latin alphabet's G can be traced back to the Latin alphabet's predecessor, the Greek alphabet. The voiced velar stop was represented by the third letter of the Greek alphabet, gamma (Γ), which was later adopted by the Etruscan language. Latin then borrowed this "rounded form" of gamma, C, to represent the same sound in words such as recei, which was likely an early dative form of rex, meaning "king," as found in an "early Latin inscription." Over time, however, the letter C shifted to represent the unvoiced velar stop, leading to the displacement of the letter K. Scholars believe that this change can be attributed to the influence of the Etruscan language on Latin.
Afterwards, the letter 'G' was introduced in the Old Latin period as a variant of 'C' to distinguish voiced /ɡ/ from voiceless /k/, and G was used to represent a voiced velar from this point on and C "stood for the unvoiced velar only".
The recorded originator of 'G' is freedman Spurius Carvilius Ruga, who added letter G to the teaching of the Roman alphabet during the 3rd century BC: he was the first Roman to open a fee-paying school, around 230 BCE. At this time, 'K' had fallen out of favor, and 'C', which had formerly represented both /ɡ/ and /k/ before open vowels, had come to express /k/ in all environments.
Ruga's positioning of 'G' shows that alphabetic order related to the letters' values as Greek numerals was a concern even in the 3rd century BC. According to some records, the original seventh letter, 'Z', had been purged from the Latin alphabet somewhat earlier in the 3rd century BC by the Roman censor Appius Claudius, who found it distasteful and foreign. Sampson (1985) suggests that: "Evidently the order of the alphabet was felt to be such a concrete thing that a new letter could be added in the middle only if a 'space' was created by the dropping of an old letter."
George Hempl proposed in 1899 that there never was such a "space" in the alphabet and that in fact 'G' was a direct descendant of zeta. Zeta took shapes like ⊏ in some of the Old Italic scripts; the development of the monumental form 'G' from this shape would be exactly parallel to the development of 'C' from gamma. He suggests that the pronunciation /k/ > /ɡ/ was due to contamination from the also similar-looking 'K'.
Eventually, both velar consonants /k/ and /ɡ/ developed palatalized allophones before front vowels; consequently in today's Romance languages, ⟨c⟩ and ⟨g⟩ have different sound values depending on context (known as hard and soft C and hard and soft G). Because of French influence, English language orthography shares this feature.
The modern lowercase 'g' has two typographic variants: the single-storey (sometimes "opentail") '' and the double-storey (sometimes "looptail") ''. The single-storey form derives from the majuscule (uppercase) form by raising the serif that distinguishes it from 'c' to the top of the loop (thus closing the loop), and extending the vertical stroke downward and to the left. The double-storey form () had developed similarly, except that some ornate forms then extended the tail back to the right, and to the left again, forming a closed bowl or loop. The initial extension to the left was absorbed into the upper closed bowl. The double-storey version became popular when printing switched to "Roman type" because the tail was effectively shorter, making it possible to put more lines on a page. In the double-storey version, a small top stroke in the upper-right, often terminating in an orb shape, is called an "ear".
Generally, the two forms are complementary, but occasionally the difference has been exploited to provide contrast. In the International Phonetic Alphabet, opentail ⟨ɡ⟩ has always represented a voiced velar plosive, while ⟨⟩ was distinguished from ⟨ɡ⟩ and represented a voiced velar fricative from 1895 to 1900. In 1948, the Council of the International Phonetic Association recognized ⟨ɡ⟩ and ⟨⟩ as typographic equivalents, and this decision was reaffirmed in 1993. While the 1949 Principles of the International Phonetic Association recommended the use of ⟨⟩ for a velar plosive and ⟨ɡ⟩ for an advanced one for languages where it is preferable to distinguish the two, such as Russian, this practice never caught on. The 1999 Handbook of the International Phonetic Association, the successor to the Principles, abandoned the recommendation and acknowledged both shapes as acceptable variants.
Wong et al. (2018) found that native English speakers have little conscious awareness of the looptail form (). They write: "Despite being questioned repeatedly, and despite being informed directly that G has two lowercase print forms, nearly half of the participants failed to reveal any knowledge of the looptail 'g', and only 1 of the 38 participants was able to write looptail 'g' correctly."
In Unicode, the two appearances are generally treated as glyph variants with no semantic difference. Most serif typefaces use the looptail form (for example, g) and most sans-serif typefaces use the opentail form (for example, g) but the code point in both cases is U+0067. For applications where the single-storey variant must be distinguished (such as strict IPA in a typeface where the usual g character is double-storey), the character U+0261 ɡ LATIN SMALL LETTER SCRIPT G is available, as well as an upper case version, U+A7AC Ɡ LATIN CAPITAL LETTER SCRIPT G.
See also: Hard and soft G
|Arabic||/ɡ/||Latinization; corresponding to ⟨ق⟩ or ⟨ج⟩ in Arabic|
|Catalan||/(d)ʒ/||Before e, i|
|English||/dʒ/||Before e, i, y (see exceptions below)|
|/ʒ/||Before e, i in "modern" loanwords from French|
|silent||Some words, initial <gn>, and word-finally before a consonant|
|Faroese||/j/||soft, lenited; see Faroese phonology|
|/v/||after a, æ, á, e, o, ø and before u|
|/w/||after ó, u, ú and before a, i, or u|
|silent||after a, æ, á, e, o, ø and before a|
|/ʒ/||Before e, i, y|
|Galician||/ɡ/~/ħ/||Usually||See Gheada for consonant variation|
|/ʃ/||Before e, i||obsolete spelling, replaced by the letter x|
|/ɟ/||Before ai, e, i, oi, y||Latinization|
|/ɣ/||hard, lenited; see Icelandic phonology|
|/ɟ/||After i or before e, i|
|/dʒ/||Before e, i|
|Norman||/dʒ/||Before e, i|
|/j/||Before ei, i, j, øy, y|
|/ʒ/||Before e, i, y|
|Romanian||/dʒ/||Before e, i|
|Romansh||/dʑ/||Before e, i|
|/kʲ/||After i or before e, i|
|/x/ or /h/||Before e, i, y||Variation between velar and glottal realizations depends on dialect|
|/j/||Before ä, e, i, ö, y|
|/ɟ/||Before e, i, ö, ü|
In English, the letter appears either alone or in some digraphs. Alone, it represents
⟨g⟩ is predominantly soft before ⟨e⟩ (including the digraphs ⟨ae⟩ and ⟨oe⟩), ⟨i⟩, or ⟨y⟩, and hard otherwise. It is hard in those derivations from γυνή (gynḗ) meaning woman where initial-worded as such. Soft ⟨g⟩ is also used in many words that came into English from medieval church/academic use, French, Spanish, Italian or Portuguese – these tend to, in other ways in English, closely align to their Ancient Latin and Greek roots (such as fragile, logic or magic). There remain widely used a few English words of non-Romance origin where ⟨g⟩ is hard followed by ⟨e⟩ or ⟨i⟩ (get, give, gift), and very few in which ⟨g⟩ is soft though followed by ⟨a⟩ such as gaol, which since the 20th century is almost always written as "jail".
The double consonant ⟨gg⟩ has the value /ɡ/ (hard ⟨g⟩) as in nugget, with very few exceptions: /d͡ʒ/ in exaggerate and veggies and dialectally /ɡd͡ʒ/ in suggest.
The digraph ⟨dg⟩ has the value /d͡ʒ/ (soft ⟨g⟩), as in badger. Non-digraph ⟨dg⟩ can also occur, in compounds like floodgate and headgear.
The digraph ⟨ng⟩ may represent:
Non-digraph ⟨ng⟩ also occurs, with possible values
The digraph ⟨gh⟩ (in many cases a replacement for the obsolete letter yogh, which took various values including /ɡ/, /ɣ/, /x/ and /j/) may represent:
Non-digraph ⟨gh⟩ also occurs, in compounds like foghorn, pigheaded.
The digraph ⟨gn⟩ may represent:
Non-digraph ⟨gn⟩ also occurs, as in signature, agnostic.
The trigraph ⟨ngh⟩ has the value /ŋ/ as in gingham or dinghy. Non-trigraph ⟨ngh⟩ also occurs, in compounds like stronghold and dunghill.
G is the tenth least frequently used letter in the English language (after Y, P, B, V, K, J, X, Q, and Z), with a frequency of about 2.02% in words.
Most Romance languages and some Nordic languages also have two main pronunciations for ⟨g⟩, hard and soft. While the soft value of ⟨g⟩ varies in different Romance languages (/ʒ/ in French and Portuguese, [(d)ʒ] in Catalan, /d͡ʒ/ in Italian and Romanian, and /x/ in most dialects of Spanish), in all except Romanian and Italian, soft ⟨g⟩ has the same pronunciation as the ⟨j⟩.
In Italian and Romanian, ⟨gh⟩ is used to represent /ɡ/ before front vowels where ⟨g⟩ would otherwise represent a soft value. In Italian and French, ⟨gn⟩ is used to represent the palatal nasal /ɲ/, a sound somewhat similar to the ⟨ny⟩ in English canyon. In Italian, the trigraph ⟨gli⟩, when appearing before a vowel or as the article and pronoun gli, represents the palatal lateral approximant /ʎ/.
Other languages typically use ⟨g⟩ to represent /ɡ/ regardless of position.
Amongst European languages, Czech, Dutch, Estonian and Finnish are an exception as they do not have /ɡ/ in their native words. In Dutch, ⟨g⟩ represents a voiced velar fricative /ɣ/ instead, a sound that does not occur in modern English, but there is a dialectal variation: many Netherlandic dialects use a voiceless fricative ([x] or [χ]) instead, and in southern dialects it may be palatal [ʝ]. Nevertheless, word-finally it is always voiceless in all dialects, including the standard Dutch of Belgium and the Netherlands. On the other hand, some dialects (like Amelands) may have a phonemic /ɡ/.
Faroese uses ⟨g⟩ to represent /dʒ/, in addition to /ɡ/, and also uses it to indicate a glide.
In Māori, ⟨g⟩ is used in the digraph ⟨ng⟩ which represents the velar nasal /ŋ/ and is pronounced like the ⟨ng⟩ in singer.
The Samoan and Fijian languages use the letter ⟨g⟩ by itself for /ŋ/.
In older Czech and Slovak orthographies, ⟨g⟩ was used to represent /j/, while /ɡ/ was written as ⟨ǧ⟩ (⟨g⟩ with caron).
The Azerbaijani Latin alphabet uses ⟨g⟩ exclusively for the "soft" sound, namely /ɟ/. The sound /ɡ/ is written as ⟨q⟩. This leads to unusual spellings of loanwords: qram 'gram', qrup 'group', qaraj 'garage', qallium 'gallium'.
|Unicode name||LATIN CAPITAL LETTER G||LATIN SMALL LETTER G||LATIN CAPITAL LETTER SCRIPT G||LATIN SMALL LETTER SCRIPT G|
|UTF-8||71||47||103||67||234 158 172||EA 9E AC||201 161||C9 A1|
|Numeric character reference||G