Hard and soft G



In the Latin-based orthographies of many European languages, the letter $\langleg\rangle$ is used in different contexts to represent two distinct phonemes that in English are called hard and soft $\langleg\rangle$. The sound of a hard $\langleg\rangle$ (which often precedes the non-front vowels $\langlea o u\rangle$ or a consonant) is usually the voiced velar plosive (as in gain or go) while the sound of a soft $\langleg\rangle$ (typically before, , or ) may be a fricative or affricate, depending on the language. In English, the sound of soft $\langleg\rangle$ is the affricate, as in general, giant, and gym. A $\langleg\rangle$ at the end of a word usually renders a hard $\langleg\rangle$ (as in "rag"), while if a soft rendition is intended it would be followed by a silent $\langlee\rangle$ (as in "rage").

History
This alternation has its origins in a historical palatalization of which took place in Late Latin, and led to a change in the pronunciation of the sound  before the front vowels  and. Later, other languages not descended from Latin, such as English, inherited this feature as an orthographic convention. The Scandinavian languages, however, have undergone their shift independently.

English
In English orthography, the pronunciation of hard $\langleg\rangle$ is and that of soft $\langleg\rangle$ is ; the French soft $\langleg\rangle$,, survives in a number of French loanwords (e.g. regime, genre), [ʒ] also sometimes occurs as an allophone of [dʒ] in some accents in certain words.

In words of Greco-Latinate origin, the soft $\langleg\rangle$ pronunciation occurs before $\langlee i y\rangle$ while the hard $\langleg\rangle$ pronunciation occurs elsewhere. In some words of Germanic origin (e.g. get, give), loan words from other languages (e.g. geisha, pierogi), and irregular Greco-Latinate words (e.g. gynecology), the hard pronunciation may occur before $\langlee i y\rangle$ as well. The orthography of soft $\langleg\rangle$ is fairly consistent: a soft $\langleg\rangle$ is almost always followed by $\langlee i y\rangle$. The notable exceptions are gaol (now more commonly spelled jail) and margarine (a French borrowing whose original hard $\langleg\rangle$ softened for unknown reasons, even though the name Margaret has a hard $\langleg\rangle$). The soft pronunciation of algae, the only one heard in North America, is sometimes cited as an exception, but it is actually conformant, $\langleae\rangle$ being an alternate digraph spelling for a vowel in the $\langlee i y\rangle$ family. Though this pronunciation is listed first in some British dictionaries, hard pronunciation due to misinterpretation of orthographic $\langleae\rangle$ is widespread in British English and is listed second or alone in some British dictionaries. In some words, a soft $\langleg\rangle$ has lost its trailing $\langlee\rangle$ due to suffixing, but the combination $\langledg\rangle$ would imply the soft pronunciation anyway (e.g. fledgling, judgment, pledgor).

Digraphs and trigraphs, such as $\langleng\rangle$, $\langlegg\rangle$, and $\langledge\rangle$, have their own pronunciation rules.

While $\langlec\rangle$, which also has hard and soft pronunciations, exists alongside $\langlek\rangle$ (which always indicates a hard pronunciation), $\langleg\rangle$ has no analogous letter or letter combination which consistently indicates a hard $\langleg\rangle$ sound, even though English uses $\langlej\rangle$ consistently for the soft $\langleg\rangle$ sound (the rationale for the spelling change of "gaol" to "jail"). This leads to special issues regarding the coherence of orthography when suffixes are added to words that end in a hard-$\langleg\rangle$ sound. This additionally leads to many words spelled with g $\langlee i y\rangle$ and pronounced with a hard $\langleg\rangle$, including what may be the most common g $\langlee i y\rangle$ word "get". It has also resulted in the file format GIF having two possible pronunciations, with both hard $\langleg\rangle$ and soft $\langleg\rangle$ in common use.

Suffixation
When suffixes are added to words ending with a hard or soft $\langleg\rangle$ (such as -ed, -ing, -er, -est, -ism, -ist, -edness, -ish(ness), -ily, -iness, -ier, -iest, -ingly, -edly, and -ishly), the sound is normally maintained. Sometimes the normal rules of spelling changes before suffixes can help signal whether the hard or soft sound is intended. For example, as an accidental byproduct of the rule that doubles consonants in this situation after a short vowel, a double $\langlegg\rangle$ will normally indicate the hard pronunciation (e.g. bagged is pronounced, not as ).

There are occasional exceptions where alternations between the hard and soft sound occur before different suffixes. Examples are analogous (hard) vs. analogy (soft); similarly, prodigal with prodigy. These are generally cases where the entire word, including the suffix, has been imported from Latin, and the general Romance-language pattern of soft $\langleg\rangle$ before front vowels, but hard $\langleg\rangle$ otherwise, is preserved.

Sometimes a silent letter is added to help indicate pronunciation. For example, a silent $\langlee\rangle$ usually indicates the soft pronunciation, as in change; this may be maintained before a suffix to indicate this pronunciation (as in changeable), despite the rule that usually drops this letter. A silent $\langlei\rangle$ can also indicate a soft pronunciation, particularly with the suffixes -gion and -gious (as in region, contagious). A silent can indicate a hard pronunciation in words borrowed from French (as in analogue, league, guide) or words influenced by French spelling conventions (guess, guest); a silent  serves a similar purpose in Italian-derived words (ghetto, spaghetti).

A silent $\langlee\rangle$ can occur at the end of a word – or at the end of a component root word that is part of a larger word – after $\langleg\rangle$ as well as word-internally. In this situation, the $\langlee\rangle$ usually serves a marking function that helps to indicate that the $\langleg\rangle$ immediately before it is soft. Examples include image, management, and pigeon. Such a silent $\langlee\rangle$ also indicates that the vowel before $\langleg\rangle$ is a historic long vowel, as in rage, oblige, and range. When adding one of the above suffixes, this silent $\langlee\rangle$ is often dropped and the soft pronunciation remains. While $\langledge\rangle$ commonly indicates a soft pronunciation, the silent $\langlee\rangle$ may be dropped before another consonant while retaining the soft pronunciation in a number of words such judgment and abridgment. Also, the word veg, a clipped form of vegetate, retains the soft pronunciation despite being spelled without a silent $\langlee\rangle$ (i.e., pronounced as if spelled vedge). Similarly, soft $\langleg\rangle$ is sometimes replaced by $\langlej\rangle$ in some names of commercial entities, such as with "Enerjy Software", or "Majic 105.7" in Cleveland, Ohio and some names commonly spelled with $\langlej\rangle$ are given unusual soft $\langleg\rangle$ spellings such as Genna and Gennifer.

Letter combinations
English has many words of Romance origin, especially from French and Italian. The ones from Italian often retain the conventions of Italian orthography whereby $\langlegh\rangle$ represents hard $\langleg\rangle$ before e and i and gi and ge represent soft $\langleg\rangle$ (often even without any semivowel/vowel sound, thus representing /dʒ/ just as j usually does in English orthography). The ones from French and Spanish often retain the conventions of French orthography and Spanish orthography whereby $\langlegu\rangle$ represents hard $\langleg\rangle$ before e and i and gi and ge represent soft $\langleg\rangle$ (often realized as /ʒ/ in French and as /h/ or /χ/ in Spanish). A consequence of these orthographic tendencies is that g before o or a is almost never soft $\langleg\rangle$ in English—one way in which English orthography, which is generally not especially phonemic or regular, displays strong regularity in at least one aspect. A few exceptions include turgor and digoxin, for which the most common pronunciations use soft $\langleg\rangle$ despite the lack of "softness signal" gi or ge. But both of those words also have hard $\langleg\rangle$ pronunciations that are accepted variants, which reflects the spelling pronunciation pressure generated by the strong regularity of the digraph conventions.

A number of two-letter combinations (digraphs) follow their own pronunciation patterns and, as such, may not follow the hard/soft distinction of $\langleg\rangle$. For example, $\langleng\rangle$ often represents (as in ring) or  as in finger. The letters $\langlenge\rangle$, when final, represent, as in orange; when not final their pronunciation varies according to the word's etymology (e.g. in danger,  in anger,  in banger). In most cases, $\langlegg\rangle$ represents as in dagger, but it may also represent  as in suggest and exaggerate. (The same pair of facts can also be said of how $\langlecc\rangle$ relates to hard and soft C, as, for example, in succinct and flaccid.) Other letter combinations that don't follow the paradigm include $\langlegh\rangle$, $\langlegn\rangle$, and $\langlegm\rangle$.

The digraph $\langlegu\rangle$ is sometimes used to indicate a hard $\langleg\rangle$ pronunciation before $\langlei e y\rangle$ (e.g. guess, guitar, Guinness), including cases where $\langlee\rangle$ is silent (e.g., rogue, intrigue, catalogue, analogue). In some cases, the intervening $\langleu\rangle$ is pronounced as /w/ (distinguish, unguent).

Latin script
All modern Romance languages make the hard/soft distinction with $\langleg\rangle$, except a few that have undergone spelling reforms such as Ladino (Judaeo-Spanish) or Haitian Creole and archaic variants like Sardinian. The hard $\langleg\rangle$ is in almost all those languages (with the exception of Galician, which may instead be a voiceless pharyngeal fricative), though the soft $\langleg\rangle$ pronunciation, which occurs before $\langlei e y\rangle$, differs amongst them as follows:
 * in Italian and Romanian
 * in French and Portuguese
 * in Catalan
 * or in Spanish, depending on the dialect

Different languages use different strategies to indicate a hard pronunciation before front vowels:
 * Italian and Romanian writing systems use $\langlegh\rangle$ (e.g. Italian laghi, Romanian ghìd),
 * French, Catalan, Spanish, and Portuguese orthographies use a silent $\langleu\rangle$ (e.g. French guerre, Catalan guerra, Spanish guitarra, Portuguese guitarra). With the exception of Portuguese, a trema over the $\langleu\rangle$ is used to indicate that it is not silent (e.g. Spanish vergüenza is pronounced, with both a hard $\langleg\rangle$ and non-mute $\langleu\rangle$).
 * In Portuguese (especially Brazilian Portuguese) this was also used until the most recent orthographic reform (the new orthography now being compulsory in Brazil after a 2009-2016 transition period). The new orthography maintains the $\langlegu\rangle$ for a hard g, but there is no marking of whether the $\langleu\rangle$ is silent; the reader must already know the pronunciation of words with a $\langlegu\rangle$ (or $\langlequ\rangle$) digraph (previous: guitarra vs pingüim, current: guitarra and pinguim).

A soft pronunciation before non-front vowels is usually indicated by a silent $\langlee\rangle$ or $\langlei\rangle$ (e.g. Italian giorno, French mangeons), though Spanish, Portuguese, French and Catalan use $\langlej\rangle$ as in jueves.

Several North Germanic languages also make a hard/soft distinction. Again, the hard $\langleg\rangle$ is in most of these languages, but the soft $\langleg\rangle$ differs as follows:
 * in Swedish before $⟨e i y ä ö⟩$
 * in Norwegian before $⟨i y ei øy⟩$
 * in Faroese before $⟨e i y ey⟩$, but not before $⟨ei⟩$

Icelandic orthography is a bit more complicated by having lenited pronunciations of $\langleg\rangle$.

In German, the g is mostly a hard g, also before e and i: geben (to give), Geld (money), Gier (greed), Gift (poison, venom). Soft g occurs in loanwords, usually preserving the original pronunciation. So in words of French origin like Orange (orange), logieren (to lodge) or Etage (floor), the g is pronounced as ; words taken from English like Gin or Gender use the -sound. However others, such as agieren (act, agitate), Generation (generation) or Gymnasium (academic high school), are pronounced with a hard g. Some pronunciations vary by region: The word Giraffe is pronounced with a soft G in Austria, but with a hard G in Germany. The g in Magnet is pronounced as a hard g, but the gn in Champagner is pronounced like the French gn in champagne. The letter combination ng is usually merged to a velar nasal, and the g is not spoken in its own right; e.g., in the German word Finger, it is not audible as in the English word finger. However, when those letters are pronounced separately, as in compound words like Eingabe (input) or also in verbs like fingieren (to feign), both the n and the hard g is clearly audible. There are exceptions in loanwords like French-derived rangieren (to rank, to shunt), spoken with a velar nasal and a soft g.

Other languages typically have hard $\langleg\rangle$ pronunciations except possibly in loanwords where it may represent or.

The orthography of Luganda is similar to Italian in having a soft $\langleg\rangle$ pronunciation before front vowels (namely $\langlei y\rangle$) and $\langlegy\rangle$ indicates this soft pronunciation.

Because Esperanto orthography is phonemic, $\langleg\rangle$ always represents a hard g; a soft g is represented by the accented letter Ĝ|$\langleĝ\rangle$.

The Vietnamese alphabet does not have a hard or a soft $\langleg\rangle$ per se. However, since it was inherited from European Romance languages (Portuguese and Italian) except the diacritics which were from Greek; the letter $\langleg\rangle$ never occurs in "soft positions", i.e. before $\langlee\rangle$, $\langleê\rangle$ and $\langlei\rangle$ where the digraph $\langlegh\rangle$ (colloquially known as gờ ghép "composed $\langleg\rangle$") is used instead. Likewise, the trigraph $\langlengh\rangle$ (ngờ ghép "composed $\langleng\rangle$") also replaces the digraph $\langleng\rangle$ in those positions. "gh" can be explained as following Italian convention, and "ngh" as a form of analogy. However, there still is $\langlegi\rangle$ which is considered a digraph on its own, shortened to $\langleg\rangle$ before $\langlei\rangle$, even in the word gì.

Other scripts
In Modern Greek, which uses the Greek alphabet, the Greek letter gamma (uppercase: $\langleΓ\rangle$; lowercase: $\langleγ\rangle$) – which is ancestral to the Roman letters $\langleg\rangle$ and $\langlec\rangle$ – has "soft-type" and "hard-type" pronunciations, though Greek speakers do not use such a terminology. The "soft" pronunciation (that is, the voiced palatal fricative ) occurs before $\langleαι\rangle$ and $\langleε\rangle$ (both which represent ), and before $\langleει\rangle$, $\langleη\rangle$, $\langleι\rangle$, $\langleοι\rangle$, and $\langleυι\rangle$ (which all represent ). In other instances, the "hard" pronunciation (that is, the voiced velar fricative ) occurs.

In the Russian alphabet (a variant of Cyrillic), $\langleг\rangle$ represents both hard (твёрдый ) and soft (мягкий ) pronunciations, and, respectively. The soft pronunciation of $\langleг\rangle$ occurs before any of the "softening" vowels $\langleе ё и ю я ь\rangle$ and the hard pronunciation occurs elsewhere. However, the letter $\langleж\rangle$ functions as a "soft g" in the Romance sense, with alterations between $\langleг\rangle$ and $\langleж\rangle$ common in the language (e.g. ложиться, "to lie (down)", past tense лёг; подруга, "girlfriend", diminutive подружка). In other Slavic languages, there are similar phenomena involving $\langleg\rangle$ (or $\langleh\rangle$) and $\langlež\rangle$ (or $\langleż\rangle$).

In Modern Hebrew, which uses the Hebrew alphabet, the letter gimel ($\langleג\rangle$) typically has the sound within Hebrew words, although in some Sephardic dialects, it represents  or  when written with a dagesh (i.e., a dot placed inside the letter: $\langleגּ\rangle$), and  when without a dagesh. An apostrophe-like symbol called a Geresh can be added immediately to the left of a gimel (i.e., $\langleג׳\rangle$) to indicate that the gimel represents an affricate ).