Hard and soft C

In the Latin-based orthographies of many European languages, including English, a distinction between hard and soft $\langlec\rangle$ occurs in which $\langlec\rangle$ represents two distinct phonemes. The sound of a hard $\langlec\rangle$ often precedes the non-front vowels $\langlea\rangle$, $\langleo\rangle$ and $\langleu\rangle$, and is that of the voiceless velar stop, (as in car). The sound of a soft $\langlec\rangle$, typically before $\langlee\rangle$, $\langlei\rangle$ and $\langley\rangle$, may be a fricative or affricate, depending on the language. In English (and not coincidentally also French), the sound of soft $\langlec\rangle$ is (as in the first and last ⟨c⟩s in " c ircumferen c e").

There was no soft $\langlec\rangle$ in classical Latin, where it was always pronounced as.

History
This alternation is caused by a historical palatalization of which took place in Late Latin, and led to a change in the pronunciation of the sound  before the front vowels  and. Later, other languages not directly descended from Latin, such as English, inherited this feature as an orthographic convention.

General overview
In English orthography, the pronunciation of hard $\langlec\rangle$ is and of soft $\langlec\rangle$ is generally. Yod-coalescence has altered instances of ─ particularly in unstressed syllables ─ to  in most varieties of English, affecting words such as ocean, logician and magician. Generally, the soft $\langlec\rangle$ pronunciation occurs before $\langlei e y\rangle$; it also occurs before $\langleae\rangle$ and $\langleoe\rangle$ in a number of Greek and Latin loanwords (such as coelacanth, caecum, caesar). The hard $\langlec\rangle$ pronunciation occurs everywhere else except in the letter combinations $\langlesc\rangle$, $\langlech\rangle$, and $\langlesch\rangle$ which have distinct pronunciation rules. $\langlecc\rangle$ generally represents before $\langlei e y\rangle$, as in accident, succeed, and coccyx.

There are exceptions to the general rules of hard and soft $\langlec\rangle$:
 * The $\langlec\rangle$ in the words Celt and Celtic was traditionally soft, but since the late 19th century, the hard pronunciation has also been recognized in conscious imitation of the classical Latin pronunciation of Celtae; see Pronunciation of Celtic. Welsh and Gaelic loanwords in English which retain their native spelling, such as ceilidh, cistvaen (alternatively spelled $\langlekistvaen\rangle$) or Cymric, are also pronounced hard. The Irish and Welsh languages have no letter K, so all Cs are pronounced hard.
 * The $\langlec\rangle$ is hard in a handful of words like arcing, synced/syncing, chicer, and Quebecer (alternatively spelled $\langleQuebecker\rangle$) that involve a word normally spelled with a final $\langlec\rangle$ followed by an affix starting with $\langlee\rangle$ or $\langlei\rangle$; soccer and recce also have a hard $\langlec\rangle$.
 * The $\langlesc\rangle$ in sceptic, and its derivatives such as sceptical and scepticism, represents . These words are alternative spellings to $\langleskeptical\rangle$ and $\langleskepticism\rangle$, respectively.
 * The $\langlecc\rangle$ of flaccid now sometimes represents a single soft $\langlec\rangle$ pronunciation, which is a simplification of.
 * The $\langlec\rangle$ is silent before $\langlet\rangle$ in indict and its derivatives such as indictment, in the name of the U.S. state Connecticut, and in some pronunciations of Arctic and Antarctic.
 * In a few cases such as facade and limacon, a soft $\langlec\rangle$ appears before $\langlea o u\rangle$ and is optionally indicated to be soft by means of attaching a cedilla to its bottom, giving façade, limaçon.

A silent $\langlee\rangle$ can occur after $\langlec\rangle$ at the end of a word or component root word part of a larger word. The $\langlee\rangle$ can serve a marking function indicating that the preceding $\langlec\rangle$ is soft, as in dance and enhancement. The silent $\langlee\rangle$ often additionally indicates that the vowel before $\langlec\rangle$ is a long vowel, as in rice, mace, and pacesetter.

When adding suffixes with $\langlei e y\rangle$ (such as -ed, -ing, -er, -est, -ism, -ist, -y, and -ie) to root words ending in $\langlece\rangle$, the final $\langlee\rangle$ of the root word is often dropped and the root word retains the soft $\langlec\rangle$ pronunciation as in danced, dancing, and dancer from dance. The suffixes -ify and -ise/-ize can be added to most nouns and adjectives to form new verbs. The pronunciation of $\langlec\rangle$ in newly coined words using these suffixes is not always clear. The digraph $\langleck\rangle$ may be used to retain the hard $\langlec\rangle$ pronunciation in inflections and derivatives of a word such as trafficking from the verb traffic.

There are several cases in English in which hard and soft $\langlec\rangle$ alternate with the addition of suffixes as in critic/criticism and electric/electricity (electrician has a soft $\langlec\rangle$ pronunciation of because of yod-coalescence).

Letter combinations
A number of two-letter combinations or digraphs follow distinct pronunciation patterns and do not follow the hard/soft distinction of $\langlec\rangle$. For example, $\langlech\rangle$ may represent (as in chicken),  (as in chef), or  (as in choir). Other letter combinations that don't follow the paradigm include $\langlecz\rangle$, $\langlesc\rangle$, $\langlecs\rangle$, $\langletch\rangle$, $\langlesch\rangle$, and $\langletsch\rangle$. These come primarily from loanwords.

Besides a few examples (recce, soccer, Speccy), $\langlecc\rangle$ fits neatly with the regular rules of $\langlec\rangle$: Before $\langlei e y\rangle$, the second $\langlec\rangle$ is soft while the first is hard. Words such as accept and success are pronounced with and words such as succumb and accommodate are pronounced with. Exceptions include loanwords from Italian such as cappuccino with for $\langlecc\rangle$.

Many placenames and other proper nouns with -cester (from Old English ceaster, meaning Roman station or walled town) are pronounced with such as Worcester, Gloucester ( or ), and Leicester. The pronunciation occurs as a combination of a historically soft $\langlec\rangle$ pronunciation and historical elision of the first vowel of the suffix.

Italian loanwords
The original spellings and pronunciations of Italian loanwords have mostly been kept. Many English words that have been borrowed from Italian follow a distinct set of pronunciation rules corresponding to those in Italian. The Italian soft $\langlec\rangle$ pronunciation is (as in cello and ciao), while the hard $\langlec\rangle$ is the same as in English. Italian orthography uses $\langlech\rangle$ to indicate a hard pronunciation before $\langlee\rangle$ or $\langlei\rangle$, analogous to English using $\langlek\rangle$ (as in kill and keep) and $\langlequ\rangle$ (as in mosquito and queue).

In addition to hard and soft $\langlec\rangle$, the digraph $\langlesc\rangle$ represents or, if between vowels,  when followed by $\langlee\rangle$ or $\langlei\rangle$ (as in scena or sciarpa with, crescendo and fascia with ). Meanwhile, $\langlesch\rangle$ in Italian always represents, not , but English-speakers commonly pronounce it as , perhaps in part due to familiarity with the German pronunciation; thus bruschetta often is realized not with the of Italian , but with. Italian uses $\langlecc\rangle$ to indicate the geminate before $\langlea\rangle$, $\langleo\rangle$, $\langleu\rangle$ or  before $\langlee\rangle$ or $\langlei\rangle$. English does not have geminate phonemes, thus loanwords with soft $\langlecc\rangle$ that are pronounced with in Italian, such as cappuccino, are normally pronounced in English with the geminate simplified:.

Suffixation issues
Rarely, the use of unusual suffixed forms to create neologisms occurs. For example, the words ace and race are both standard words but adding -ate or -age (both productive affixes in English) would create spellings that seem to indicate hard $\langlec\rangle$ pronunciations. (acate and racage). Potential remedies include altering the spelling to asate and rasage, though no standard conventions exist.

Replacement with $\langlek\rangle$
Sometimes $\langlek\rangle$ replaces $\langlec\rangle$, $\langleck\rangle$, or $\langlequ\rangle$, as a trope for giving words a hard-edged or whimsical feel. Examples include the Mortal Kombat franchise and product names such as Kool-Aid and Nesquik. More intensely, this use of $\langlek\rangle$ has also been used to give extremist or racist connotations. Examples include Amerika or Amerikkka (where the $\langlek\rangle$ is reminiscent of German and the totalitarian Nazi regime and the racist Ku Klux Klan, respectively).

Other languages
Most modern Romance languages make the hard/soft distinction with $\langlec\rangle$, except a few that have undergone spelling reforms such as Ladino and archaic variants like Sardinian. Some non-Romance languages like German, Danish and Dutch use $\langlec\rangle$ in loanwords and also make this distinction. The soft $\langlec\rangle$ pronunciation, which occurs before $\langlei\rangle$, $\langlee\rangle$ and $\langley\rangle$, is: The hard $\langlec\rangle$ occurs in all other positions and represents in all these aforementioned languages, including in the case of ⟨c⟩ that comes before the Romanian letter î, which is different from i.
 * 1) in Italian, Romanian, and Old English;
 * 2) in English, French, Portuguese, Catalan, Latin American Spanish, and in words loaned into Dutch and the Scandinavian languages;
 * 3) in European and Equatoguinean Spanish;
 * 4) in words loaned into German. This is one of the more archaic pronunciations, and was also the pronunciation in Old Spanish, Old French and other historical languages where it is now pronounced . Most languages in eastern and central Europe came to use $\langlek\rangle$ only for, and $\langlec\rangle$ only for  (this would include those Slavic languages that use Latin script, Hungarian, Albanian, and the Baltic languages).

In Italian and Romanian, the orthographic convention for representing before front vowels is to add $\langleh\rangle$ (Italian chiaro,  'clear'). $\langlequ\rangle$ is used to accomplish the same purpose in Catalan, Portuguese, Spanish, and French. Rarely, the use of unusual suffixed forms to create neologisms occurs. For example, the words saco and taco are both standard words but adding -es or -ez (both productive affixes in Spanish) would create spellings that seem to indicate soft $\langlec\rangle$ pronunciations. (saces and tacez). Potential remedies include altering the spelling to saques and taquez, though no standard conventions exist. In French, Catalan, Portuguese, and Old Spanish a cedilla is used to indicate a soft pronunciation when it would otherwise seem to be hard. (French garçon, 'boy'; Portuguese coração , 'heart'; Catalan caçar , 'to hunt'). Spanish is similar, though $\langlez\rangle$ is used instead of $\langleç\rangle$ (e.g. corazón, 'heart'). However, this is essentially equivalent because despite common misconception the symbol $⟨Ç⟩$ is actually derived from a Visigothic Z.

In the orthographies of Irish and Scottish Gaelic, most consonants including $\langlec\rangle$ have a "broad" (velarized) vs "slender" distinction (palatalized) for many of its other consonants generally based on whether the nearest vowel is $\langlea o u\rangle$ or $\langlei e\rangle$, respectively. In Irish, ⟨c⟩ usually represents a hard, but represents before e or i, or after i. In Scottish Gaelic, broad $\langlec\rangle$ is one of /kʰ ʰk ʰk k/, and slender $\langlec\rangle$ is one of /kʰʲ ʰkʲ ʰkʲ kʲ/, depending on the phonetic environment.

A number of orthographies do not make a hard/soft distinction. The $\langlec\rangle$ is always hard in Welsh but is always soft in Slavic languages, Hungarian, and in Hanyu Pinyin transcription system of Mandarin Chinese, where it represents and in Indonesian and many of the transcriptions of the languages of India such as Sanskrit and Hindi, where it always represents. See also C § Other languages.

Swedish has a similar phenomenon with hard and soft $\langlek\rangle$: this results from a similar historical palatalization development. Soft $\langlek\rangle$ is typically a palatal or an alveolo-palatal, and occurs before not only  $\langlei\rangle$, $\langlee\rangle$ and $\langley\rangle$, but also $\langlej\rangle$, $\langleä\rangle$, and $\langleö\rangle$. Another similar system with hard and soft $\langlek\rangle$ is found in Faroese with the hard $\langlek\rangle$ being and the soft being, and Turkish where the soft $\langlek\rangle$ is.

The Vietnamese alphabet, while based on European orthographies, does not have a hard or a soft $\langlec\rangle$ per se. The letter $\langlec\rangle$, outside of the digraph $\langlech\rangle$, always represents a hard /k/ sound. However, it never occurs in "soft positions", i.e. before $\langlei y e ê\rangle$, where $\langlek\rangle$ is used instead, while $\langlek\rangle$ never occurs elsewhere except in the digraph $\langlekh\rangle$ and a few loanwords. Quite ironically, the names of the letters "c" and "k" are borrowed from Europe and those letters don't even occur in their own letter names (C: xê and K: ca.) Hồ Chí Minh had proposed a simplified spelling, as shown in the title of one of his books, 'Đường kách mệnh'.

Old Bohemian had hard c, but it was pronounced [x], as in Schecowitz, Tocowitz, and Crudim.