Estonian phonology

This article is about the phonology and phonetics of the Estonian language.

Vowels
There are 9 vowels and 36 diphthongs, 28 of which are native to Estonian. All nine vowels can appear as the first component of a diphthong, but only occur as the second component. A vowel characteristic of Estonian is the unrounded back vowel, which may be close-mid back, close back, or close-mid central.


 * Vowels can occur in both initial and non-initial syllables. Vowels  generally occur in initial syllables; they can occur in non-initial syllables only in compound words, exclamations, proper names and unnaturalized loanwords.
 * The front vowels are phonetically near-front.
 * Before and after, the back vowels can be fronted to.
 * The unrounded vowel transcribed can be realized as close back, close-mid central  or close-mid back , depending on the speaker.
 * The mid vowels are phonetically close-mid.
 * Word-final is often realized as mid.
 * The open vowels are phonetically near-open.

There are very few instances of vowel allophony; for instance, the long is pronounced as the diphthong  before.

Simple vowels can be inherently short or long, written with single and double vowel letters respectively. Diphthongs are always inherently long. Furthermore, long vowels and diphthongs have two suprasegmental lengths. This is described further below.

Consonants

 * are bilabial, whereas are labiodental.
 * The fricatives appear only in loanwords. Some speakers merge  with  to.
 * is realized as velar before a velar consonant (e.g. panga   'bank [gen.sg.]').
 * are velar, whereas is palatal.
 * When it appears between non-high vowels, the intervocalic can be realized as a non-syllabic close-mid front unrounded vowel.
 * The stops are voiceless unaspirated, but the short versions may be partially or fully  voiced when they appear before or between vowels.
 * In spontaneous speech, word-initial is usually dropped. It is mostly retained in formal speech, and can be realized as voiced  between two voiced sounds.

Like the vowels, most consonants can be inherently short or long. For the plosives, this distinction is reflected as a distinction in tenseness/voicing, with short plosives being voiced and long plosives being voiceless. This distinction only applies fully for single consonants after stressed syllables. In other environments, the length or tenseness/voicing distinctions may be neutralized:
 * After unstressed syllables or in consonant clusters, only obstruents can be long, other consonants are always short.
 * In consonant clusters, voiced plosives are devoiced when next to another obstruent. That is, voiced plosives only occur if they are next to sounds exclusively from the classes of vowels and sonorants.
 * Word-initially, obstruents are always voiceless, while the remaining consonants are always short. Recent loanwords may have voiced initial plosives, however.

In addition, long consonants and clusters also have two suprasegmental lengths, like the vowels. This is described below.

Non-phonemic palatalization generally occurs before front vowels. All word-internal and word-final alveolar consonants (except /r/) can be palatalized. In addition, about 0.15% of the vocabulary features fully phonemic palatalization, where palatalization occurs without the front vowel (e.g. /palʲk/ 'log, beam' vs /palk/ 'wage'). A front vowel did historically occur there, but was lost, leaving the palatalization as its only trace (a form of cheshirization). It occurs word-finally or word-medially. Thus, palatalization does not necessarily need a front vowel, and palatalized vs. plain continuants can be articulated. Estonian has pre-palatalization: palatalization occurs before rather than after the consonant and is characterized by a longer i-like transition from vowel to consonant. Palatalization is not indicated in the standard orthography.

Prosody
The stress in Estonian is usually on the first syllable, as was the case in Proto-Finnic. There are a few exceptions with the stress on the second syllable: aitäh ('thanks'), sõbranna ('female friend'). In loanwords, the original stress can be borrowed as well: ideaal ('ideal'), professor ('professor'). The stress is weak, and as length levels already control an aspect of "articulation intensity", most words appear evenly stressed.

A secondary stress, which is not always predictable, falls usually on odd-numbered syllables or some suffixes, e.g. -mine (laulmine [ˈlɑuːlˌmine] ‘singing’), -line (aluseline [ˈɑluˑseˌline] ‘basic, alkaline’), -lik (ohtlikku [ˈohːtˌlikːku] ‘dangerous (part. pl.)’). Words of more than three syllables can consist of combinations of monosyllabic, disyllabic and trisyllabic feet.

Syllables can be divided into short and long. Syllables ending in a short vowel are short, while syllables ending in a long vowel, diphthong or consonant are long. The length of vowels, consonants and thus syllables is "inherent" in the sense that it is tied to a particular word and is not subject to morphological alternations.

Suprasegmental length
All stressed long syllables can possess a suprasegmental length feature. When a syllable has this feature, any long vowel or diphthong in the syllable is lengthened further, as is any long consonant or consonant cluster at the end of that syllable. A long syllable without suprasegmental length is termed "long", "half-long", "light" or "length II" and is denoted in IPA as $⟨ˑ⟩$ or $⟨ː⟩$. A long syllable with suprasegmental length is termed "overlong", "long", "heavy" or "length III", denoted in IPA as $⟨ː⟩$ or $⟨ːː⟩$. For consistency, this article employs the terms "half-long" and "overlong" and uses $⟨ː⟩$ and $⟨ːː⟩$, respectively, to denote them.

Both the regular short-long distinction and the suprasegmental length are distinctive, so that Estonian effectively has three distinctive vowel and consonant lengths, the distinction between the second and third length levels being at a level larger than the phoneme, such as the syllable or the foot. In addition to realizing greater phonetic duration, overlength in modern Estonian involves a pitch distinction where falling pitch is realized in syllables that are overlong and level pitch is realized in syllables that are short or half-long.

The suprasegmental length is not indicated in the standard orthography except for the plosives for which a single voiceless letter represents a half-long consonant while a double voiceless letter represents an overlong consonant. There are many minimal pairs and also some minimal triplets which differ only by length:
 * vere 'blood [gen.sg.]' (short) &mdash; veere  'edge [gen. sg.]' (long) &mdash; veere  'roll [imp. 2nd sg.] ' (overlong)
 * lina 'sheet' (short) &mdash; linna  'town [gen. sg.]' (long) &mdash; linna  'town [ill. sg.]' (overlong)
 * kabi 'hoof' (short) &mdash; kapi  'wardrobe [gen. sg.]' (long) &mdash; kappi  'wardrobe [ill. sg.]' (overlong)

The extra length distinction has a number of origins:
 * Single-syllable words are always overlong, if they have a long syllable.
 * Overlong syllables appear in strong-grade environments, while half-long syllables appear in weak-grade environments. This is traceable to an earlier (Proto-Finnic) distinction between open and closed syllables: closed syllables shortened and weakened a preceding syllable.
 * Syncopation of a medial syllable lengthens the preceding syllable.
 * When a consonant disappears altogether in the weak grade, coalescence of the two adjacent vowels produces an overlong syllable.
 * Compensatory lengthening in the short illative singular form of nominals produces an overlong syllable, even from an originally short syllable.