Altaic languages

Altaic is a controversial proposed language family that would include the Turkic, Mongolic and Tungusic language families and possibly also the Japonic and Koreanic languages. The hypothetical language family has long been rejected by most comparative linguists, although it continues to be supported by a small but stable scholarly minority. Speakers of the constituent languages are currently scattered over most of Asia north of 35° N and in some eastern parts of Europe, extending in longitude from the Balkan Peninsula to Japan. The group is named after the Altai mountain range in the center of Asia.

The Altaic family was first proposed in the 18th century. It was widely accepted until the 1960s and is still listed in many encyclopedias and handbooks, and references to Altaic as a language family continue to percolate to modern sources through these older sources. Since the 1950s, most comparative linguists have rejected the proposal, after supposed cognates were found not to be valid, hypothesized sound shifts were not found, and Turkic and Mongolic languages were found to have been converging rather than diverging over the centuries. The relationship between the Altaic languages is now generally accepted to be the result of a sprachbund rather than common ancestry, with the languages showing influence from prolonged contact.

The continued use of the term "Altaic" to refer to the various iterations of an Altaic theory, for the "Altaic sprachbund", and infrequently as a general term for the region has resulted in confusion around the status of the Altaic hypothesis. As a result, many Altaicists have adopted instead the name "Transeurasian" in relation to modifications of the family proposal, in order to avoid such confusion. This confusion is compounded further by literature that still - contrary to the current scholarly consensus - refers to Altaic as an accepted hypothesis.

Altaic has maintained a limited degree of scholarly support, in contrast to some other early macrofamily proposals. Continued research on Altaic is still being undertaken by a core group of academic linguists, but their research has not found wider support. In particular it has support from the Institute of Linguistics of the Russian Academy of Sciences and remains influential as a substratum of Turanism, where a hypothetical common linguistic ancestor has been used in part as a basis for a multiethnic nationalist movement.

Earliest attestations
The earliest attested expressions in Proto-Turkic are recorded in various Chinese sources. Anna Dybo identifies in Shizi (330 BCE) and the Book of Han (111 CE) several dozen Proto-Turkic exotisms in Chinese Han transcriptions. Lanhai Wei and Hui Li reconstruct the name of the Xiōngnú ruling house as PT *Alayundluğ /alajuntˈluγ/ 'piebald horse clan.'

The earliest known texts in a Turkic language are the Orkhon inscriptions, 720–735 AD. They were deciphered in 1893 by the Danish linguist Vilhelm Thomsen in a scholarly race with his rival, the German–Russian linguist Wilhelm Radloff. However, Radloff was the first to publish the inscriptions.

The first Tungusic language to be attested is Jurchen, the language of the ancestors of the Manchus. A writing system for it was devised in 1119 AD and an inscription using this system is known from 1185 (see List of Jurchen inscriptions).

The earliest Mongolic language of which we have written evidence is known as Middle Mongol. It is first attested by an inscription dated to 1224 or 1225 AD, the Stele of Yisüngge, and by the Secret History of the Mongols, written in 1228 (see Mongolic languages). The earliest Para-Mongolic text is the Memorial for Yelü Yanning, written in the Khitan large script and dated to 986 AD. However, the Inscription of Hüis Tolgoi, discovered in 1975 and analysed as being in an early form of Mongolic, has been dated to 604–620 AD. The Bugut inscription dates back to 584 AD.

Japanese is first attested in the form of names contained in a few short inscriptions in Classical Chinese from the 5th century AD, such as found on the Inariyama Sword. The first substantial text in Japanese, however, is the Kojiki, which dates from 712 AD. It is followed by the Nihon shoki, completed in 720, and then by the Man'yōshū, which dates from c. 771–785, but includes material that is from about 400 years earlier.

The most important text for the study of early Korean is the Hyangga, a collection of 25 poems, of which some go back to the Three Kingdoms period (57 BC–668 AD), but are preserved in an orthography that only goes back to the 9th century AD. Korean is copiously attested from the mid-15th century on in the phonetically precise Hangul system of writing.

Origins
The earliest known reference to a unified language group of Turkic, Mongolic and Tungusic languages is from the 1692 work of Nicolaes Witsen which may be based on a 1661 work of Abu al-Ghazi Bahadur, Genealogy of the Turkmens.

A proposed grouping of the Turkic, Mongolic, and Tungusic languages was published in 1730 by Philip Johan von Strahlenberg, a Swedish officer who traveled in the eastern Russian Empire while a prisoner of war after the Great Northern War. However, he may not have intended to imply a closer relationship among those languages. Later proposals to include the Korean and Japanese languages into a "Macro-Altaic" family have always been controversial. The original proposal was sometimes called "Micro-Altaic" by retronymy. Most proponents of Altaic continue to support the inclusion of Korean, but fewer do for Japanese. Some proposals also included Ainuic but this is not widely accepted even among Altaicists themselves. A common ancestral Proto-Altaic language for the "Macro" family has been tentatively reconstructed by Sergei Starostin and others.

Micro-Altaic includes about 66 living languages, to which Macro-Altaic would add Korean, Jeju, Japanese, and the Ryukyuan languages, for a total of about 74 (depending on what is considered a language and what is considered a dialect). These numbers do not include earlier states of languages, such as Middle Mongol, Old Korean, or Old Japanese.

Uralo-Altaic hypothesis
In 1844, the Finnish philologist Matthias Castrén proposed a broader grouping which later came to be called the Ural–Altaic family, which included Turkic, Mongolian, and Manchu-Tungus (=Tungusic) as an "Altaic" branch, and also the Finno-Ugric and Samoyedic languages as the "Uralic" branch (though Castrén himself used the terms "Tataric" and "Chudic"). The name "Altaic" referred to the Altai Mountains in East-Central Asia, which are approximately the center of the geographic range of the three main families. The name "Uralic" referred to the Ural Mountains.

While the Ural-Altaic family hypothesis can still be found in some encyclopedias, atlases, and similar general references, since the 1960s it has been heavily criticized. Even linguists who accept the basic Altaic family, such as Sergei Starostin, completely discard the inclusion of the "Uralic" branch.

The term continues to be used for the central Eurasian typological, grammatical and lexical convergence zone. Indeed, "Ural-Altaic" may be preferable to "Altaic" in this sense. For example, Juha Janhunen states that "speaking of 'Altaic' instead of 'Ural-Altaic' is a misconception, for there are no areal or typological features that are specific to 'Altaic' without Uralic."

Korean and Japanese languages
In 1857, the Austrian scholar Anton Boller suggested adding Japanese to the Ural–Altaic family.

In the 1920s, G.J. Ramstedt and E.D. Polivanov advocated the inclusion of Korean. Decades later, in his 1952 book, Ramstedt rejected the Ural–Altaic hypothesis but again included Korean in Altaic, an inclusion followed by most leading Altaicists (supporters of the theory) to date. His book contained the first comprehensive attempt to identify regular correspondences among the sound systems within the Altaic language families.

In 1960, Nicholas Poppe published what was in effect a heavily revised version of Ramstedt's volume on phonology that has since set the standard in Altaic studies. Poppe considered the issue of the relationship of Korean to Turkic-Mongolic-Tungusic not settled. In his view, there were three possibilities: (1) Korean did not belong with the other three genealogically, but had been influenced by an Altaic substratum; (2) Korean was related to the other three at the same level they were related to each other; (3) Korean had split off from the other three before they underwent a series of characteristic changes.

Roy Andrew Miller's 1971 book Japanese and the Other Altaic Languages convinced most Altaicists that Japanese also belonged to Altaic. Since then, the "Macro-Altaic" has been generally assumed to include Turkic, Mongolic, Tungusic, Korean, and Japanese.

In 1990, Unger advocated a family consisting of Tungusic, Korean, and Japonic languages, but not Turkic or Mongolic.

However, many linguists dispute the alleged affinities of Korean and Japanese to the other three groups. Some authors instead tried to connect Japanese to the Austronesian languages.

In 2017, Martine Robbeets proposed that Japanese (and possibly Korean) originated as a hybrid language. She proposed that the ancestral home of the Turkic, Mongolic, and Tungusic languages was somewhere in northwestern Manchuria. A group of those proto-Altaic ("Transeurasian") speakers would have migrated south into the modern Liaoning province, where they would have been mostly assimilated by an agricultural community with an Austronesian-like language. The fusion of the two languages would have resulted in proto-Japanese and proto-Korean.

In a typological study that does not directly evaluate the validity of the Altaic hypothesis, Yurayong and Szeto (2020) discuss for Koreanic and Japonic the stages of convergence to the Altaic typological model and subsequent divergence from that model, which resulted in the present typological similarity between Koreanic and Japonic. They state that both are "still so different from the Core Altaic languages that we can even speak of an independent Japanese-Korean type of grammar. Given also that there is neither a strong proof of common Proto-Altaic lexical items nor solid regular sound correspondences but, rather, only lexical and structural borrowings between languages of the Altaic typology, our results indirectly speak in favour of a “Paleo-Asiatic” origin of the Japonic and Koreanic languages."

The Ainu language
In 1962, John C. Street proposed an alternative classification, with Turkic-Mongolic-Tungusic in one grouping and Korean-Japanese-Ainu in another, joined in what he designated as the "North Asiatic" family. The inclusion of Ainu was adopted also by James Patrie in 1982.

The Turkic-Mongolic-Tungusic and Korean-Japanese-Ainu groupings were also posited in 2000–2002 by Joseph Greenberg. However, he treated them as independent members of a larger family, which he termed Eurasiatic.

The inclusion of Ainu is not widely accepted by Altaicists. In fact, no convincing genealogical relationship between Ainu and any other language family has been demonstrated, and it is generally regarded as a language isolate.

Early criticism and rejection
Starting in the late 1950s, some linguists became increasingly critical of even the minimal Altaic family hypothesis, disputing the alleged evidence of genetic connection between Turkic, Mongolic and Tungusic languages.

Among the earlier critics were Gerard Clauson (1956), Gerhard Doerfer (1963), and Alexander Shcherbak. They claimed that the words and features shared by Turkic, Mongolic, and Tungusic languages were for the most part borrowings and that the rest could be attributed to chance resemblances. In 1988, Doerfer again rejected all the genetic claims over these major groups.

Modern controversy
A major continuing supporter of the Altaic hypothesis has been Sergei Starostin, who published a comparative lexical analysis of the Altaic languages in 1991. He concluded that the analysis supported the Altaic grouping, although it was "older than most other language families in Eurasia, such as Indo-European or Finno-Ugric, and this is the reason why the modern Altaic languages preserve few common elements".

In 1991 and again in 1996, Roy Miller defended the Altaic hypothesis and claimed that the criticisms of Clauson and Doerfer apply exclusively to the lexical correspondences, whereas the most pressing evidence for the theory is the similarities in verbal morphology.

In 2003, Claus Schönig published a critical overview of the history of the Altaic hypothesis up to that time, siding with the earlier criticisms of Clauson, Doerfer, and Shcherbak.

In 2003, Starostin, Anna Dybo and Oleg Mudrak published the Etymological Dictionary of the Altaic Languages, which expanded the 1991 lexical lists and added other phonological and grammatical arguments.

Starostin's book was criticized by Stefan Georg in 2004 and 2005, and by Alexander Vovin in 2005.

Other defenses of the theory, in response to the criticisms of Georg and Vovin, were published by Starostin in 2005, Blažek in 2006, Robbeets in 2007, and Dybo and G. Starostin in 2008.

In 2010, Lars Johanson echoed Miller's 1996 rebuttal to the critics, and called for a muting of the polemic.

List of supporters and critics of the Altaic hypothesis
The list below comprises linguists who have worked specifically on the Altaic problem since the publication of the first volume of Ramstedt's Einführung in 1952. The dates given are those of works concerning Altaic. For supporters of the theory, the version of Altaic they favor is given at the end of the entry, if other than the prevailing one of Turkic–Mongolic–Tungusic–Korean–Japanese.

Major supporters

 * Pentti Aalto (1955). Turkic–Mongolic–Tungusic–Korean.
 * Anna V. Dybo (S. Starostin et al. 2003, A. Dybo and G. Starostin 2008).
 * Frederik Kortlandt (2010).
 * Karl H. Menges (1975). Common ancestor of Korean, Japanese and traditional Altaic dated back to the 7th or 8th millennium BC (1975: 125).
 * Roy Andrew Miller (1971, 1980, 1986, 1996). Supported the inclusion of Korean and Japanese.
 * Oleg A. Mudrak (S. Starostin et al. 2003).
 * Nicholas Poppe (1965). Turkic–Mongolic–Tungusic and perhaps Korean.
 * Alexis Manaster Ramer.
 * Martine Robbeets (2004, 2005, 2007, 2008, 2015, 2021) (in the form of "Transeurasian").
 * G. J. Ramstedt (1952–1957). Turkic–Mongolic–Tungusic–Korean.
 * George Starostin (A. Dybo and G. Starostin 2008).
 * Sergei Starostin (1991, S. Starostin et al. 2003).
 * John C. Street (1962). Turkic–Mongolic–Tungusic and Korean–Japanese–Ainu, grouped as "North Asiatic".
 * Talât Tekin (1994). Turkic–Mongolic–Tungusic–Korean.

Major critics

 * Gerard Clauson (1956, 1959, 1962).
 * Gerhard Doerfer (1963, 1966, 1967, 1968, 1972, 1973, 1974, 1975, 1981, 1985, 1988, 1993).
 * Susumu Ōno (1970, 2000)
 * Juha Janhunen (1992, 1995) (tentative support of Mongolic-Tungusic).
 * Claus Schönig (2003).
 * Stefan Georg (2004, 2005).
 * Alexander Vovin (2005, 2010, 2017). Formerly an advocate of Altaic (1994, 1995, 1997, 1999, 2000, 2001), later a critic.
 * Alexander Shcherbak.
 * Alexander B. M. Stiven (2008, 2010).

Advocates of alternative hypotheses

 * James Patrie (1982) and Joseph Greenberg (2000–2002). Turkic–Mongolic–Tungusic and Korean–Japanese–Ainu, grouped in a common taxon (cf. John C. Street 1962).
 * J. Marshall Unger (1990). Tungusic–Korean–Japanese ("Macro-Tungusic"), with Turkic and Mongolic as separate language families.
 * Lars Johanson (2010). Agnostic, proponent of a "Transeurasian" verbal morphology not necessarily genealogically linked.

"Transeurasian" renaming
In Robbeets and Johanson (2010), there was a proposal to replace the name "Altaic" with the name "Transeurasian". While "Altaic" has sometimes included Japonic, Koreanic, and other languages or families, but only on the consideration of particular authors, "Transeurasian" was specifically intended to always include Turkic, Mongolic, Tungusic, Japonic, and Koreanic. Robbeets and Johanson gave as their reasoning for the new term: 1) to avoid confusion between the different uses of Altaic as to which group of languages is included, 2) to reduce the counterproductive polarization between "Pro-Altaists" and "Anti-Altaists"; 3) to broaden the applicability of the term because the suffix -ic implies affinity while -an leaves room for an areal hypothesis; and 4) to eliminate the reference to the Altai mountains as a potential homeland.

In Robbeets and Savelyev, ed. (2020) there was a concerted effort to distinguish "Altaic" as a subgroup of "Transeurasian" consisting only of Turkic, Mongolic, and Tungusic, while retaining "Transeurasian" as "Altaic" plus Japonic and Koreanic.

Phonological and grammatical features
The original arguments for grouping the "micro-Altaic" languages within a Uralo-Altaic family were based on such shared features as vowel harmony and agglutination.

According to Roy Miller, the most pressing evidence for the theory is the similarities in verbal morphology.

The Etymological Dictionary by Starostin and others (2003) proposes a set of sound change laws that would explain the evolution from Proto-Altaic to the descendant languages. For example, although most of today's Altaic languages have vowel harmony, Proto-Altaic as reconstructed by them lacked it; instead, various vowel assimilations between the first and second syllables of words occurred in Turkic, Mongolic, Tungusic, Korean, and Japonic. They also included a number of grammatical correspondences between the languages.

Shared lexicon
Starostin claimed in 1991 that the members of the proposed Altaic group shared about 15–20% of apparent cognates within a 110-word Swadesh-Yakhontov list; in particular, Turkic–Mongolic 20%, Turkic–Tungusic 18%, Turkic–Korean 17%, Mongolic–Tungusic 22%, Mongolic–Korean 16%, and Tungusic–Korean 21%. The 2003 Etymological Dictionary includes a list of 2,800 proposed cognate sets, as well as a few important changes to the reconstruction of Proto-Altaic. The authors tried hard to distinguish loans between Turkic and Mongolic and between Mongolic and Tungusic from cognates; and suggest words that occur in Turkic and Tungusic but not in Mongolic. All other combinations between the five branches also occur in the book. It lists 144 items of shared basic vocabulary, including words for such items as 'eye', 'ear', 'neck', 'bone', 'blood', 'water', 'stone', 'sun', and 'two'.

Robbeets and Bouckaert (2018) use Bayesian phylolinguistic methods to argue for the coherence of the "narrow" Altaic languages (Turkic, Mongolic, and Tungusic) together with Japonic and Koreanic, which they refer to as the Transeurasian languages. Their results include the following phylogenetic tree:

Martine Robbeets et al. (2021) argues that early Transeurasian speakers were originally agriculturalists in Northeastern Asia, only becoming pastoralists later on.

The analysis conducted by Kassian et al. (2021) on a 110-item word list, specifically developed for each of the languages—Proto-Turkic, Proto-Mongolic, Proto-Tungusic, Middle Korean and Proto-Japonic— indicated support for the Altaic macrofamily. While acknowledging that considering prehistoric contacts as an alternative explanation for the results is plausible, they deem such a scenario less likely for Turkic and Japonic languages. This assessment is based on the substantial geographical distances involved, which can only be explained if a mutual relationship is assumed.

Weakness of lexical and typological data
According to G. Clauson (1956), G. Doerfer (1963), and A. Shcherbak (1963), many of the typological features of the supposed Altaic languages, particularly agglutinative strongly suffixing morphology and subject–object–verb (SOV) word order, often occur together in languages.

Those critics also argued that the words and features shared by Turkic, Mongolic, and Tungusic languages were for the most part borrowings and that the rest could be attributed to chance resemblances. They noted that there was little vocabulary shared by Turkic and Tungusic languages, though more shared with Mongolic languages. They reasoned that, if all three families had a common ancestor, we should expect losses to happen at random, and not only at the geographical margins of the family; and that the observed pattern is consistent with borrowing.

According to C. Schönig (2003), after accounting for areal effects, the shared lexicon that could have a common genetic origin was reduced to a small number of monosyllabic lexical roots, including the personal pronouns and a few other deictic and auxiliary items, whose sharing could be explained in other ways; not the kind of sharing expected in cases of genetic relationship.

The Sprachbund hypothesis
Instead of a common genetic origin, Clauson, Doerfer, and Shcherbak proposed (in 1956–1966) that Turkic, Mongolic, and Tungusic languages form a Sprachbund: a set of languages with similarities due to convergence through intensive borrowing and long contact, rather than common origin.

Asya Pereltsvaig further observed in 2011 that, in general, genetically related languages and families tend to diverge over time: the earlier forms are more similar than modern forms. However, she claims that an analysis of the earliest written records of Mongolic and Turkic languages shows the opposite, suggesting that they do not share a common traceable ancestor, but rather have become more similar through language contact and areal effects.

Hypothesis about the original homeland
The prehistory of the peoples speaking the "Altaic" languages is largely unknown. Whereas for certain other language families, such as the speakers of Indo-European, Uralic, and Austronesian, it is possible to frame substantial hypotheses, in the case of the proposed Altaic family much remains to be done.

Some scholars have hypothesised a possible Uralic and Altaic homeland in the Central Asian steppes.



Chaubey and van Driem propose that the dispersal of ancient Altaic language communities is reflected by the early Holocene dissemination of haplogroup C2 (M217): "If the paternal lineage C2 (M217) is correlated with Altaic linguistic affinity, as appears to be the case for Turkic, Mongolic and Tungusic, then Japanese is no Father Tongue, and neither is Korean. This Y-chromosomal haplogroup accounts for 11% of Korean paternal lineages, and the frequency of the lineage is even more reduced in Japan. Yet this molecular marker may still be a tracer for the introduction of Altaic language to the archipelago, where the paternal lineage has persisted, albeit in a frequency of just 6%."

Juha Janhunen hypothesized that the ancestral languages of Turkic, Mongolic, Tungusic, Korean, and Japanese were spoken in a relatively small area comprising present-day North Korea, Southern Manchuria, and Southeastern Mongolia. However Janhunen is sceptical about an affiliation of Japanese to Altaic, while András Róna-Tas remarked that a relationship between Altaic and Japanese, if it ever existed, must be more remote than the relationship of any two of the Indo-European languages. Ramsey stated that "the genetic relationship between Korean and Japanese, if it in fact exists, is probably more complex and distant than we can imagine on the basis of our present state of knowledge".

Supporters of the Altaic hypothesis formerly set the date of the Proto-Altaic language at around 4000 BC, but today at around 5000 BC or 6000 BC. This would make Altaic a language family older than Indo-European (around 3000 to 4000 BC according to mainstream hypotheses) but considerably younger than Afroasiatic (c. 10,000 BC or 11,000 to 16,000 BC  according to different sources).