Linguistic homeland

From Wikipedia, the free encyclopedia

In historical linguistics, the homeland or Urheimat (/ˈʊərhaɪmɑːt/, from German ur- "original" and Heimat "home") of a proto-language is the region in which it was spoken before splitting into different daughter languages. A proto-language is the reconstructed or historically attested parent language of a group of languages that are genetically related.

Depending on the age of the language family under consideration, its homeland may be known with near-certainty (in the case of historical or near-historical migrations) or it may be very uncertain (in the case of deep prehistory). In addition to internal linguistic evidence, the reconstruction of a prehistoric homeland draws on a variety of disciplines, including archaeology and archaeogenetics.


There are several methods to determine the homeland of a given language family. One method is based on the vocabulary that can be reconstructed for the proto-language. This vocabulary – especially terms for flora and fauna – can provide clues to the geographical and ecological environment in which the proto-language was spoken. An estimate of the time depth of the proto-language is necessary in order to account for prehistoric changes in climate and in the distribution of flora and fauna.[1][2]

Another method is based on the linguistic migration theory (first proposed by Edward Sapir), which states that the most likely candidate for the last homeland of a language family can be located in the area of its highest linguistic diversity.[2] This presupposes an established view about the internal subgrouping of the language family. Different assumptions about higher-order subgrouping can thus lead to very divergent proposals for a linguistic homeland (e.g. Isidore Dyen's proposal for New Guinea as the center of dispersal of the Austronesian languages).[1] The linguistic migration theory has its limits because it only works when linguistic diversity evolves continuously without major disruptions. Its results can be distorted, for example, when this diversity is wiped out by more recent migrations.[2]
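
The diversity principle can be illustrated with a short sketch. This is a toy illustration rather than an established procedure: it assumes a hypothetical list of (region, primary branch) pairs and simply ranks regions by how many distinct primary branches are attested in each. The function name, region names, and branch labels are all invented for the example.

```python
from collections import defaultdict


def rank_regions_by_diversity(attestations):
    """Rank regions by the number of distinct primary branches found there.

    `attestations` is an iterable of (region, primary_branch) pairs
    (hypothetical input format). Returns a list of (region, branch_count)
    tuples, most diverse first; ties are broken alphabetically by region.
    """
    branches_by_region = defaultdict(set)
    for region, branch in attestations:
        branches_by_region[region].add(branch)
    counts = [(region, len(branches))
              for region, branches in branches_by_region.items()]
    return sorted(counts, key=lambda item: (-item[1], item[0]))


# Invented toy data: three branches meet in Region A,
# while only Branch 3 has spread to Regions B and C.
toy = [
    ("Region A", "Branch 1"),
    ("Region A", "Branch 2"),
    ("Region A", "Branch 3"),
    ("Region B", "Branch 3"),
    ("Region C", "Branch 3"),
    ("Region C", "Branch 3"),  # duplicate attestations are counted once
]

ranking = rank_regions_by_diversity(toy)
print(ranking[0])  # the region with the most primary branches
```

Because the count depends entirely on which branches are treated as primary, a different subgrouping of the family changes the ranking, which is the sensitivity to subgrouping assumptions described above.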

Limitations of the concept

The concept of a (single, identifiable) "homeland" of a given language family implies a purely genealogical view of the development of languages. This assumption is often reasonable and useful, but it is by no means a logical necessity, as languages are well known to be susceptible to areal change such as substrate or superstrate influence.

Time depth

Over a sufficient period of time, in the absence of evidence of intermediary steps in the process, it may be impossible to observe linkages between languages that have a shared Urheimat: given enough time, natural language change will obliterate any meaningful linguistic evidence of a common genetic source. This general concern is a manifestation of the larger issue of "time depth" in historical linguistics.[3]

For example, the languages of the New World are believed to be descended from a relatively "rapid" peopling of the Americas (relative to the duration of the Upper Paleolithic) within a few millennia (roughly between 20,000 and 15,000 years ago),[4] but their genetic relationship has become completely obscured over the more than ten millennia which have passed between their separation and their first written record in the early modern period. Similarly, the Australian Aboriginal languages are divided into some 28 families and isolates for which no genetic relationship can be shown.[5]

The Urheimaten reconstructed using the methods of comparative linguistics typically estimate separation times dating to the Neolithic or later. It is undisputed that fully developed languages were present throughout the Upper Paleolithic, and possibly into the deep Middle Paleolithic (see origin of language, behavioral modernity). These languages would have spread with the early human migrations of the first "peopling of the world", but they are no longer amenable to linguistic reconstruction. The Last Glacial Maximum (LGM) imposed linguistic separation lasting several millennia on many Upper Paleolithic populations in Eurasia, as they were forced to retreat into "refugia" before the advancing ice sheets. After the end of the LGM, Mesolithic populations of the Holocene again became more mobile, and most of the prehistoric spread of the world's major linguistic families seems to reflect the expansion of population cores during the Mesolithic followed by the Neolithic Revolution.

The Nostratic theory is the best-known attempt to expand the deep prehistory of the main language families of Eurasia (excepting Sino-Tibetan and the languages of Southeast Asia) to the beginning of the Holocene. First proposed in the early 20th century, the Nostratic theory still receives serious consideration, but it is by no means generally accepted. The more recent and more speculative "Borean" hypothesis attempts to unite Nostratic with Dené–Caucasian and Austric, in a "mega-phylum" that would unite most languages of Eurasia, with a time depth going back to the Last Glacial Maximum.

The argument surrounding the "Proto-Human language", finally, is almost completely detached from linguistic reconstruction and instead concerns questions of phonology and the origin of speech. Time depths involved in the deep prehistory of all the world's extant languages are of the order of at least 100,000 years.[6]

Language contact and creolization

The concept of an Urheimat only applies to populations speaking a proto-language defined by the tree model. This is not always the case.

For example, in places where language families meet, the relationship between a group that speaks a language and the Urheimat for that language is complicated by the fact that "processes of migration, language shift and group absorption are documented by linguists and ethnographers" in groups that are themselves "transient and plastic." Thus, in the contact area in western Ethiopia between languages belonging to the Nilo-Saharan and Afroasiatic families, the Nilo-Saharan-speaking Nyangatom and the Afroasiatic-speaking Daasanach have been observed to be closely related to each other but genetically distinct from neighboring Afroasiatic-speaking populations. This reflects the fact that the Daasanach, like the Nyangatom, originally spoke a Nilo-Saharan language, with the ancestral Daasanach later adopting an Afroasiatic language around the 19th century.[7]

Creole languages are hybrids of languages that are sometimes unrelated. Similarities arise from the creole formation process, rather than from genetic descent.[8] For example, a creole language may lack significant inflectional morphology, lack tone on monosyllabic words, or lack semantically opaque word formation, even if these features are found in all of the parent languages from which the creole was formed.[9]


Some languages are language isolates. That is to say, they have no well-accepted language family connection, no nodes in a family tree, and therefore no known Urheimat. An example is the Basque language of northern Spain and southwestern France. Nevertheless, all languages evolve, so an unknown Urheimat may still be hypothesized, such as that for a Proto-Basque, and may be supported by archaeological and historical evidence.

Sometimes relatives are found for a language originally believed to be an isolate. An example is the Etruscan language, which, even though only partially understood, is believed to be related to the Rhaetic language and to the Lemnian language. A single family may be an isolate. In the case of the non-Austronesian indigenous languages of Papua New Guinea and the indigenous languages of Australia, there is no published linguistic hypothesis supported by any evidence that these languages have links to any other families. Nevertheless, an unknown Urheimat is implied. The entire Indo-European family itself is a language isolate: no further connections are known. This lack of information does not prevent some professional linguists from formulating additional hypothetical nodes (Nostratic) and additional homelands for the speakers.

Homelands of major language families




North America

  • Eskimo–Aleut – the Eskimo–Aleut languages originated in the region of the Bering Strait or Southwest Alaska.[15]
  • Na-Dené and Yeniseian – the Dené–Yeniseian hypothesis proposes that the Na-Dené languages of North America and the Yeniseian languages of Central Siberia share a common ancestor. It was originally suggested that this hypothesis would point to a homeland in Central or West Asia.[16] More recent research indicates that it is more likely that both language families originated in Beringia, with Yeniseian representing a "back-migration" of Native American populations to Asia.[17]
  • Uto-Aztecan – some authorities on the history of the Uto-Aztecan language group place the Proto-Uto-Aztecan homeland in the border region between the USA and Mexico, namely the upland regions of Arizona and New Mexico and the adjacent areas of the Mexican states of Sonora and Chihuahua, shown on the map (below left) roughly corresponding to the Sonoran Desert. The proto-language would have been spoken by foragers, about 5,000 years ago. Hill (2001) proposes instead a homeland further south, making the assumed speakers of Proto-Uto-Aztecan maize cultivators in Mesoamerica, who were gradually pushed north, bringing maize cultivation with them, during the period of roughly 4,500 to 3,000 years ago, the geographic diffusion of speakers corresponding to the breakup of linguistic unity.[18]

South America


Modern Dravidian languages

The Dravidian languages have been found mainly in South India since at least the second century BCE,[20] but Dravidian speakers may have been more widespread throughout India, including the northwest region,[21] before the arrival of Indo-European speakers.

Kolipakam et al. (2018) estimate the Dravidian language family to be approximately 4,500 years old.[22] According to Krishnamurti, linguistic evidence suggests that the South Dravidian language group had separated from a Proto-Dravidian language around 1100 BCE.[23] Russian linguist M.S. Andronov puts the split between Tamil (a written Southern Dravidian language) and Telugu (a written Central Dravidian language) between 1500 BCE and 1000 BCE.[24]

Hypotheses regarding the original homeland have centered on the Indus Valley Civilization. According to Asko Parpola, the Indus sign system represented an ancient Dravidian language.[25] In the 1970s David McAlpin proposed the Elamo-Dravidian hypothesis, suggesting an origin in Elam, whose Elamite language was spoken in the hills to the east of the ancient Sumerian civilization, with which the Indus Valley Civilization traded and shared domesticated species. This theory is mostly rejected.


The countries and autonomous regions where a Turkic language has official status.

There is considerable dispute over the time and place of origin of the Turkic languages, with candidates for their ancient homeland ranging from the Transcaspian steppe to Manchuria in Northeast Asia and south-central Siberia.[26][27][28] The lack of written records prior to the earliest Chinese accounts, and the fact that the early Turkic peoples were nomadic pastoralists, and hence mobile, make localizing and dating the earliest homeland of the Turkic languages difficult. Attempts to localize the proto-Turkic Urheimat are usually connected with the early archaeological horizon of west and central Siberia and the region south of it.[29]

By the 6th century CE, the Turkic peoples lived in the Eurasian Steppe, including North China (especially Xinjiang Province), Inner Mongolia, Mongolia and the West Siberian Plain, possibly as far west as Lake Baikal and the Altai Mountains. After the Turkic migration, by the 10th century CE, most of Central Asia, formerly dominated by Iranian peoples, was settled by Turkic tribes. From the 11th century, the Seljuk Turks invaded Anatolia, ultimately resulting in permanent Turkic settlement there and the establishment of the Turkish nation. The Turkic languages are now spoken in Turkey, Iran, Central Asia and Siberia.

The inferred population genetic contributions of Turkic populations show a cline from a high point in the East to a low point in the West.[30] In Turkey, the Central Asian contribution to the local population genetic mix is about 9%.[31]


Korea in 576 CE.

The Korean language is spoken in Korea and among emigrants from Korea. Conservative historical linguists tend to classify the Korean language as a language isolate, although others suggest a relationship to the proposed Altaic language family or to the Japonic languages.

Old Korean is attested in Chinese histories, in the Three Kingdoms period of Korea (ca. 0 to 900 CE), when the Silla Kingdom (in Eastern Korea), Baekje Kingdom (in Southwestern Korea), and Goguryeo Kingdom (in Northern Korea) were simultaneously present on the Korean peninsula, although Korean was not a literary language until later; the hangul script of Korean was invented in the 15th century CE (the earlier Idu script dates to the 6th century CE).

There was a group of similar languages, called the Koguryoic languages, in the northern Korean Peninsula and southern Manchuria. According to Chinese records, it included the languages of Buyeo, Goguryeo, Baekje, Dongye, Okjeo, and possibly Gojoseon, but was different from ancient Tungusic languages like Mohe. Gojoseon was a kingdom in Northern Korea that is said by tradition to have been founded in 2333 BCE (archaeological evidence and Chinese histories support a cultural civilization from around 1500 BCE and a kingdom fused from a federation of smaller states around the 7th century BCE), that was conquered by Han Dynasty China in 108 BCE, and that re-emerged from Chinese rule as the Kingdom of Buyeo. The Three Kingdoms era kingdoms of Goguryeo and Baekje were successors to the Kingdom of Buyeo. Dongye was a vassal state of Goguryeo in Northeast Korea, founded in the 3rd century BCE, that was eventually absorbed by Goguryeo around the 5th century CE. Okjeo was a minor state in Northern Korea, to the north of Dongye, that was a subordinate unit of Gojoseon from the 3rd century BCE to 108 BCE, then came under Han rule, and then was a subordinate state of Goguryeo. None of these Buyeo language family kingdoms ever included the Kingdom of Silla, which was just a small kingdom on the southern coast of Korea until the Three Kingdoms period, during which it expanded and conquered the other two kingdoms.

Linguists including Christopher Beckwith argue for Japanese as a descendant of Goguryeo, and for Korean as a descendant of the Silla language, based on lexical similarities between Goguryeo and Japanese, and on Silla's ultimate triumph in the quest for political control of Korea. Other linguists, including Kim Banghan, Alexander Vovin, and J. Marshall Unger, argue that Japanese is related to the pre-Goguryeo language of the central and southern part of the Korean peninsula, including what would become the Kingdom of Silla, and that Old Korean is Goguryeo with a pre-Goguryeo Japonic substrate. They argue this in part because Japanese-like toponyms found in the historical homeland of Silla were also distributed in the southern part of the Korean peninsula, and are not found in the northern part of the Korean peninsula or in south-western Manchuria.[32] None of the extinct languages is attested in writing well enough to reach definitive conclusions resolving the debate.


The Japonic languages are spoken in Japan and among emigrants from Japan, and are attested in Japanese-language writing from the 8th century CE and in imperfect Chinese transcriptions from the late 5th century CE. Conservative historical linguists tend to classify a small number of Japanese languages as a language family of their own. The Ainu languages are a barely surviving family of languages or dialects that are spoken by indigenous populations on the island of Hokkaidō in what is now northern Japan.

There are similarities between the Japanese language and the Korean language in lexicon and grammatical features, but there is dispute over whether these denote a common origin or mere linguistic borrowing within a sprachbund of neighboring languages. Samuel E. Martin, Roy Andrew Miller, and Sergei Starostin are linguists who have argued that they have common origins.[33][34][35][36][37] In contrast, Alexander Vovin has argued for a regional borrowing model to explain the linguistic similarities.[38]

One hypothesis proposes that Japanese is a relative of the extinct languages spoken by the Buyeo-Goguryeo cultures of Korea, southern Manchuria, and Liaodong of which the best attested is the extinct language Goguryeo.[39][40][41] This proposal is attributed to Shinmura Izuru, who proposed it in 1916. Modern Korean, in contrast, according to proponents of this hypothesis, appears to have stronger connections to the Silla language, spoken in the ancient kingdom of Silla (57 BCE – 935 CE), one of the Three Kingdoms of Korea, whose similarity to the Goguryeo language is not clearly established.

The earliest Chinese historical records concerning the "Wa" in Japan indicate that they were fractured into many warring states. Modern Japanese dialects, however, show a common origin rather than a "bushy" one. It is therefore possible that there were many Yayoi dialects in the period before Old Japanese emerged, and that the dialect of the state that ultimately prevailed politically as the Japanese state was unified superseded the other early Yayoi languages or dialects.[42]

After a new wave of immigration by the Yayoi people, probably from the Korean Peninsula some 2,300 years ago, the Jōmon were pushed into northern Japan. Genetic data suggest that modern Japanese are descended from both the Yayoi and the Jōmon. Tradition, as documented by the Nihon Shoki, a legendary account of Japan's history, puts the date of the Yayoi arrival in Japan at 660 BCE. Chinese historical records mention the existence of the Yayoi (called "Wa") starting in 57 BCE. The existing Japanese language has its origins at approximately this point in time, if not earlier (to the extent that Japanese derives primarily from either the language of the Bronze Age Yayoi people, as it existed prior to their arrival in Japan, or a language of the Jōmon at that point in time, rather than being a creole of some sort). Skeletal remains suggest that the two cultures had fused into a group with a homogeneous physical appearance in Southern Japan by 250 CE.[42] It is possible that the Japanese language has roots related to the Ainu language or to the historical language of the Yayoi, whatever that may have been, or that it was a creole of both. It is also possible that the Japanese language has roots in a language spoken in Southern Japan that is lost and now unknown.[42]

Location of Ezo

The Ainu people are genetic descendants of the Jōmon, with some contribution from the Okhotsk people.[43] The Ainu languages are now spoken by Ainu minorities in Hokkaidō and were formerly spoken in southern and central Sakhalin and the Kuril Islands (an area also known as Ezo), and perhaps in northern Honshū by the Emishi people (until approximately 1000 CE). They are associated with the founding Jōmon people of Japan from more than 14,000 years ago, and with the Satsumon culture of Hokkaidō, although the Ainu also had contact with the Paleo-Siberian Okhotsk culture, whose modern descendants include the Nivkh people (whose original homeland was mostly occupied by the Tungusic peoples) and which could have linguistically influenced the Ainu language.[44] As a result of this important outside cultural influence, it is impossible to know with certainty how similar the original language of the Jōmon people was to that spoken by the Ainu people today. Some linguists have suggested other language family connections for the Ainu language: Shafer has suggested a distant connection to the Austroasiatic languages.[45] Vovin viewed that suggestion as merely preliminary.[46] Japanese linguist Shichirō Murayama tried to link Ainu to the Austronesian languages, which include the languages of the Philippines, Taiwan, and Indonesia, through both vocabulary and cultural comparisons. There is no consensus, however, that the Ainu languages have sources in any other known language, and the unique population genetics of the Ainu people support the hypothesis that they were largely isolated from the rest of the world for many thousands of years.

The Yayoi people had strong physical, genetic and cultural similarities to the Chinese during the Western Han Dynasty (202 BCE – 9 CE) in the Jiangsu province on China's eastern coast.[47] The Yayoi also have strong cultural similarities to the Koreans of that time period.[42][48]

Location of Ryukyu Islands

Some linguists, such as Turchin,[49] see a connection between Japanese and Korean and an Altaic language family or similar larger grouping of languages, with those speakers coming from an area north of Korea, based in part upon similarities in lexical roots. The statistical method used by Turchin, however, would not discriminate between Jōmon and Yayoi sources for any Altaic linguistic affinities. Turchin's analysis also did not look at the various proposed ancient predecessors of the Korean language in Korea or the relationship of those languages to any of the proto-Altaic languages, despite the fact that the hypothesis would require one of those ancient Korean peninsular languages to be intermediate between Japanese and one of the proto-Altaic languages. Old Japanese, when first attested, had eight vowels rather than the current five (the additional vowels were lost within a century of the oldest preserved writings), which was close to the vowel system seen in Uralic and Altaic languages.[50] Old Japanese also had more grammatical similarity to Altaic languages than modern Japanese does.

These classifications of the origins of the Japanese language ignore significant borrowing from other languages in recent times. Current estimates are that "wago" (i.e. words attributable to the original Yayoi language) make up 33.8% of the Japanese lexicon, that "kango" (i.e. words with roots borrowed from Chinese since the 5th century CE) make up 49.1% of Japanese words (in addition to the Chinese ideograms used in the Japanese written language), that foreign words called gairaigo make up 8.8% of Japanese words, and that 8.3% of Japanese words are konshugo that draw upon multiple languages.[51] This account attributes only a small number of words in modern Japanese to Ainu roots.

The six Ryukyuan languages, spoken in the islands to the south of Japan, are descended from Proto-Japonic but are not mutually intelligible with Japanese, with which they share about 72% of their words, or with each other. They started to diverge from Japanese around the 7th century CE. These islands were united in a Ryukyuan kingdom from 1429 CE (prior to that there were multiple divided kingdoms, which were tributary states of China after 1372 CE); the kingdom was a tributary state of China until 1609, when it became a vassal state of Japan, and it was annexed by Japan in 1879. These languages were then suppressed, and while they have about a million native speakers, there are relatively few native speakers under the age of twenty. They are effectively minority languages at this point due to the government's recognition of them as dialects.


Europe during the Neolithic period

The Uralic homeland is unknown. A possible focus is the Comb Ceramic Culture of ca. 4200 – ca. 2000 BCE (shown on the map to the right). This is suggested by the high language diversity around the middle Volga River, where three highly distinct branches of the Uralic family, Mordvinic, Mari, and Permic, are located. Reconstructed plant and animal names (including spruce, Siberian pine, Siberian fir, Siberian larch, brittle willow, elm, and hedgehog) are consistent with this location. This is adjacent to the proposed homeland for Proto-Indo-European under the Kurgan hypothesis.

As noted below, many notable linguists have proposed that the Eskimo-Aleut languages and Uralic languages have a common origin, although there is no consensus that this connection is genuine. A genetic relationship between Uralic and the Indo-European languages has also been proposed (see Indo-Uralic languages).


The Afro-Asiatic languages include Arabic, Hebrew, Berber, and a variety of other languages now found mostly in Northeast Africa, although the exact boundaries of this language family are disputed in the case of a small number of languages spoken by small numbers of individuals in a few localized areas of Sudan and East Africa.

The limited area of the Afro-Asiatic Sprachraum (prior to its expansion to new areas in the historic era) has limited the potential areas where that family's Urheimat could be. Generally speaking, two proposals have been developed: that Afro-Asiatic arose in a Semitic Urheimat in the Middle East (also known as Southwest Asia), or that Afro-Asiatic languages arose in northeast Africa (generally, either between Darfur and Tibesti or in Ethiopia and the other countries of the Horn of Africa). The African hypothesis is considered to be rather more likely at the present time, because of the greater diversity of languages with more distant relationships to each other there.

There have been serious linguistic proponents of almost every conceivable set of relationships of the Afro-Asiatic language subfamilies to each other, although there is reasonably broad consensus concerning the subfamily classification of all but a few of the Afro-Asiatic languages. Some of this difficulty in resolving the Afro-Asiatic family tree flows from the time depth of these languages. The Afro-Asiatic Egyptian language of ancient Egypt (whose latest stage is known as Coptic) is one of the two oldest written languages on Earth (the other being the Sumerian language, a language isolate), dating in written form to approximately 3000 BCE, and the Semitic Akkadian language was also attested in writing from a very early date (ca. 2000 BCE). A common Afro-Asiatic proto-language is necessarily older than these very old written languages, which belonged to language families that had already diverged from each other considerably by that point. There is also no single genetic profile that is uniform among Afro-Asiatic language speakers and clearly unites them. There are also competing theories on whether the Afro-Asiatic language family owes its expansion to the Neolithic revolution, which originated in an area that includes the range of the Afro-Asiatic languages, or was already widespread in the Upper Paleolithic era.


There has been speculation regarding the specific Semitic subfamily of Afro-Asiatic languages, again with the Horn of Africa and Southwest Asia—specifically the Levant—being the most common proposals. The large number of Semitic languages present in the Horn of Africa seems at first glance to support the hypothesis that the Semitic homeland lies there. However, the Semitic languages in the Horn of Africa all belong to the South Semitic subfamily and appear to all have relatively recent common origins in a single Ethio-Semitic proto-language, while the East and Central Semitic languages are native solely to Asia. These features, and the presence of certain common Semitic lexical items in all Ethio-Semitic languages referring to items that arrived in Africa from the Levant at a time after Semitic languages were known to have been spoken in the Levant, have lent weight to the Levantine proposal.

Hebrew is relatively closely related to the Arabic language even within the Semitic language family, being part of the same Central Semitic group.

The Maltese language, the only other Semitic language of Europe, is a derivative of the Arabic language as it was spoken in Sicily starting sometime after the rise of the Islamic empire in North Africa.


The homeland of the Niger–Congo languages, which has as its subfamily the Benue–Congo languages, which in turn includes the Bantu languages, is not known in time or place, beyond the fact that it probably originated in or near the area where these languages were spoken prior to Bantu expansion (i.e. West Africa or Central Africa) and probably predated the Bantu expansion of ca. 3000 BCE through 500 CE by many thousands of years.[52][53] Its expansion may have been associated with the expansion of Sahel agriculture in the African Neolithic period.[52]

According to linguist Roger Blench, as of 2004, all specialists in Niger–Congo languages believe the languages to have a common origin, rather than merely constituting a typological classification, for reasons including their shared noun-class system, their shared verbal extensions and their shared basic lexicon.[54][55] Similar classifications have been made ever since Diedrich Westermann in 1922.[56] Joseph Greenberg continued that tradition, making it the starting point for modern linguistic classification in Africa, with some of his most notable publications going to press starting in the 1960s.[57] However, there has been active debate for many decades over the appropriate subclassifications of the languages in that language family, which is a key tool used in localizing a language's place of origin.[54] No definitive "Proto-Niger–Congo" lexicon or grammar has been developed for the language family as a whole.

An important unresolved issue in determining the time and place where the Niger–Congo languages originated, and their range prior to recorded history, is this language family's relationship to the Kordofanian languages now spoken in the Nuba mountains of Sudan, an area that is not contiguous with the remainder of the Niger–Congo-speaking region and lies at the northeasternmost extent of the current Niger–Congo linguistic region. The current prevailing linguistic view is that Kordofanian languages are part of the Niger–Congo language family, and that among the many languages still surviving in that region these may be the oldest.[58] The evidence is insufficient to determine whether this outlier group of Niger–Congo language speakers represents a prehistoric range of a Niger–Congo linguistic region that has since contracted as other languages have intruded, or whether it instead represents a group of Niger–Congo language speakers who migrated to the area at some point in prehistory, where they were an isolated linguistic community from the beginning.

The prehistoric range of the Niger–Congo languages has implications not just for the history of the Niger–Congo languages, but also for the origins of the Afro-Asiatic languages and Nilo-Saharan languages, whose homelands have been hypothesized by some to overlap with the Niger–Congo linguistic range prior to recorded history. If the consensus view regarding the origins of the Nilo-Saharan languages which came to East Africa is adopted, and a North African or Southwest Asian origin for Afro-Asiatic languages is assumed, the linguistic affiliation of East Africa prior to the arrival of Nilo-Saharan and Afro-Asiatic languages is left open. The overlap between the potential areas of origin for these languages in East Africa is particularly notable because it includes the regions from which the Proto-Eurasians who brought anatomically modern humans Out of Africa, and presumably their original proto-language or languages, originated.

However, there is more agreement regarding the place of origin of the Benue–Congo subfamily of languages, which is the largest subfamily of the group, and the place of origin of the Bantu languages and the time at which they started to expand are known with great specificity.

The classification of the relatively divergent Ubangian languages, which are centered in the Central African Republic, as part of the Niger–Congo family, where Greenberg placed them in 1963 and later scholars concurred,[59] was called into question by the linguist Gerrit Dimmendaal in a 2008 article.[60]


The Benue–Congo homeland

Roger Blench, relying particularly on prior work by Kay Williamson of the University of Port Harcourt and by the linguist P. De Wolf, who each took the same position, has argued that Benue–Congo, a subfamily of Niger–Congo that includes the Bantu languages and other related languages and would be the family's largest branch, is an empirically supported grouping that probably originated at the confluence of the Benue and Niger Rivers in central Nigeria.[54][61][62][63][64][65] These estimates of the place of origin of Benue–Congo do not fix a date for the start of its expansion, other than that it must have preceded the Bantu expansion by long enough to allow for the diversification of the languages within the subfamily, which includes Bantu.

There is widespread consensus among linguists that the Bantu languages of the Niger–Congo family had their homeland near the coastal boundary of Nigeria and Cameroon, from which they expanded rapidly starting about 3000 BCE.[66][52][67][68][69][70][71]


The Sino-Tibetan languages

The Sino-Tibetan Urheimat has long been debated, with various scholars supporting an origin in North China, in West China, or in the Himalayas. Population genetic evidence favors an origin for the Proto-Sino-Tibetan languages in the upper and middle Yellow River basin, with part of that source population branching off to settle in the Himalayas. On this account, the population that would give rise to the Chinese language split from the population that would give rise to the larger Sino-Tibetan language family during the East Asian Neolithic era:[72]

"[T]he closest relatives of the Tibetans are the Yi people, who live in the Hengduan Mountains and were originally formed through fusion with natives along their migration routes into the mountains. The Tibetan and Yi languages belong to the Tibeto-Burman language group and their ancestries can be traced back to an ancient tribe, the Di-Qiang . . . After the ancestors of Sino-Tibetans reached the upper and middle Yellow River basin, they divided into two subgroups: Proto-Tibeto-Burman and Proto-Chinese. . . . The ancestral component which was dominant in Tibetan and Yi arose from the Proto-Tibeto-Burman subgroup, which marched on to south-west China and later, through one of its branches, became the ancestor of modern Tibetans. Proto-Tibeto-Burmans also spread over the Hengduan Mountains where the Yi have lived for hundreds of generations. Taking the optimal living condition and the easiest migration route into account, we favor the single-route hypothesis; it is more likely that their migration into the Tibetan Plateau through the Hengduan Mountain valleys occurred after Tibetan ancestors separated from the other Proto-Tibeto-Burman groups and diverged to form the modern Tibetan population."

According to the Sino-Tibetan Etymological Dictionary and Thesaurus project of the University of California at Berkeley (2011), the Proto-Sino-Tibetan (PST) homeland may have been in the general area of the eastern Tibetan Plateau. Regarding the time depth of the Sino-Tibetan separation, the project estimates an age of at least 6,000 years, comparable to the age of Proto-Indo-European.[73] Some scholars place the Tibeto-Burman homeland in the area encompassing western Sichuan, northern Yunnan and eastern Tibet.[74]

Additional studies also place the Sino-Tibetan homeland in northern China, near the Yellow River basin.[75][76] One of the earliest Neolithic cultures of China in the upper to middle Yellow River basin was the Peiligang culture of 7000 BCE to 5000 BCE, so the population genetic account quoted above refers to a date in or after this period. The Neolithic era ended in the Yellow River region around 1500 BCE. This is not inconsistent with the linguistically based estimate from the Sino-Tibetan Etymological Dictionary and Thesaurus project. By the early and middle Zhou Dynasty (1122 BCE–256 BCE), the language spoken in the Zhou court had become the standardized dialect for that kingdom.[77]

In contrast, four of the other main language families of East Asia and Southeast Asia outside Sino-Tibetan, namely Austroasiatic, Austronesian, Hmong–Mien and Kra–Dai, are generally believed to have had their origins, at some stage of their development, in South China.


Austroasiatic languages

The homeland of the Austroasiatic languages (e.g. Vietnamese, Cambodian), which are found from Southeast Asia to India, is hypothesized to have been located in "the hills of southern Yunnan in China," between 4000 BCE and 2000 BCE,[78] with influences from Aryan and Dravidian languages at the western edge of its expanse in India, and influence from Chinese at the eastern edge of the regions where it is found. The disjoint distribution of the Austroasiatic languages suggests that they were once spoken in most of the areas where the Kra–Dai languages (e.g. Thai, Lao) are now dominant.

However, Paul Sidwell has recently advocated a homeland in Southeast Asia instead,[79] preferring a late date of dispersal of about 2000 BCE.[80]

There is a strong correlation between the distribution of Y-chromosomal haplogroup O2a1-M95 and the distribution of Austroasiatic language speakers.[81]

See also


  1. ^ a b Blust, Robert (1984). "The Austronesian Homeland: A Linguistic Perspective". Asian Perspectives. 26 (1): 45–67.
  2. ^ a b c Campbell, Lyle (2013). Historical Linguistics: An Introduction (3rd ed.). Edinburgh University Press. pp. 423ff.
  3. ^ Renfrew, Colin; McMahon, April; Trask, Larry, eds. (1999). Time Depth in Historical Linguistics. ISBN 978-1-902937-06-9.
  4. ^ O'Rourke, Dennis H.; Raff, Jennifer A. (2010), "The Human Genetic History of the Americas: The Final Frontier", Current Biology, 20 (4): R202–7, doi:10.1016/j.cub.2009.11.051, PMID 20178768, S2CID 14479088
  5. ^ Bowern, Claire; Atkinson, Quentin (2012). "Computational Phylogenetics and the Internal Structure of Pama-Nyungan". Language. 88 (4): 817–845. Kayser, Manfred (2010), "The Human Genetic History of Oceania: Near and Remote Views of Dispersal", Current Biology, 20 (4): R194–201, doi:10.1016/j.cub.2009.12.004, PMID 20178767, S2CID 7282462
  6. ^ Bengtson and Ruhlen (1994) offered a list of 27 "global etymologies". Bengtson, John D. and Merritt Ruhlen. 1994. "Global etymologies" Archived 2007-09-28 at the Wayback Machine. In Ruhlen 1994a, pp. 277–336. This approach has been criticized as flawed by Campbell and Poser (2008) who used the same criteria employed by Bengtson and Ruhlen to identify "cognates" in Spanish known to be false. Campbell, Lyle, and William J. Poser. 2008. Language Classification: History and Method. Cambridge: Cambridge University Press, 370–372.
  7. ^ Poloni, ES; Naciri, Y; Bucho, R; Niba, R; Kervaire, B; Excoffier, L; Langaney, A; Sanchez-Mazas, A. (Nov 2009). "Genetic evidence for complexity in ethnic differentiation and history in East Africa". Ann Hum Genet. 73 (6): 582–600. doi:10.1111/j.1469-1809.2009.00541.x. PMID 19706029. S2CID 2488794.
  8. ^ McWhorter, J. H. (1998), "Identifying the Creole Prototype: Vindicating a Typological Class", Language, 74 (4): 788–818, doi:10.2307/417003, JSTOR 417003
  9. ^ McWhorter, John H. (1999), "The Afrogenesis Hypothesis of Plantation Creole Origin", in Huber, Magnus; Parkvall, Mikael (eds.), Spreading the Word: The Issue of Diffusion among the Atlantic Creoles, London: Westminster University Press, pp. 111–152
  10. ^ Dimmendaal, Gerrit J. (2020). "Nilo-Saharan and Its Limits". In Rainer Vossen; Gerrit J. Dimmendaal (eds.). The Oxford Handbook of African Languages. Oxford: Oxford University Press. pp. 364–382. doi:10.1093/oxfordhb/9780199609895.013.15.
  11. ^ Anthony, David W.; Ringe, Don (1 January 2015). "The Indo-European Homeland from Linguistic and Archaeological Perspectives". Annual Review of Linguistics. 1 (1): 199–219. doi:10.1146/annurev-linguist-030514-124812. ISSN 2333-9683.
  12. ^ Roger Blench, "Stratification in the peopling of China: how far does the linguistic evidence match genetics and archaeology?," Paper for the Symposium "Human migrations in continental East Asia and Taiwan: genetic, linguistic and archaeological evidence". Geneva June 10–13, 2004. Université de Genève.
  13. ^ Ostapirat, Weera. (2005). "Kra–Dai and Austronesian: Notes on phonological correspondences and vocabulary distribution", pp. 107–131 in Sagart, Laurent, Blench, Roger & Sanchez-Mazas, Alicia (eds.), The Peopling of East Asia: Putting Together Archaeology, Linguistics and Genetics. London/New York: Routledge-Curzon.
  14. ^ Blust, Robert (2013). The Austronesian Languages (revised ed.). Australian National University. p. 756. hdl:1885/10191. ISBN 978-1-922185-07-5.
  15. ^ Holton, Gary. "Language Relationships". Alaska Native Language Center. Retrieved 19 November 2020.
  16. ^ Ruhlen, Merritt (10 November 1998). "The origin of the Na-Dene". Proceedings of the National Academy of Sciences. 95 (23): 13994–13996. doi:10.1073/pnas.95.23.13994. ISSN 0027-8424.
  17. ^ Sicoli, Mark A.; Holton, Gary (12 March 2014). "Linguistic Phylogenies Support Back-Migration from Beringia to Asia". PLOS ONE. 9 (3): e91722. doi:10.1371/journal.pone.0091722. ISSN 1932-6203.
  18. ^ Jane H. Hill, "Proto-Uto-Aztecan", American Anthropologist, 2001. JSTOR 684121.
  19. ^ Rodrigues, Aryon Dall'Igna, and Ana Suelly Arruda Câmara Cabral (2012). "Tupían". In Campbell, Lyle, and Verónica Grondona (eds). The indigenous languages of South America: a comprehensive guide. Berlin: De Gruyter Mouton.
  20. ^ Inscriptions, ed. I. Mahadevan 2003
  21. ^ "Dravidian languages." Encyclopædia Britannica. 2008. Encyclopædia Britannica Online. 5 June 2008
  22. ^ "Dravidian language family is approximately 4,500 years old, new linguistic analysis finds". ScienceDaily. Retrieved 2018-05-17.
  23. ^ Krishnamurti, Bhadriraju (2003). The Dravidian Languages. Cambridge University Press. ISBN 978-0-521-77111-5. Lay summary: Frontline (Chennai) 20 (22) (October 25, 2003).
  24. ^ Moorti, Etukoori Balaraama in Andhra Samkshipta Charitra. "Proto-Dravidian Study of Dravidian Linguistics and Civilization".
  25. ^ Parpola, Asko. "Introduction to Study of the Indus Script". Archived from the original on 2010-04-20. Retrieved 2010-03-01.
  26. ^ Yunusbayev, B. (2014). "The Genetic Legacy of the Expansion of Turkic-Speaking Nomads Across Eurasia". PLOS Genetics. 11 (4): e1005068. bioRxiv 10.1101/005850. doi:10.1371/journal.pgen.1005068. PMC 4405460. PMID 25898006. S2CID 196602588.
  27. ^ Römer, Claudia. Von den Hunnen zu den Türken – dunkle Vorgeschichte, in: Zentralasien. 13. bis 20. Jahrhundert. Geschichte und Gesellschaft, Wien 2006, p. 61
  28. ^ Róna-Tas, András. "The Reconstruction of Proto-Turkic and the Genetic Question." In: The Turkic Languages, pp. 67–80. 1998.
  29. ^ Róna-Tas, András. "The Reconstruction of Proto-Turkic and the Genetic Question." In: The Turkic Languages, pp. 67–80. 1998.
  30. ^ Martínez-Cruz, Begoña; Vitalis, Renaud; Ségurel, Laure; et al. (8 September 2010). "In the heartland of Eurasia: the multilocus genetic landscape of Central Asian populations". European Journal of Human Genetics. 19 (2): 216–23. doi:10.1038/ejhg.2010.153. PMC 3025785. PMID 20823912.
  31. ^
  32. ^ Blažek, Václav. 2006. "Current progress in Altaic etymology." Linguistica Online, 30 January 2006
  33. ^ Martin, Samuel E. (1966): Lexical Evidence Relating Japanese to Korean. Language 42/2: 185–251.
  34. ^ Martin, Samuel E. (1990): Morphological clues to the relationship of Japanese and Korean. In: Philip Baldi (ed.): Linguistic Change and Reconstruction Methodology. Trends in Linguistics: Studies and Monographs 45: 483–509.
  35. ^ Miller, Roy Andrew (1971): Japanese and the Other Altaic Languages. Chicago: University of Chicago Press. ISBN 0-226-52719-0.
  36. ^ Miller, Roy Andrew (1996): Languages and History: Japanese, Korean and Altaic. Oslo: Institute for Comparative Research in Human Culture. ISBN 974-8299-69-4.
  37. ^ Sergei Starostin. Altaiskaya problema i proishozhdeniye yaponskogo yazika (The Altaic Problem and the Origins of the Japanese Language).
  38. ^ Vovin, Alexander: Koreo-Japonica. University of Hawai'i Press. 2008.
  39. ^ Beckwith, Christopher I. 2004. Koguryo: The Language of Japan's Continental Relatives: An Introduction to the Historical-Comparative Study of the Japanese-Koguryoic Languages. Leiden: Brill.
  40. ^ Beckwith, Christopher I. 2006. "Methodological observations on some recent studies of the early ethnolinguistic history of Korea and vicinity." Altai Hakpo 16, 199–234.
  41. ^ Beckwith, Christopher I. 2006b. "The ethnolinguistic history of the early Korean peninsula region: Japanese-Koguryoic and other languages in the Koguryo, Paekche, and Silla kingdoms." (page 33 ff.) Journal of Inner and East Asian Studies 2.2, 34–64.
  42. ^ a b c d Diamond, Jared (June 1998). "Japanese Roots". Discover. 19 (6).
  43. ^ Nakahori, Yutaka (2005). Y染色体からみた日本人 (Y Senshokutai kara Mita Nihonjin). Iwanami Science Library. ISBN 978-4-00-007450-6.
  44. ^ Chaussonnet, Valerie (1995) Native Cultures of Alaska and Siberia. Page 35. Arctic Studies Center. Washington, D.C. 112p. ISBN 1-56098-661-1
  45. ^ Shafer, R. (1965). "Studies in Austroasian II". Studia Orientalia 30 (5).
  46. ^ Vovin, Alexander (1993). A Reconstruction of Proto-Ainu. Leiden: Brill. ISBN 90-04-09905-0.
  47. ^ 渡来系弥生人の故郷を中国に求めて [Searching in China for the origin of the Yayoi people]. Long Journey to Prehistorical Japan (in Japanese). National Museum of Nature and Science (Japan). Archived from the original on 2015-04-21. Retrieved 2013-09-19.
  48. ^ Mark J. Hudson (1999). Ruins of Identity: Ethnogenesis in the Japanese Islands. University of Hawai'i Press. ISBN 0-8248-2156-4.
  49. ^ Turchin, Peter; Peiros, Ilia; Gell-Mann, Murray. "Analyzing Genetic Connections between Languages by Matching Consonant Classes" (PDF). Archived from the original (PDF) on 2011-07-21. Retrieved 2010-05-24.
  50. ^ 大野 晋 (1982)『仮名遣いと上代語』(岩波書店)p.65
  51. ^ 新選国語辞典, 金田一京助, 小学館, 2001, ISBN 4-09-501407-5
  52. ^ a b c Jared Diamond, "Guns, Germs and Steel" (2000)
  53. ^ Bostoen, Koen (2018-04-26). "The Bantu Expansion". Oxford Research Encyclopedia of African History. doi:10.1093/acrefore/9780190277734.013.191. ISBN 9780190277734.
  54. ^ a b c Blench, Roger (2004). THE BENUE-CONGO LANGUAGES: A PROPOSED INTERNAL CLASSIFICATION (Unpublished Working Draft) (PDF).
  55. ^ See also Bendor-Samuel, J. ed. 1989. The Niger–Congo Languages. Lanham: University Press of America.
  56. ^ Westermann, D. 1922a. Die Sprache der Guang. Berlin: Dietrich Reimer.
  57. ^ Greenberg, J.H. 1964. Historical inferences from linguistic research in sub-Saharan Africa. Boston University Papers in African History, 1:1–15.
  58. ^ Herman Bell. 1995. The Nuba Mountains: Who Spoke What in 1976?. (The published results from a major project of the Institute of African and Asian Studies: the Language Survey of the Nuba Mountains.)
  59. ^ Williamson, Kay & Blench, Roger (2000) 'Niger–Congo', in Heine, Bernd & Nurse, Derek (eds.) African languages: an introduction, Cambridge: Cambridge University Press.
  60. ^ Gerrit Dimmendaal (2008) "Language Ecology and Linguistic Diversity on the African Continent", Language and Linguistics Compass 2/5:841.
  61. ^ Williamson, K. 1971. The Benue–Congo languages and Ijo. Current Trends in Linguistics, 7. ed. T. Sebeok 245–306. The Hague: Mouton.
  62. ^ Williamson, K. 1988. Linguistic evidence for the prehistory of the Niger Delta. The early history of the Niger Delta, edited by E.J. Alagoa, F.N. Anozie and N. Nzewunwa. Hamburg: Helmut Buske Verlag.
  63. ^ Williamson, K. 1989. Benue–Congo Overview. In The Niger–Congo Languages. J. Bendor-Samuel ed. Lanham: University Press of America.
  64. ^ De Wolf, P. 1971. The noun class system of Proto-Benue–Congo. The Hague: Mouton.
  65. ^ Blench, R.M. 1989. A proposed new classification of Benue–Congo languages. Afrikanische Arbeitspapiere, Köln, 17:115–147.
  66. ^ Michael C. Campbell and Sarah A. Tishkoff, "The Evolution of Human Genetic and Phenotypic Variation in Africa," Current Biology, Volume 20, Issue 4, R166–R173, 23 February 2010
  67. ^ Greenberg, J.H. 1972. Linguistic evidence regarding Bantu origins. Journal of African History, 13.
  68. ^ Vansina, J. (1995), "New Linguistic Evidence and the 'Bantu Expansion'", Journal of African History, 36 (2): 173–195, doi:10.1017/S0021853700034101, JSTOR 182309.
  69. ^ Flight, C. (1980). "Malcolm Guthrie and the reconstruction of Bantu prehistory". History in Africa. 7: 81–118. doi:10.2307/3171657. JSTOR 3171657.
  70. ^ Flight, C. (1988). "The Bantu expansion and the SOAS network". History in Africa. 15: 261–301. doi:10.2307/3171863. JSTOR 3171863.
  71. ^ Bastin, Y. (1994). "Reconstruction formelle et sémantique de la dénomination de quelques mammiferes en Bantou". Afrikanische Arbeitspapiere. 38: 5–132.
  72. ^ Wang, B; Zhang, Y-B; Zhang, F; et al. (2011). "On the Origin of Tibetans and Their Genetic Basis in Adapting High-Altitude Environments". PLOS ONE. 6 (2): e17002. Bibcode:2011PLoSO...617002W. doi:10.1371/journal.pone.0017002. PMC 3046130. PMID 21386899.
  73. ^ "where the great rivers of East and Southeast Asia (including the Yellow, Yangtze, Mekong, Brahmaputra, Salween, and Irrawaddy) have their source. The time of hypothetical ST unity, when the Proto-Han (= Proto-Chinese) and Proto-Tibeto-Burman (PTB) peoples formed a relatively undifferentiated linguistic community, must have been at least as remote as the Proto-Indo-European period, perhaps around 4000 B.C." "The Sino-Tibetan Language Family". Sino-Tibetan Etymological Dictionary and Thesaurus. University of California at Berkeley. June 29, 2011.
  74. ^ George van Driem, "Language change, conjugational morphology and the Sino-Tibetan Urheimat,"(1993)
  75. ^ Zhang, M.; Yan, S.; Pan, W.; Jin, L. (2019). "Phylogenetic evidence for Sino-Tibetan origin in northern China in the Late Neolithic". Nature. 569 (7754): 112–115. doi:10.1038/s41586-019-1153-z. PMID 31019300. S2CID 129946000.
  76. ^ Sagart, Laurent; Jacques, Guillaume; Lai, Yunfan; Ryder, Robin; Thouzeau, Valentin; Greenhill, Simon J.; List, Johann-Mattis (2019). "Dated language phylogenies shed light on the ancestry of Sino-Tibetan". Proceedings of the National Academy of Sciences of the United States of America. 116 (21): 10317–10322. doi:10.1073/pnas.1817972116. PMC 6534992. PMID 31061123.
  77. ^ Schirokauer & Brown 2006. "A Brief history of Chinese civilization: second edition" Wadsworth, a division of Thomson Learning, pp. 25–47
  78. ^ Sidwell, Pascale. "Austroasiatic Languages". BookRags. Archived from the original on 2012-01-21.
  79. ^ Sidwell, Paul (2010). "Seminar: "A SEAsian homeland for the Austroasiatic Languages"". Retrieved 2 August 2011.
  80. ^ Sidwell, Paul (2009). "Family Diversity and the Austroasiatic Homeland" (PDF). Retrieved 2 August 2011.
  81. ^ Kumar, Vikrant et al., Y-chromosome evidence suggests a common paternal heritage of Austroasiatic populations, BMC Evol Biol. 2007, 7: 47.