This article summarizes the genetic makeup and population history of East Asian peoples and their connection to genetically related populations such as Southeast Asians and North Asians, as well as Oceanians, and partly, Central Asians, South Asians, and Native Americans. They are collectively referred to as "East Eurasians" in population genomics. [1]
Population genomic research has studied the origin and formation of modern East Asians. The ancestors of East Asians (Ancient East Eurasians) split from other human populations (Ancient West Eurasians) possibly as early as 70,000 to 50,000 years ago. Possible routes into East Asia include a northern route from Central Asia, beginning north of the Himalayas, and a southern route, beginning south of the Himalayas and moving through South and Southeast Asia. [3] [4] [1] Vallini et al. 2024 noted that the divergence between Ancient East Eurasians and West Eurasians most likely occurred on the Persian Plateau >48,000 years ago. [5]
Phylogenetic data suggests that an early Initial Upper Paleolithic wave (>45kya) "ascribed to a population movement with uniform genetic features and material culture" (Ancient East Eurasians) dispersed throughout Eurasia, with one of its main branches (dubbed as East Eurasian Core; EEC) using a Southern dispersal route into Southern Asia, where they subsequently diverged rapidly into the ancestors of Australasians (Oceanians), the Ancient Ancestral South Indians (AASI), as well as Andamanese and East/Southeast Asians (ESEA), [6] [4] [1] [7] although Papuans may have also received some geneflow from an earlier group (xOoA), [8] around 2%, [9] next to additional archaic admixture in the Sahul region.
Deeply diverged Ancient East Eurasian lineages associated with the spread of IUP-affilated material culture along an early 'northern route' into eastern Europe (Bacho Kiro Cave and Peștera cu Oase), Central Asia (Ust'-Ishim man), and Siberia (Kara-Bom etc.), as well as Northwest China, did contribute only little or no ancestry to modern East Asians, which instead were found to descend primarily from the southern route wave. [11] [12] [13] [14] [7] [15] [1] [4] [5]
The single southern route dispersal into the South-Southeast Asia region gave rise to the AASI, Andamanese, Eastern Asian and Australasian populations. The southern route origin is strongly supported by archaeogenetic data and genetic affinities between these "East Eurasian Core" (EEC) populations. Although the morphological more distinct traits of modern Northeast Asian and Siberian populations raised the question of a possible distinct northern origin, those traits have subsequently been associated with adaptions to the extremely cold climate during the Last Glacial Maximum (LGM) of 24– 16 kya by an ancient people with initially southern affinities, which moved into Northeast Asia and Siberia from Southeast Asia. This is further supported by higher morphological affinities of ancient East Asian specimens, such as the Tianyuan man, the Zhoukoudian Upper Cave remains, the Liujiang man, the Red Deer Cave people, the Jōmon people as well as the Liangdao and Qihe Cave remains to "southern" populations and ancient remains, such as the Niah cave and Wajak remains or Hoabinhians, as well as modern Andamanese, Vedda, and Aboriginal Australians, but being genetically closer or basal to the derived Northeast Asian and Siberian groups. Additionally, genetic diversity within present-day Asian populations, follows a strong correlation with latitude: genetic diversity is decreasing from south to north. The correlation continues to hold true when only mainland Southeast Asian and East Asian populations are considered. This may be attributable to a serial founder effect, and is consistent with a single eastward migration of modern humans along a southern route into South and Southeast Asia. Subsequently, ancestral East Asians diversified in southern East Asia and subsequently dispersed northward across the continent. [7] [1] [16] [17] [18] [19] [20] The dominant paternal haplogroups of East Asians associated with the "southern route" belong to subclades of C-M217, D-M174, O-M175, and N-M231, among some others. [21] The exact patterns of the northwards dispersal of ancestral East Asians from Southeast Asia remains unclear. While there may have been both an "interior route" and a "coastal route", correlating in part with the distribution and frequency of paternal haplogroup subclades, alternative scenarios have been proposed as well. [22] [1]
There is also genetic evidence for migrations to Northern Asia and Siberia from a later Upper Paleolithic West Eurasian source. While the southern migration wave likely diversified after settling within East Asia, the wave associated with Upper Paleolithic Europeans mixed with the southern wave somewhere in Siberia. This encounter and contact event resulted in the formation of the Ancient North Eurasian (ANE) lineage. The ANE derive around 2/3 (50–71%) of their ancestry from a West Eurasian-like source best represented by Upper Paleolithic Europeans, and around 1/3 (29–50%) from an East Eurasian source best represented by the Tianyuan man or Upper Paleolithic East/SoutheastAsians. [4] [23] [24] [25] The legacy of this Paleolithic admixture event is also evident by the later dispersal of the haplogroups Q and R, as well as ANE-like ancestry throughout Northern Eurasia, with only limited influence on modern East Asian groups. [21] [26] [16] [27] [4]
Although Huang et al. (2021) found evidence for light skin being selected among the ancestral populations of West Eurasians and East Eurasians, prior to their divergence, [28] the main period for the selection of light skin alleles (such as rs1800414-G) started around 25,000 to 30,000 years ago in East Asia, after the northwards expansion from Southern Asia. The affiliated alleles are distinct from those observed among European/Middle Eastern populations. [29]
After the peopling of the South and Southeast Asia region by the East Eurasian Core (EEC), this ancestral population diverged rapidly into at least three "deeply branching East Asian lineages, namely the "Ancient Ancestral South Indians" (AASI) staying in Southern Asia, Australasians (AA) heading into Oceania, and the East- and Southeast Asian (ESEA) branch staying in Southeast Asia and subsequently expanding northwards. The ESEA branch would become broadly ancestral to modern East Asians, but also Southeast Asians, Polynesians, and Siberians, as well as Native Americans. The ESEA source population is estimated to have expanded outgoing from Mainland Southeast Asia at c. 40,000 BCE. The deepest split among the ESEA lineage gave rise to the "basal Asian" Hoabinhian hunter-gatherers of Southeast Asia and the c. 39,000-year-old Tianyuan lineage in Northern China and the Amur Region, followed by the divergence of the Longlin lineage in Guangxi and the Jōmon lineages on the Japanese archipelago, and finally the split between Ancient Southern East Asians (ASEA) and Ancient Northern East Asians (ANEA) somewhere in East Asia. The divergence between ASEA and ANEA happened sometimes between 26,000 to 20,000 years ago. [6] ANEA ancestry replaced the ealier Tianyuan-like ancestry in Northern China, the Amur region and parts of Siberia since at least 19,000 years ago, while Longlin/Guangxi ancestry was largely replaced by ASEA ancestry since at least 9,000 years ago. [30] [1]
There are currently eight identified closely related sub-ancestries on the ESEA branch: [16] [1]
The genetic makeup of modern East Asians is primarily characterized by "Yellow River Neolithic" ancestry which formed from a major Ancient Northern East Asian (ANEA) component and a minor Ancient Southern East Asian (ASEA) one. [32] [6] Contemporary East Asians (notably Sino-Tibetan speakers) mostly have Yellow River Neolithic ancestry, which is associated with millet cultivation, and variable amounts of additional ASEA-like admixture associated with rice cultivation ("Sinitic cline"). "East Asian Highlanders" (Tibetans) carry both Ancient Tibetan ancestry and Yellow River Neolithic ancestry. Japanese people were found to have a tripartite origin; consisting of Jōmon ancestry, Amur ancestry, and Yellow River Neolithic ancestry. Koreans derive most of their ancestry from a combination of Yellow River Neolithic and Amur ancestries. [33] [34] [35]
Northeast Asians such as Tungusic, Mongolic, and Turkic peoples derive most of their ancestry from the "Amur" (Ancient Northeast Asian) subgroup of the Ancient Northern East Asians, which expanded massively with millet cultivation and pastoralism. Tungusic peoples display the highest genetic affinity to Ancient Northeast Asians, represented by c. 7,000 and 13,000 year old specimens, whereas Turkic-speaking peoples have variable but significant amounts of West Eurasian admixture. [36] [30]
An early branch of Ancient Northern East Asians absorbed an Ancient North Eurasian population to their North, giving rise to the Ancient Paleo-Siberians, who in turn became ancestral to both "modern Paleo-Siberians" (such as Chukotko-Kamchatkan, Yeniseian, and Nivkh speakers) and contemporary Native Americans. Paleo-Siberian (APS) ancestry was once widespread across Northern Asia, but largely replaced by later waves of Neo-Siberian ancestry due to a major population turnover from the south, possibly involving Uralic and Yukaghir speakers. This was later followed by another expansion from the south in relatively recent times, associated with Amur ancestry (ANA) involving Tungusic, Mongolic, and Turkic speakers. [37] [38]
Austronesians and Kra-Dai speakers in Southeast Asia mainly carry "Fujian Neolithic" ancestry, a subgroup of Ancient Southern East Asians, which is associated with the spread of rice cultivation. Isolated hunter-gatherers in Southeast Asia, specifically in Malaysia and Thailand, such as the Semang, derive most of their ancestry from the Hoabinhian lineage. [39] [40] [41] The emergence of the Neolithic in Southeast Asia went along with a population shift caused by migrations from Southern China. Neolithic Mainland Southeast Asian samples predominantly have Ancient Southern East Asian ancestry (a sister lineage of "Fujian Neolithic" ancestry) with variable amounts of Hoabinhian-related admixture. In modern populations, this admixture of Ancient Southern East Asian and Hoabinhian ancestry is most strongly associated with Austroasiatic speakers. [42]
The Xiongnu, possibly a Turkic, Mongolic, Yeniseian or multi-ethnic group, was a confederation [43] of nomadic peoples who, according to ancient Chinese sources, inhabited the eastern Eurasian Steppe from the 3rd century BC to the late 1st century AD. Chinese sources report that Modu Chanyu, the supreme leader after 209 BC, founded the Xiongnu Empire. [44]
It was found that the "predominant part of the Xiongnu population is likely to have spoken Turkic". However, important cultural, technological and political elements may have been transmitted by Eastern Iranian-speaking Steppe nomads: "Arguably, these Iranian-speaking groups were assimilated over time by the predominant Turkic-speaking part of the Xiongnu population". [45] This is reflected by the average genetic makeup of Xiongnu samples, having approximately 58% East Eurasian ancestry, represented by a Bronze Age population from Khövsgöl, Mongolia, which may be associated with the Turkic linguistic heritage. The rest of the Xiongnu's ancestry (~40%) was related to West Eurasians, represented by the Gonur Depe BMAC population of Central Asia, and the Sintashta culture of the Western steppe. [45] [46] The Xiongnu displayed striking heterogeneity and could be differentiated into two subgroups, "Western Xiongnu" and "Eastern Xiongnu", with the former being of "hybrid" origins displaying affinity to previous Saka tribes, such as represented by the Chandman culture, while the later was of primarily Ancient Northeast Asian (Ulaanzuukh-Slab Grave) origin. [47] [45] High status Xiongnu individuals tended to have less genetic diversity, and their ancestry was essentially derived from the Eastern Eurasian Ulaanzuukh/Slab Grave culture. [48]
A review of the available research has shown that, as a whole, 53% of Xiongnu paternal haplogroups were East Eurasian, while 47% were West Eurasian. [49] In 2012, Chinese researchers published an analysis of the paternal haplogroups of 12 elite Xiongnu male specimens from Heigouliang in Xinjiang, China. Six of the specimens belonged to Q1a, while four belonged to Q1b-M378. 2 belonged to unidentified clades of Q*. [50] In another study, a probable Chanyu of the Xiongnu empire was assigned to haplogroup R1. [51] [52]
The bulk of the genetics research indicates that, as a whole, 73% of Xiongnu maternal haplogroups were East Eurasian, while 27% were West Eurasian. [53] A 2003 study found that 89% of Xiongnu maternal lineages from the Egiin Gol valley were of East Asian origin, while 11% were of West Eurasian origin. [54] A 2016 study of Xiongnu from central Mongolia found a considerably higher frequency of West Eurasian maternal lineages, at 37.5%. [55]
A full genomic analysis performed on multiple Xianbei remains found the population to have derived primarily from the Ancient Northeast Asian gene pool. [56]
A genetic study published in the American Journal of Physical Anthropology in August 2018 noted that the paternal haplogroup C2b1a1b has been detected among the Xianbei and the Rouran, and was probably an important lineage among the Donghu people. [57]
Genetic studies published in 2006 and 2015 revealed that the mitochondrial haplogroups of Xianbei remains were of East Asian origin. According to Zhou (2006) the maternal haplogroup frequencies of the Tuoba Xianbei were 43.75% haplogroup D, 31.25% haplogroup C, 12.5% haplogroup B, 6.25% haplogroup A and 6.25% "other". [58] Zhou (2014) obtained mitochondrial DNA analysis from 17 Tuoba Xianbei, which indicated that these specimens were, similarly, completely East Asian in their maternal origins, belonging to haplogroups D, C, B, A, O and haplogroup G. [59] [60]
The Jōmon people represent the indigenous population of the Japanese archipelago during the Jōmon period. They are inferred to descend from the Paleolithic inhabitants of Japan. Genetic analyses on Jōmon remains found them to represent a deeply diverged East Asian lineage. The Jōmon lineage is inferred to have diverged from Ancient East Asians before the divergence between Ancient Northern East Asians and Ancient Southern East Asians, but after the divergence of the basal Tianyuan man and or Hoabinhians. Beyond their broad affinity with Eastern Asian lineages, the Jōmon also display a weak affinity for Ancient North Eurasians (ANE), which may be associated with the introduction of microblade technology to Northeast Asia and Northern East Asia during the Last Glacial Maximum via the ANE or Ancient Paleo-Siberians. [61] [62]
The Hoabinhians represent a technologically advanced society of hunter-gatherers, primarily living in Mainland Southeast Asia, but also adjacent regions of Southern China. While the Upper Paleolithic origins of this 'Hoabinhian ancestry' are unknown, Hoabinhian ancestry has been found to be related to the main 'East Asian' ancestry component found in most modern East and Southeast Asians, although deeply diverged from it. [63] [64] Together with the Paleolithic Tianyuan man, they form early branches of East Asian genetic diversity, and are described as "Basal Asian" (BA) or "Basal East Asian" (BEA). [65]
A study on the Manchu population of Liaoning reported that they have a close genetic relationship and significant admixture signals from Northern Han Chinese. The Liaoning Manchu were formed from a major ancestral component related to Yellow River farmers and a minor ancestral component linked to ancient populations from the Amur River Basin, or others. The Manchu were therefore an exception to the coherent genetic structure of Tungusic-speaking populations, likely due to the large-scale population migrations and genetic admixtures in the past centuries. [67]
A plurality of Daur males belong to Haplogroup C-M217 (12/39 = 30.8% according to Xue Yali et al. 2006, [68] 88/207 = 42.5% according to Wang Chi-zao et al. 2018 [69] ), with Haplogroup O-M122 being the second most common haplogroup among present-day Daurs (10/39 = 25.6%, [68] 52/207 = 25.1% [69] ). There are also tribes (hala; cf. Kazakh tribes) among the Daurs that belong predominantly to other Y-DNA haplogroups, such as Haplogroup N-M46/M178 (Merden hala) and Haplogroup O1b1a1a-M95 (Gobulo hala). [69] Haplogroup C3b2b1*-M401(xF5483) [70] [71] [72] has been identified as a possible marker of the Aisin Gioro and is found in ten different ethnic minorities in Northern China, but is less prevalent from Han Chinese. [73] [74] [72] The Manchu people also display a significant amount of haplogroup C-M217, but the most often observed Y-DNA haplogroup among present-day Manchus is Haplogroup O-M122, which they share in common with the general population of China. [75] [68] [76]
The exact origins of the early Ainu remains unclear, but it is generally agreed to be linked to the Satsumon culture of the Epi-Jōmon period, with later influences from the nearby Okhotsk culture. [77] The Ainu appear genetically most closely related to the Jōmon period peoples of Japan. The genetic makeup of the Ainu represents a "deep branch of East Asian diversity". Compared to contemporary East Asian populations, the Ainu share "a closer genetic relationship with northeast Siberians". [11] [62]
Japanese populations in modern Japan can be traced to three separate, but related demographics: the Ainu, Ryukyuan and Mainland Japanese (Yamato). The populations are closely related to clusters found in Northeastern Asia [80] [81] [82] with the Ainu group being most similar to the Ryukyuan group, [81] [83] the Ryukyuan group being most similar to the Yamato group, [84] and the Yamato group being most similar to Koreans [85] [86] [87] [88] among other East Asian people.
The majority of Japanese genetic ancestry is derived from sources related to other mainland Asian groups, mostly Koreans, while the other amount is derived from the local Jōmon hunter-gatherers. [88] Evidence for both Northern and Southern mtDNA and Y-DNA haplogroups has been observed in the Japanese, with the North-Eastern DNA taking up majority of the genetic makeup, [80] especially among the Mainland group. [86] In addition to the Northeastern ancestry, the Japanese demographics (alongside the Koreans), are the only ethnicities to have restricted presence of the Jōmon-like M7a DNA [ ja] in East Asia. [89] [90] [66]
A comprehensive study of worldwide Y-DNA diversity (Underhill et al. 2000) included a sample of 23 males from Japan, of whom 35% belonged to haplogroup D-M174, 26% belonged to O-M175, 22% belonged to O-M122, 13% belonged to C-M8 and C-M130, and 4.3% belonged to N-M128. [91] Poznik et al. (2016) reported the haplogroups of a sample of Japanese men from Tokyo: [92] 36% belonged to D2-M179, 32% had O2b-M176, 18% carried O3-M122, 7.1% carried C1a1-M8, 3.6% belonged to O2a-K18, and 3.6% carried C2-M217. [93]
According to an analysis of the 1000 Genomes Project's sample of Japanese collected in the Tokyo metropolitan area, the mtDNA haplogroups found among modern Japanese include D (35.6%), B (13.6%), M7 (10.2%), G (10.2%), N9 (8.5%), F (7.6%), A (6.8%), Z (3.4%), M9 (2.5%), and M8 (1.7%). [94]
Korean populations in modern Korea can be traced to many origins from the people of the Mumun period to the Yemaek people. [95] In modern times, Koreans are related to other populations found in Northeast Asia, however according to recent studies, ancient Koreans included populations related to the Yayoi people, [96] Jōmon people, [97] Siberian influx [98] etc.
Generally speaking, modern Koreans' genetic ancestry is mostly dominated by Northeast Asian DNA with a small mix of Southern Jōmon-like ancestry (6% ±3%). [99] [100] Evidence for both Northern and Southern mtDNA and Y-DNA haplogroups has been observed in Koreans, similar to the Japanese, [66] with the latter also being the closest group to the Koreans in the approximate region due to the overlap of Northeastern DNA and the presence of Jōmon-like M7a haplogroup [ ja]. [88] It is believed that the Jōmon-like ancestry was prominent during Neolithic period of Korea [101] with percentage as high as 34% (±7%), [99] but diminished over time due to incoming populations from the north.
Ancient genome comparisons revealed that the genetic makeup of Koreans can be best described as an admixture between Northeast Asian hunter-gatherers and an influx of rice-farming Southeast Asian agriculturalists from the Yangtze river valley. [90] [102] This is supported by archaeological, historical and linguistic evidence, which suggests that the direct ancestors of Koreans were proto-Koreans who inhabited the northeastern region of China (situated near Liao River) and the northern part of the Korean peninsula during the Neolithic (8,000–1,000 BC) and Bronze (1,500–400 BC) Ages, [103] who later mixed with the Jōmon-like natives in the southern part of the peninsula before the Three Kingdoms period of Korea. [99]
Studies of polymorphisms in the human Y-chromosome have so far produced evidence to suggest that the Korean people have a long history as a distinct, mostly endogamous ethnic group, with successive waves of people moving to the peninsula and three major Y-chromosome haplogroups. [104] A majority of Koreans belong to subclades of haplogroup O-M175 (ca. 79% in total, [103] [105] with about 42% [105] to 44% [103] belonging to haplogroup O2-M122, about 31% [103] to 32% [105] belonging to haplogroup O1b2-M176, and about 2% [103] to 3% [105] belonging to haplogroup O1a-M119), while a significant minority belong to subclades of haplogroup C2-M217 (ca. 12% [103] to 13% [105] in total). Other Y-DNA haplogroups, including haplogroup N-M231, haplogroup D-M55, and haplogroup Q-M242, are also found in smaller proportions of present-day Koreans. [106] [107] [108]
Studies of Korean mitochondrial DNA lineages have shown that there is a high frequency of Haplogroup D4, followed by haplogroup B, and then haplogroup A and haplogroup G. Haplogroups with lower frequency include N9, Y, F, D5, M7, M8, M9, M10, M11, R11, C, and Z. [109] [110] [111]
The ethnogenesis of Mongolic peoples is largely linked with the expansion of Ancient Northeast Asians. They subsequently came into contact with other groups, notably Sinitic peoples to their South and Western Steppe Herders to their far West. The Mongolians pastoralist lifestyle, may in part be derived from the Western Steppe Herders, but without much geneflow between these two groups, suggesting cultural transmission. [112] [113] The Mongols are believed to be the descendants of the Xianbei and the proto-Mongols. The former term includes the Mongols proper (also known as the Khalkha Mongols), Oirats, the Kalmyk people and the Southern Mongols. The latter comprises the Abaga Mongols, Abaganar, Aohans, Baarins, Gorlos Mongols, Jalaids, Jaruud, Khishigten, Khuuchid, Muumyangan and Onnigud. The Daur people are descendants of the para-Mongolic Khitan people. [114]
The majority of Mongols in Mongolia and Russia belong to subclades of haplogroup C-M217, [115] followed by lower frequency of O-M175 and N-M231. [116] A minority belongs to haplogroup Q-M242, and a variety of West Eurasian haplogroups. [117]
The maternal haplogroups are diverse but similar to other northern Asian populations, including Haplogroup D, Haplogroup C, Haplogroup B, and Haplogroup A, which are shared among indigenous American and Asian populations. [118] West Eurasian mtDNA haplogroups makes up a some minority percentages. Haplogroup HV, Haplogroup U, Haplogroup K, Haplogroup I, Haplogroup J are all found in Mongolic people. [119]
The origins of the Han Chinese are primarily from Neolithic Yellow River farmers, which formed from Ancient Northern East Asians (ANEA) and Neolithic groups near the Yangtze, which formed Ancient Southern East Asians (ASEA). [121] [122] [123] [124] [125] Today's modern Han Chinese can be colloquially categorized into two subgroups, Northern and Southern Han Chinese, although it is a clinal population with no significant distinction. [126] The Han Chinese cluster retains a level of singularity with its admixture of ANEA and ASEA ancestries which is unique to the group [124] with the Southern Hans having contributions from the north and its southern natives. [127] In large, when comparing with other East Asian populations, the Northern Han Chinese cluster is placed closer to the "Korean/Mainland Japanese" cluster in terms of a correlative genetic relationship (mostly due to the overlap of ANEA), but is also quite distinguishable from them genetically, [121] [31] due to the presence of ASEA ancestry [128] and the absence of Jōmon ancestry. The Southern Han Chinese also share more alleles with Thai and other Kra–Dai peoples according to principal component analysis than Northern Han Chinese. [31]
The genetic makeup of the modern Han Chinese is not purely uniform in terms of physical appearance and biological structure due to the vast geographical expanse of China and the migratory percolations that have occurred throughout it over the last few millennia. This has also engendered the emergence and evolution of the diverse multiplicity of assorted Han subgroups found throughout the various regions of modern China today. Comparisons between the Y chromosome single-nucleotide polymorphisms (SNPs) and mitochondrial DNA (mtDNA) of modern Northern Han Chinese and 3000 year old Hengbei ancient samples from China's Central Plains show that they are extremely similar to each other. These findings demonstrate that the core fundamental structural basis that shaped the genetic makeup of the present-day Northern Han Chinese was already formed more than three thousand years ago. [126]
Studies of DNA remnants from the Central Plains area of China 3000 years ago show close affinity between that population and those of Northern Han today in both the Y-DNA and mtDNA. Both Northern and Southern Han show similar Y-DNA genetic structure. [129]
Northern Han Chinese populations also have some West Eurasian admixture, [130] especially Han Chinese populations in Shaanxi (~2%-4.6%) [131] and Liaoning (~2%). [132] During the Zhou dynasty, or earlier, peoples with paternal haplogroup Q-M120 also contributed to the ethnogenesis of Han Chinese people. This haplogroup is implied to be widespread in the Eurasian steppe and north Asia since it is found among Cimmerians in Moldova and Bronze Age natives of Khövsgöl. But it is currently near-absent in these regions except for East Asia. In modern China, haplogroup Q-M120 can be found in the northern and eastern regions. [133] [134] Other Y-DNA haplogroups that have been found with notable frequency in samples of Han Chinese include O-P203 (15/165 = 9.1%, 217/2091 = 10.38%, [135] 47/361 = 13.0%), C-M217 (10/168 = 6.0%, 27/361 = 7.5%, 176/2091 = 8.42%, [135] 187/1730 = 10.8%, 20/166 = 12.0%), N-M231 (6/166 = 3.6%, 94/2091 = 4.50%, [135] 18/361 = 5.0%, 117/1729 = 6.8%, 17/165 = 10.3%), O-M268(xM95, M176) (78/2091 = 3.73%, [135] 54/1147 = 4.7%, [136] 8/168 = 4.8%, 23/361 = 6.4%, 12/166 = 7.2%), and Q-M242 (2/168 = 1.2%, 49/1729 = 2.8%, 61/2091 = 2.92%, [135] 12/361 = 3.3%, 48/1147 = 4.2% [136] ).
However, the mtDNA of Han Chinese increases in diversity as one looks from Northern to Southern China, which suggests that the influx of male Han Chinese migrants intermarried with the local female non-Han aborigines after arriving in what is now modern-day Guangdong, Fujian, and other regions of Southern China. [137] [138] Despite this, tests comparing the genetic profiles of Northern Han, Southern Han, and non-Han southern natives determined that haplogroups O1b-M110, O2a1-M88 and O3d-M7, which are prevalent in non-Han southern natives, were only observed in some Southern Han Chinese (4% on average), but not in the Northern Han genetic profile. Therefore, this proves that the male contribution of the southern non-Han natives in the Southern Han genetic profile is limited, assuming that the frequency distribution of Y lineages in southern non-Han natives represents that prior to the expansion of Han culture which originated two thousand years ago from the north. [137] [139]
A recent, and to date the most extensive, genome-wide association study of the Han population, shows that geographic-genetic stratification from north to south has occurred and centrally placed populations act as the conduit for outlying ones. [140] Ultimately, with the exception in some ethnolinguistic branches of the Han Chinese, such as Pinghua and Tanka people, [141] there is a "coherent genetic structure" found in the entirety of the modern Han Chinese populace. [142] Although admixture proportions can vary according to geographic region, the average genetic distance between various Han Chinese populations is much lower than between European populations, for example. [143]
A 2018 study calculated pairwise FST (a measure of genetic difference) based on genome-wide SNPs, among the Han Chinese (Northern Han from Beijing and Southern Han from Hunan, Jiangsu and Fujian provinces), Japanese and Korean populations sampled. It found that the smallest FST value was between Northern Han Chinese (Beijing) (CHB) and Southern Han (Hunan, Fujian, etc.) Chinese (CHS) (FST[CHB-CHS] = 0.0014), while CHB and Korean (KOR) (FST[CHB-KOR] = 0.0026) and between KOR and Japanese (JPT) (FST[JPT-KOR] = 0.0033). Generally, pairwise FST between Han Chinese, Japanese and Korean (0.0026~ 0.0090) are greater than that within Han Chinese (0.0014). These results suggested Han Chinese, Japanese and Korean are different in terms of genetic make-up, and the differences among the three groups are much larger than that between Northern and Southern Han Chinese. [144] Nonetheless, there is also genetic diversity among the Southern Han Chinese. The genetic composition of the Han population in Fujian might not accurately represent that of the Han population in Guangdong.
Another study shows that the Northern and Southern Han Chinese are genetically close to each other and it finds that the genetic characteristics of present-day Northern Han Chinese were already formed prior to three thousand years ago in the Central Plain area. [146]
A recent genetic study on the remains of people (~4,000 years BP) from the Mogou site in the Gansu-Qinghai (or Ganqing) region of China revealed more information on the genetic contributions of these ancient Di-Qiang people to the ancestors of the Northern Han. It was deduced that 3,300 to 3,800 years ago some Mogou people had merged into the ancestral Han population, resulting in the Mogou people being similar to some Northern Han in sharing up to ~33% paternal (O3a) and ~70% maternal (D, A, F, M10) haplogroups. The mixing ratio was possibly 13–18%. [147]
The estimated contribution of Northern Han to Southern Han is substantial in both paternal and maternal lineages and a geographic cline exists for mtDNA. As a result, the Northern Han are one of the primary contributors to the gene pool of the Southern Han. However, it is noteworthy that the expansion process was not only dominated by males, as is shown by both contribution of the Y-chromosome and the mtDNA from Northern Han to Southern Han. Northern Han Chinese and Southern Han Chinese exhibit both Ancient Northern East Asian and Ancient Southern East Asian ancestries. [148] These genetic observations are in line with historical records of continuous and large migratory waves of Northern China inhabitants escaping warfare and famine, to Southern China. Aside from these large migratory waves, other smaller southward migrations occurred during almost all periods in the past two millennia. [149] A study by the Chinese Academy of Sciences into the gene frequency data of Han subpopulations and ethnic minorities in China showed that Han subpopulations in different regions are also genetically quite close to the local ethnic minorities, suggesting that in many cases, ethnic minorities ancestry had mixed with Han, while at the same time, the Han ancestry had also mixed with the local ethnic minorities. [150]
Han Chinese, similar to other East Asian populations, have inherited West Eurasian ancestry, around 2.8% in Northern Han Chinese and around 1.7% in Southern Han Chinese. [151]
An extensive, genome-wide association study of the Han population in 2008, shows that geographic-genetic stratification from north to south has occurred and centrally placed populations act as the conduit for outlying ones. [152] Ultimately, with the exception in some ethnolinguistic branches of the Han Chinese, such as Pinghua, there is "coherent genetic structure" (homogeneity) in all Han Chinese. [153]
The major haplogroups of Han Chinese belong to subclades of Haplogroup O-M175. Y-chromosome O2-M122 is a common DNA marker in Han Chinese, as it appeared in China in prehistoric times, and is found in approximately 50% of Chinese males, with frequencies tending to be high toward the east of the country, ranging from 29.7% to 52% in Han from Southern and Central China, to 55–68% in Han from the eastern and northeastern Chinese mainland and Taiwan. [154]
Other Y-DNA haplogroups that have been found with notable frequency in samples of Han Chinese include O-P203 (9.1–13.0%), C-M217 (6.0–12.0%), N-M231 (3.6–10.3%), O-M268(xM95, M176) (4.7–7%), and Q-M242 (2/168 = 1.2–4.2%). [136] [154]
The mitochondrial-DNA haplogroups of the Han Chinese can be classified into the Northern East Asian-dominating haplogroups, including A, C, D, G, M8, M9, and Z, and the Southern East Asian-dominating haplogroups, including B, F, M7, N*, and R. [149]
These haplogroups account for 52.7% and 33.85% of those in the Northern Han, respectively. Haplogroup D is the modal mtDNA haplogroup among Northern East Asians. Among these haplogroups, D, B, F, and A were predominant in the Northern Han, with frequencies of 25.77%, 11.54%, 11.54%, and 8.08%, respectively.
However, in the Southern Han, the Northern and Southern East Asian-dominating mtDNA haplogroups accounted for 35.62% and 51.91%, respectively. The frequencies of haplogroups D, B, F, and A reached 15.68%, 20.85%, 16.29%, and 5.63%, respectively. [146] [155] [156] [157] [158]
The ethnic roots of Tibetans can be traced back to a deep Eastern Asian lineage representing the indigenous population of the Tibetan plateau since c. 40,000 to 30,000 years ago, and arriving Neolithic farmers from the Yellow River within the last 10,000 years associated, and which can be associated with having introduced the Sino-Tibetan languages. Modern Tibetans derive up to 20% from Paleolithic Tibetans, with the remaining 80% being primarily derived from Yellow River farmers. The present-day Tibetan gene pool was formed at least 5,100 years BP. [159] [160]
Tibetan males predominantly belong to the paternal lineage D-M174 followed by lower amounts of O-M175. [161]
Tibetan females belong mainly to the Northeast Asian maternal haplogroups M9a1a, M9a1b, D4g2, D4i and G2ac, showing continuity with ancient middle and upper Yellow River populations. [162]
Linguistic and genetic evidence strongly suggests an early presence of Turkic peoples in eastern Mongolia. [163] The genetic evidence suggests that the Turkification of Central Asia was carried out by East Asian dominant minorities migrating out of Mongolia. [47]
Genetic data found that almost all modern Turkic-speaking peoples retained at least some shared ancestry associated with "Southern Siberian and Mongolian" (SSM) populations, supporting this region as the "Inner Asian Homeland (IAH) of the pioneer carriers of Turkic languages" which subsequently expanded into Central Asia. [164]
An Ancient Northeast Asian origin of the early Turkic peoples has been corroborated in multiple recent studies. Early and medieval Turkic groups however exhibited a wide range of both (Northern) East Asian and West Eurasian genetic origins, in part through long-term contact with neighboring peoples such as Iranian, Mongolic, Tocharian, Uralic and Yeniseian peoples, and others. [165] [166] [167] [168] [169] [170]
Common Y-DNA haplogroups in Turkic peoples are Haplogroup N-M231 (found with especially high frequency among Turkic peoples living in present-day Russia, especially among Siberian Tatars, as Zabolotnie Tatars have one of the highest frequencies of this haplogroup, second only to Samoyedic Nganasans ), Haplogroup C-M217 (especially in Central Asia, and in particular, Kazakhstan, also in Siberia among Siberian Tatars), Haplogroup Q-M242 (especially in Southern Siberia among the Siberian Tatars, also quite frequent among Lipka Tatars and among Turkmens and the Qangly tribe of Kazakhs), and Haplogroup O-M175 (especially among Turkic peoples living in present-day China, the Naiman tribe of Kazakhs and Siberian Tatars). Some groups also have Haplogroup R1b (notably frequent among the Teleuts, Siberian Tatars, and Kumandins of Southern Siberia, the Bashkirs of the Southern Ural region of Russia, and the Qypshaq tribe of Kazakhs), Haplogroup R1a (notably frequent among the Kyrgyz, Altaians, Siberian Tatars, Lipka Tatars, Volga Tatars, Crimean Tatars and several other Turkic peoples living in present-day Russia), Haplogroup J-M172 (especially frequent among Uyghurs, Azerbaijanis, and Turkish people), and Haplogroup D-M174 (especially among Yugurs, but also observed regularly with low frequency among Southern Altaians, Nogais, Kazakhs, and Uzbeks). [171] [172]
The oldest modern human genome found in Central Asia belongs to the deeply East Eurasian IUP-affilated Ust'Ishim man. The population affilated with this specimen is inferred to have not contributed to modern human populations. During the late Upper Paleolithic period, geneflow from Ancient North Eurasians (ANE) played a significant role in the genetic makeup of Central Asia. The ANE carried both Upper Paleolithic European and East/Southeast Asian ancestry. Post-Paleolithic geneflow included movements of Paleo-Siberian and Northeast Asian groups into Central Asia, admixing with local ANE-rich groups resulting in the formation of the Botai genetic grouping, with close affinities to West Siberian hunter-gatherers (WSHG). Subsequently, massive geneflow from Western Steppe Herders from eastern and central Europe is associated with the formation of the Andronovo culture and the spread of Indo-Iranian languages. Interactions between pre-existing groups with Andronovo and Paleo-Siberian tribes, resulted in the origin of early Scythians. [173] During the Iron Age, the Turkification of Central Asia was carried out by East Asian dominant minorities migrating out of Mongolia. [174] The Turkic-speaking Central Asian populations, such as Kyrgyz, Kazakhs, Uzbeks, and Turkmens share more of their gene pool with various East Asian and Siberian populations than with West Asian or European populations. [175]
All Native American populations descend from an ancient Paleo-Siberian group which emerged by the merger of Ancient East Asians and Ancient North Eurasians. While the East Asian-like ancestry is best represented by Ancient Northern East Asians (Amur14k), it may also include ancestry from further South prior to the divergence between Southern and Northern East Asians. Beyond that, there may be a small and variable amount of deeply branching East Asian admixture best represented by the Onge, Papuans or the Tianyuan man. This deep ghost component has been dubbed as "population Y". [66] [176] [177] [176] [178] [179] [180] [1]
The genetic makeup of modern South Asians can be described as a combination of West Eurasian ancestries with divergent East Eurasian ancestries. The latter primarily include an indigenous South Asian component (termed Ancient Ancestral South Indians, short "AASI") that is distantly related to the Andamanese peoples, as well as to East Asians and Aboriginal Australians, and further include additional, regionally variable East/Southeast Asian components. [181] [6] The East Asian-related ancestry component forms the major ancestry among Tibeto-Burmese and Khasi-Aslian speakers in the Himalayan foothills and Northeast India, [182] [183] and is generally distributed throughout South Asia at lower frequency, with substantial presence in Mundari-speaking groups. [182] [183] [184] Southern East Asian ancestry is primarily associated with the dispersal of Austroasiatic rice farmers, which migrated from Southeast Asia into India. [185] Multiple researches indicate that the Austroasiatic populations in India are derived from (mostly male dominated) migrations from Southeast Asia during the Holocene. [186] [185] [187] [188] [189] [190] According to Van Driem (2007), "...the mitochondrial picture indicates that the Munda maternal lineage derives from the earliest human settlers on the Subcontinent, whilst the predominant Y chromosome haplogroup argues for a Southeast Asian paternal homeland for Austroasiatic language communities in India." [187] : 7
Geneflow from Southeast Asians (particularly Austroasiatic groups) to South Asian peoples is associated with the introduction of rice-agriculture to South Asia. There is significant cultural, linguistic, and political Austroasiatic influence on early India, which can also be observed by the presence of Austroasiatic loanwords within Indo-Aryan languages. [191] [192]
Southeast Asians represent one of the most closely related groups to East Asians, with both being referred to as East/Southeast Asian. While East Asians primarily derive from both ANEA and ASEA components, Southeast Asians derive most of their ancestry from the ASEA component with variable amounts of deeper branchin East Asian-like admixture (mostly Onge/Hoabinhian-like). While Hoabinhian-like ancestry is associated with indigenous hunter-gatherers, ASEA ancestry spread mostly with Neolithic expansions associated with Austroasiatic and Austronesian groups. [39] [194] [16] [1]
Evidence for more complex Mesolithic migration patterns are evident in the remains of a hunter-gatherer specimen from Maritime Southeast Asia, South Sulawesi, which was found to have ancestry from two deeply diverged East Eurasian lineages. The remains had approximately c. 50% "Tianyuan/Onge" ancestry and c. 50% Papuan-like ancestry. [195]
There is also evidence for low proportions (~5%) of South Asian-associated "SAS ancestry" (best samplified by modern South Indian groups such as Irula or Mala) among specific Mainland Southeast Asian (MESA) ethnic groups (~2–16% as inferred by qpAdm), likely as a result of cultural diffision; mainly through maritime trade and Indianized kingdoms of Southeast Asia. Overall, the geneflow event is estimated to have happened between 500 and 1000 YBP. [196]
Melanesians and Aboriginal Australians are deeply related to East Asians. Genetic studies have revealed that Australasians descended from the same Eastern Eurasian source population as East Asians and indigenous South Asians (AASI). [6] The 'Australasian', 'Ancient Ancestral South Indian', and 'East and Southeast Asian' lineages display a closer genetic relationship to each other than to any non-Asian lineages, as well as being closer to each other than to any of the early East Eurasian IUP lineages (Bacho Kiro etc.), and together represent the main branches of "Asian-related ancestry", which diverged from each other >40,000 years ago. [197]
...In contrast, mainland East and Southeast Asians and other Pacific islanders (e.g., Austronesian speakers) are closely related to each other [9,15,16] and here denoted as belonging to an East and Southeast Asian (ESEA) lineage (Box 2). …the ESEA lineage differentiated into at least three distinct ancestries: Tianyuan ancestry which can be found 40,000–33,000 years ago in northern East Asia, ancestry found today across present-day populations of East Asia, Southeast Asia, and Siberia, but whose origins are unknown, and Hòabìnhian ancestry found 8,000–4,000 years ago in Southeast Asia, but whose origins in the Upper Paleolithic are unknown.
A single major migration of modern humans into the continents of Asia and Sahul was strongly supported by earlier studies using mitochondrial DNA, the non-recombining portion of Y chromosomes, and autosomal SNP data [42–45]. Ancestral Ancient South Indians with no West Eurasian relatedness, East Asians, Onge (Andamanese hunter–gatherers) and Papuans all derive in a short evolutionary time from the eastward dispersal of an out-of-Africa population [46,47], although Europeans and East Asians are suggested to share more recent common ancestors than with Papuans [48]. The HUGO (Human Genome Organization) Pan-Asian SNP consortium [44] investigated haplotype diversity within present-day Asian populations and found a strong correlation with latitude, with diversity decreasing from south to north. The correlation continues to hold when only mainland Southeast Asian and East Asian populations are considered, and is perhaps attributable to a serial founder effect [50]. These observations are consistent with the view that soon after the single eastward migration of modern humans, East Asians diverged in southern East Asia and dispersed northward across the continent.
Via the southern route, ancestors of current Asian populations reached Southeast Asia and a part of Oceania around 70000–50000 years ago, probably through a coastal dispersal route (Bae et al., 2017). The oldest samples providing the genetic evidence of the northern migration route come from a high-coverage genome sequence of individuals excavated from the Yana RHS site in northeastern Siberia (Figure 2), which is about 31600 years old (Sikora et al., 2019). A wide range of artifacts, including bone crafts of wooly rhinoceros and mammoths, were excavated at the site (Pitulko et al., 2004). The analysis of genome sequences showed that the samples were deeply diverged from most present-day East Asians and more closely related to present-day Europeans, suggesting that the population reached the area through a route different from the southern route. A 24000-year-old individual excavated near Lake Baikal (Figure 2), also known as the Mal'ta boy, and 17000-year-old individuals from the Afontova Gora II site (Afontova Gora 2 and 3) showed similar genetic features to the Yana individuals (Raghavan et al., 2014; Fu et al., 2016; Sikora et al., 2019). Interestingly, genetic data suggested that Yana individuals received a large amount of gene flow from the East Asian lineage (Sikora et al., 2019; Yang et al., 2020).
Population genomic studies on present-day humans7,8 have exclusively supported the southern route origin of East Asian populations.
Inferences from nuclear (51), Y chromosome (52), and mitochondrial genome (53) data support an early migration of modern humans out of Africa and into Southeast Asia using a southern route by at least 60 ka. Patterns of genetic variation in recent human populations (11, 54, 55) recognize Southeast Asia as an important source for the peopling of East Asia and Australasia via a rapid, early settlement.
Population genomic studies on present-day humans7,8 have exclusively supported the southern route origin of East Asian populations.
Whole-genome analyses extracted from the remains of the Jomon people showed that they were highly differentiated from other East Asians, forming a basal lineage to East and Northeast Asians.8,10,11 The genetic relationship between Jomon individuals and other East Asians suggests that the ancestral population of the Jomon people is one of the earliest wave migrants who might have taken a coastal route from Southeast Asia toward East Asia.
...ancient people perhaps of the "first layer" with Australo-Papuan features moved into Siberia and subsequently adapted to the extremely cold climate during the Last Glacial Maximum (LGM) of 24– 16 kya.
However, genetic sampling in Japan and southern China of populations associated craniometrically with the first layer show that they are more closely related genetically to second-layer East Asian populations, indicating that the two-layer model is not sufficient to describe the population movement, replacement, and mixture in prehistoric Asia.
Positive selection for rs1800414-G in East Asia is thought to date back to the Late Palaeolithic period (~25–30 kya), following the northward migration of modern humans from South Asia (Yang et al., 2016).
We also found the haplogroup M7a1 exclusively in the Korean population. This result is consistent with previous reports that haplogroup M7a is restricted to Japan and south Korea.— Jin, et al
European-prevalent haplogroups (HV, U, K, I, J) are 14.3% in Xinjiang Mongolian, 10% in Mongolia, 8.4% in central Inner Mongolian samples, and only 2% in eastern Xin Barage Zuoqi County samples, showing decreasing frequencies from west to east
The team discovered that the Spirit Cave remains came from a Native American while dismissing a longstanding theory that a group called Paleoamericans existed in North America before Native Americans.
Here the analysis of genome wide data on Indian and East/Southeast Asian demonstrated their restricted distinctive ancestry in India mainly running along the foothills of Himalaya and northeastern part.
It has been further proved that not only linguistic but also certain cultural and political facts of ancient India, can be explained by Austroasiatic elements.
The qpGraph analysis confirmed this branching pattern, with the Leang Panninge individual branching off from the Near Oceanian clade after the Denisovan gene flow, although with the most supported topology indicating around 50% of a basal East Asian component contributing to the Leang Panninge genome (Fig. 3c, Supplementary Figs. 7–11).
At 12 hypothetical ancestral populations, Mon, Khmer from Thailand, Kuy, Nyahkur, Burmese, Thai, Cambodian Khmer, Cham, Ede, Giarai, and Malay... demonstrated a "light pink" ancestry component (accounting for more than 5% of their ancestry, on average) that is enriched in South Asian populations such as Irula and Mala from Southern India (Fig 3)