This article explains the genetic makeup and population history of East Asian peoples and their connection to genetically related populations (i.e. Southeast Asians, Siberians, Polynesians, and Native Americans), as well as Oceanians and partly, Central Asians and South Asians, which are collectively referred to as "East Eurasians" in population genomics.
Population genomic studies have studied the origin and formation of modern East Asians. Ancestors of East Asians split from other human populations between 50,000 and 70,000 years ago. Possible routes into East Asia include a northern route model from Central Asia, beginning north of the Himalayas, and a southern route model, beginning south of the Himalayas and moving through Southeast Asia.[5]
The southern route model for East Asians has been corroborated in multiple recent studies, showing that most of the ancestry of Eastern Asians arrived from the southern route in to Southeast Asia at a very early period, perhaps 70,000 years ago.[6][7][8][9] However, the genetic evidence for northwestern East Asians and north Asians also supports more recent migrations along the northern route, taking place 18,000 years ago and since 3000 years ago, as shown by the presence of haplogroups Q and R.[10]
Sato et al. 2021 concluded that "the southern migration wave seems to have diversified into the local populations in East Asia (defined in this paper as a region including China, Japan, Korea, Mongolia, Taiwan and Southeast Asia), and the northern wave, which probably runs through the Siberian and Eurasian steppe regions and mixed with the southern wave, probably in Siberia."[11] The northern route in to East Asia was taken by the Ancient North Eurasians, who mixed with an Ancient Northeast Asian population. Modern day Native Americans carry substantial genetic ancestry from the Ancient North Eurasians, as do many modern West Eurasian populations (specifically Europeans; through Eastern Hunter-Gatherer admixture).[12][13]
A review paper by Melinda A. Yang (in 2022) summarized and concluded that a distinctive "Basal-East Asian population" referred to as 'East- and Southeast Asian lineage' (ESEA); which is ancestral to modern East Asians, Southeast Asians, Polynesians, and Siberians, originated in Mainland Southeast Asia at ~50,000BC, and expanded through multiple migration waves southwards and northwards respectively. This ESEA lineage is also ancestral to the Hoabinhian hunter-gatherers of Southeast Asia and the ~40,000 year old Tianyuan lineage found in Northern China, but distinct from European-related and Australasian-related lineages of prehistoric Eurasia.[14] There are currently eight detected, closely related, sub-ancestries in the ESEA lineage:
Modern Northeast Asians consist mostly of the "Amur ancestry" which expanded massively with millet cultivation. Modern Southeast Asians (specifically Austronesians) consist mainly of the "Fujian ancestry" component, which is associated with the spread of rice cultivation. Contemporary East Asians (most notably Sino-Tibetan speakers) consist mostly of Yellow River ancestry, associated with both millet and rice cultivation. "East Asian Highlanders" (Tibetans) consist of both the Ancient Tibetan ancestry and Yellow River ancestry. Japanese people were found to have a tripartite origin; consisting of Jōmon ancestry, Amur ancestry, and Yellow River ancestry. Indigenous peoples of the Americas formed from Ancient North Eurasians and from an early East Asian branch, giving rise to "Ancestral Beringians", which gave rise to both "Paleosiberians" and contemporary Native Americans. Isolated hunter-gatherer in Southeast Asia, specifically Malaysia and Thailand, such as the Semang, derive most of their ancestry from the Hoabinhian lineage.[15][16][17][18][19][20][7][21][22]
Genetic studies have revealed that East/Southeast Asian people are also relatively closely related to the indigenous South Asian hunter-gatherers, often termed "Ancient Ancestral South Indians" (AASI), and also show higher affinity to Australasian populations (AA), such as Melanesians. Phylogenetic data suggests that an early initial Eastern Eurasian or "eastern non-African" (ENA) meta-population trifurcated, and gave rise to Australasians (Oceanians), the AASI, as well as East/Southeast Asians, although Papuans may have also received some geneflow from an earlier group (xOoA),[23] around 2%,[24] next to additional archaic admixture in the Sahul region.
Main article: Xiongnu |
The Xiongnu, possibly a Turkic, Mongolic, Yenisseian or multi-ethnic people, were a confederation[25] of nomadic peoples who, according to ancient Chinese sources, inhabited the eastern Eurasian Steppe from the 3rd century BC to the late 1st century AD. Chinese sources report that Modu Chanyu, the supreme leader after 209 BC, founded the Xiongnu Empire.[26]
According to data compiled from several genetic studies, Xiongnu samples were found to have approximately 58% East Eurasian ancestry, represented by a bronze age population from Khövsgöl, Mongolia. The rest of the Xiongnu's ancestry (~40%) was related to West Eurasians, represented by the Gonur Depe BMAC population of Central Asia, and the Sintashta culture of the Western steppe.[27][28]
Over the past decade, Chinese archaeologists have published several reviews regarding the results of excavations in Xinjiang. They imply the genetic composition of Xiongnu's supreme ruling class. Particularly interesting are the tombs in the cemetery at Heigouliang, Xinjiang (the Black Gouliang cemetery, also known as the summer palace of the Xiongnu king), east of the Barkol basin, near the city of Hami.
By typing results of DNA samples during the excavation of one of the tombs, it was determined that of the 12 men: 6 Q1a* (not Q1a1-M120, not Q1a1b-M25, not Q1a2-M3), 4 Q1b-M378, 2 Q* (not Q1a, not Q1b: unable to determine subclades):[29]
In a paper (Lihongjie 2012), the author analyzed the Y-DNAs of the ancient male samples from the 2nd or 1st century BCE cemetery at Heigouliang in Xinjiang – which is also believed to be the site of a summer palace for Xiongnu kings – which is east of the Barkol basin and near the city of Hami. The Y-DNA of 12 men excavated from the site belonged to Q-MEH2 (Q1a) or Q-M378 (Q1b). The Q-M378 men among them were regarded as hosts of the tombs; half of the Q-MEH2 men appeared to be hosts and the other half as sacrificial victims.
In a study of Xiongnu remains belonging to the Egiin Gol necropolis in Northern Mongolia, a majority (89%) of their mitochondrial sequences can be classified as belonging to East Asian haplogroups, and nearly 11% belong to European haplogroups.[30]
Five female Xiongnu samples belonged to the maternal haplogroups D4b2b4, N9a2a, G3a3, D4a6 and D4b2b2b.[31]
The origins of the Xianbei are unclear. Chinese anthropologists Zhu Hong and Zhang Quanchao studied Xianbei crania from several sites in Inner Mongolia and noticed that anthropological features of studied Xianbei crania show that the racial type is closely related to the modern East-Asian Mongoloids, and some physical characteristics of those skulls are closer to modern Mongols, Manchu, Korean and Northern Han Chinese.[32]
A genetic study published in the American Journal of Physical Anthropology in August 2018 noted that the paternal haplogroup C2b1a1b has been detected among the Xianbei and the Rouran, and was probably an important lineage among the Donghu people.[33]
Genetic studies published in 2006 and 2015 revealed that the mitochondrial haplogroups of Xianbei remains were of East Asian origin. According to Zhou (2006) the maternal haplogroup frequencies of the Tuoba Xianbei were 43.75% haplogroup D, 31.25% haplogroup C, 12.5% haplogroup B, 6.25% haplogroup A and 6.25% "other." And the inner Mongolia of Tuoba Xianbei [34] According to Hong Zhu (2007) The maternal haplogroup frequencies of the Tuoba Xianbei in the Inner Mongolia Autonomous Region were 40.10% Haplogroup O (Y-haplo), 30.16% Haplogroup C (Y-haplo) and 27.94% "other"[35]
Zhou (2014) obtained mitochondrial DNA analysis from 17 Tuoba Xianbei, which indicated that these specimens were, similarly, completely East Asian in their maternal origins, belonging to haplogroups D, C, B, A, O and haplogroup G.[36][35]
The research also found a relation between Xianbei individuals with modern Oroqen, Ewenki and Outer Mongolian people. Especially Tungusic Oroqen show close relation to Xianbei.[37]
Haplogroup C3b2b1*-M401(xF5483)[38][39][40] has been identified as a possible marker of the Aisin Gioro and is found in ten different ethnic minorities in northern China, but completely absent from Han Chinese.[41][42][40]
Genetic testing also showed that the haplogroup C3b1a3a2-F8951 of the Aisin Gioro family came to southeastern Manchuria after migrating from their place of origin in the Amur river's middle reaches, originating from ancestors related to Daurs in the Transbaikal area. The Tungusic speaking peoples mostly have C3c-M48 as their subclade of C3 which drastically differs from the C3b1a3a2-F8951 haplogroup of the Aisin Gioro which originates from Mongolic speaking populations like the Daur. Jurchen (Manchus) are a Tungusic people. The Mongol Genghis Khan's haplogroup C3b1a3a1-F3796 (C3*-Star Cluster) is a fraternal "brother" branch of C3b1a3a2-F8951 haplogroup of the Aisin Gioro.[43]
A genetic test was conducted on seven men who claimed Aisin Gioro descent with three of them showing documented genealogical information of all their ancestors up to Nurhaci. Three of them turned out to share the C3b2b1*-M401(xF5483) haplogroup, out of them, two of them were the ones who provided their documented family trees. The other four tested were unrelated.[44] The Daur Ao clan carries the unique haplogroup subclade C2b1a3a2-F8951, the same haplogroup as Aisin Gioro and both Ao and Aisin Gioro only diverged merely a couple of centuries ago from a shared common ancestor. Other members of the Ao clan carry haplogroups like N1c-M178, C2a1b-F845, C2b1a3a1-F3796 and C2b1a2-M48. People from northeast China, the Daur Ao clan and Aisin Gioro clan are the main carriers of haplogroup C2b1a3a2-F8951. The Mongolic C2*-Star Cluster (C2b1a3a1-F3796) haplogroup is a fraternal branch to Aisin Gioro's C2b1a3a2-F8951 haplogroup.[45]
However, the modern Manchu people show relatively high amounts of Haplogroup O2, which is common among the Han Chinese. A study on the Manchu population of Liaoning reported that they have a close genetic relationship and significant admixture signal with northern Han Chinese. The Liaoning Manchu were formed from a major ancestral component related to Yellow River farmers and a minor ancestral component linked to ancient populations from Amur River Basin, or others. The Manchu were therefore an exception to the coherent genetic structure of Tungusic-speaking populations, likely due to the large-scale population migrations and genetic admixtures in the past few hundred years.[46]
Main article: Genetic and anthropometric studies on Japanese people |
Main article: Jōmon people |
Jōmon people is the generic name of people who lived in the Japanese archipelago during the Jōmon period. Today most Japanese historians believe that the Jomon people were not one homogeneous people but were at least two or three distinct groups.[47]
Recent full genome analyses in 2020 by Boer et al. 2020 and Yang et al. 2020, reveal some further information regarding the origin of the Jōmon peoples. They were found to have largely formed from a Paleolithic Siberian population and an East Asian related population.[16][48]
These indigenous people had their own unique lifestyle and culture within Japan for thousands of years. The analyses found that the Jomon maintained a small effective population size of around 1,000 over several millennia, the Jōmon lineage split from modern East Asians between 15,000BC and 20,000BC, but after the divergence of Ancestral Native Americans about ~25,000BC, and became largely isolated from outside populations, but received gene flow from a population related to the Upper-Paleolithic Yana RHS sample from Northern Siberia, a deeply European-related population, also known as Ancient North Eurasians, and widespread in North Eurasia before the Last Glacial Maximum along a North to South cline. Niall Cooke, one of the researchers explained that these results strongly suggest a prolonged period of isolation from the rest of the continent until the introduction of new immigrants associated with wet-rice farming during the Yayoi period of Japanese history.[49]
See also: Yayoi people |
The Yayoi people were migrants to the Japanese archipelago from Asia (Korea or China) during the Yayoi period (1000 BCE–300 CE) and Kofun period (250–538 CE). They are seen as direct ancestors of the modern Yamato people, the majority of Japanese and of the Ryukyuan people. It is estimated that modern Japanese share in average about 90% of their genome with the Yayoi.[50]
This group is thought to have marked the arrival of paddy field rice cultivation which led to an agricultural revolution in the archipelago until the following period.[49]
A study published in the journal Science Advances conducted by a team of Japanese and Irish researchers at Trinity College Dublin[51] found that the people of Japan bore genetic signatures from three ancient populations rather than just two as previously thought, with ancestry stemming from a third migration that occurred during the Kofun period.[52] The study found that up to 71% of the ancient Kofun people shared a common genetic strand with the Han Chinese while the rest shared with the Yayoi people and the Jōmon people.
In addition, The Nikkei published an article that showed the Kofun strand in modern day Japanese was concentrated in specific regions such as Kinki, Hokuriku and Shikoku.[53]
A recent study conducted in 2022 by the University of Vienna discovered that Ancient Japanese people were also present in the Korean Peninsula, specifically in the Gaya region (present day South Gyeongsang Province).[54] According to the study, genome samples of eight different individuals from the ancient Gaya kingdom were examined where six of them were closest to Modern Koreans, Modern Japanese, Kofun Japanese (the Kofun genomes are contemporaneous with individuals from the study) and Neolithic Koreans.[54] The remaining two bore closer resemblance to the Ancient Jōmon people,[54] suggesting that the Ancient Japanese existed outside of the Japanese islands than previously assumed.
A new study in 2022 conducted by the University of Xiamen shed light on the lack of Jōmon genomes found in present-day Koreans and Japanese.[55] Researchers discovered that despite finding evidence of the Jōmon people on the Korean peninsula and the Japanese archipelago, there were little to no traces left of their genetic impact in their respective people's gene pools.[55] According to the study, Ancient Koreans were composed of "northern East Asian-related ancestry and indigenous Jōmon-related ancestry" where the "northern East Asian ancestry was suggested to be related to the Neolithic West Liao River farmers in northeast China".[55] The finding indicated that the "West Liao River-related farmers might have spread the proto-Korean language as their ancestry was found to be predominant in extant Koreans" and these "Proto-Korean groups, in turn, introduced West Liao River-like ancestry into the gene pool of present-day Japan".[55] These people are thought to have caused the displacement of the indigenous Jōmon people causing a significant diminishment of Jōmon genomes in the regions. It was deduced that this event (and the populations remaining genetically homogeneous since then) was what caused modern Koreans and Japanese to share the majority of their genetic makeup[55] as the latter group "can be represented as a mixture of Koreans (91%) with a limited genetic heritage from a basal East Asian lineage related to Jōmon (9%)".[56][55]
See also: Japanese people |
The main paternal haplogroups of modern Yamato Japanese are Haplogroup D-M55 (today ~33%, with the frequency in various samples ranging from 18/70 = 25.7% in a sample from Tokushima Prefecture[57] to 24/59 = 40.7% in a sample of Japanese male volunteers[58] and 11/27 = 40.7% in a sample from Aomori Prefecture[59]), Haplogroup O-M176 (O1b2) (today ~32%, range 37/142 = 26.1% in a sample of Japanese[60] to 35/97 = 36.1% in a sample from Western Japan[61]), Haplogroup O-M122 (O2, formerly O3) (today ~20%, ranging from 4/59 = 6.8% in a sample of Japanese volunteers[58] and 11/102 = 10.8% in a sample of adults from Fukuoka[62] to 25/104 = 24.0% in a sample from Kyushu[63] and 38/157 = 24.2% in a pooled sample of Japanese from Tokushima, Yamaguchi, and Ibaraki[64]), Haplogroup C-M217 (C2, today ~6%, ranging from 0/26 = 0.0% in a sample from Aomori,[57] 1/61 = 1.6% in a sample from Shizuoka,[57] 1/47 = 2.1% in a sample from Kagawa,[65] and 3/137 = 2.2% in a sample from the Kantō region[61] to 15/206 = 7.3% in a sample from Sapporo,[62] 18/241 = 7.5% in a sample from Osaka,[62] 8/104 = 7.7% in a sample from Kyushu,[63] and 8/102 = 7.8% in a sample from Fukuoka[62]), and Haplogroup C-M8 (C1a1, today ~6%, ranging from 3/129 = 2.3% in a sample of male students of Saga Prefectural Chienkan High School[66] to 7/70 = 10.0% in a sample from Tokushima[57]).[61][67][59][62] Haplogroups N-M231, O-M119, O-K18, and Q-M242 also have been observed with low frequency among present-day Japanese.
A comprehensive study of worldwide Y-DNA diversity (Underhill et al. 2000) included a sample of 23 males from Japan, of whom eight (35%) belonged to haplogroup D-M174, six (26%) belonged to O-M175, five (22%) belonged to O-M122, three (13%) belonged to C-M8 and C-M130, and one (4.3%) belonged to N-M128.[68]
Among 259 males from Japan (70 from Tokushima, 61 from Shizuoka, 53 from Kyūshū, 45 from Okinawa, 26 from Aomori, and 4 Ainus) whose Y-DNA has been examined in a 2005 study by Michael F. Hammer, ninety (34.7%) belong to haplogroup D-M55, eighty-two (31.7%) belong to haplogroup O-P31 (including 22% O-47z, 7.7% O-M176(x47z), and 1.9% O-M95(xM111)), fifty-two (20.1%) belong to haplogroup O-M122, fourteen (5.4%) belong to haplogroup C-M8, ten (3.9%) belong to haplogroup NO-M214(xO-M175) (including 2.3% NO-M214(xO-M175, N-LLY22g), 1.2% haplogroup N-LLY22g(xM128, P43, M178), and 0.4% haplogroup N-M178), and eight (3.1%) belong to haplogroup C-M217 (including 1.9% haplogroup C-M217(xM86) and 1.2% haplogroup C-M86).[69]
The patrilines belonging to D-P37.1 were found in all the Japanese samples, but were more frequently found in Okinawa (55.6%) samples and less frequently found in the Tokushima (25.7%) and Kyūshū samples (26.4%).[69] Haplogroups O-M175 and C-M8 were not found in the small Ainu sample of four individuals, and C-M217 was not found in the Okinawa sample of 45 individuals.[69] Haplogroup N was detected in samples of Japanese from Aomori (2/26 N-LLY22g(xM128, P43, M178)), Shizuoka (1/61 N-LLY22g(xM128, P43, M178)), and Tokushima (1/70 N-M178), but was not found in the Kyūshū, Okinawa, or Ainu samples.[69] This study, and others, report that Y-chromosome patrilines crossed from the Asian mainland into the Japanese archipelago, and continue to make up a large proportion of the Japanese male lineage.[70] If focusing haplogroup O-P31 in those researches, the patrilines derived from its subclade O-SRY465 are frequently found in both Japanese (mean 32%,[71] with frequency in various samples ranging from 26%[72][73] to 36%[74]) and Koreans (mean 30%,[75] with frequency in various samples ranging from 19%[72][76] to 40%[74]). According to the research, these patrilines have undergone extensive genetic admixture with the Jōmon period populations previously established in Japan.[69]
A 2007 study by Nonaka et al. reported that among a total of 263 healthy unrelated Japanese male individuals born in 40 of the 47 prefectures of Japan, but especially Tokyo (n=51), Chiba (n=45), Kanagawa (n=14), Saitama (n=13), Shizuoka (n=12), and Nagano (n=11), the frequencies of the D2, O2b, and O3 lineages were 38.8%, 33.5%, and 16.7%, respectively, which constituted approximately 90% of the Japanese population. Haplogroup diversity for the binary polymorphisms was calculated to be 86.3%.[61]
Poznik et al. (2016) have reported that the males in the JPT (Japanese in Tokyo, Japan) sample[77] of the 1000 Genomes Project are 20/56 = 36% D2-M179, 18/56 = 32% O2b-M176, 10/56 = 18% O3-M122, 4/56 = 7.1% C1a1-M8, 2/56 = 3.6% O2a-K18, and 2/56 = 3.6% C2-M217.[78]
In a project approved by the Ethics Committee of Tokai University School of Medicine, Ochiai et al. (2016) have reported finding D-M174 (rs2032602 T>C) in 24/59 (40.7%), O-M268 (rs13447443 A>G) in 21/59 (35.6%), C-M130 (rs35284970 C>T) in 8/59 (13.6%), O-P198 (rs17269816 T>C) in 4/59 (6.8%), N-M231 (rs9341278 G>A) in 1/59 (1.7%), and O-P186(xM268, P198) (rs16981290 C>A, rs13447443 A, rs17269816 T) in 1/59 (1.7%) of a sample obtained through buccal swabs from Japanese male volunteers (n = 59) who had given informed consent to participate in the study.[58]
According to an analysis of the 1000 Genomes Project's sample of Japanese collected in the Tokyo metropolitan area, the mtDNA haplogroups found among modern Japanese include D (42/118 = 35.6%, including 39/118 = 33.1% D4 and 3/118 = 2.5% D5), B (16/118 = 13.6%, including 11/118 = 9.3% B4 and 5/118 = 4.2% B5), M7 (12/118 = 10.2%), G (12/118 = 10.2%), N9 (10/118 = 8.5%), F (9/118 = 7.6%), A (8/118 = 6.8%), Z (4/118 = 3.4%), M9 (3/118 = 2.5%), and M8 (2/118 = 1.7%).[79]
A 2011 SNP consortium study done by the Chinese Academy of Sciences and Max Planck Society consisting of 1719 DNA samples determined that Koreans and Japanese clustered near to each other, confirming the findings of an earlier study that Koreans and Japanese are related.[80]
A 2008 study about genome-wide SNPs of East Asians by Chao Tian et al. reported that Japanese along with other East Asians such as Joseon Koreans and Han Chinese are genetically distinguishable from Southeast Asians [81] Another study (2017) shows a relative strong relation between all East and Southeast Asians.[82]
Hideo Matsumoto, professor emeritus at Osaka Medical College tested Gm types, genetic markers of immunoglobulin G, of Japanese populations for a 2009 study.[83] According to this study, the Gm ab3st gene is found at notably high frequencies across eastern Siberia, northern China, Korea, Mongolia, Japan, and Tibet.[83] The mean frequency of Gm ab3st for the mainstream Japanese population was found to be 26.0%, with a peak in the Yaeyama Islands (36.4% Yonaguni, 32.1% Ishigaki) among all populations in Japan and peaks in Akita (29.5%) and Shizunai (28.3%) among mainstream Japanese.[83] On mainland Asia, peak frequencies of Gm ab3st were found among Oroqen (44.0%) and Tungus (30.0%) in northeast China and among the north Baikal Buryats (30.7%); however, this gene is also frequent among Eskimos (25.4% Alaska, 24.7% Greenland, 20.5% Chaplin, Russia), Luoravetlans (Koryak 20.0%, Chukchi 15.3%), and Athabaskans (New Mexico Apache 19.7%, Alaska Athabascan 14.3%), and it is not uncommon even as far west as the south shore of the Caspian Sea (8.8% Gilani, 8.5% Mazanderani).[83] Minimum frequencies of Gm ab3st were found in Yakushima (22.0%) among all populations in Japan and in Tsu (23.3%) and Ōita (23.6%) among mainstream Japanese.[83] The data from small, isolated island populations, such as those of Yonaguni, Ishigaki, and Yakushima, were not used when calculating the mean for the mainstream Japanese population.[83] The study also considered Ainu and Korean populations and found Gm ab3st with a frequency of 25.2% among Ainu in Hidaka, Hokkaido and a mean frequency of 14.5% (range 13.1% Pusan, South Korea to 18.6% Yanji, China) among Koreans.[83]
Gm afb1b3, on the other hand, is a southern marker gene possibly originating in southern China on the background of the fb1b3 gene (the modal Gm type among Caucasoids) and found at very high frequencies across southern China, Southeast Asia, Taiwan, Sri Lanka, Bangladesh, Nepal, Assam, and the Pacific Islands.[83] Professor Matsumoto has remarked that the center of dispersal of the Gm afb1b3 gene may be in the Yunnan and Guangxi area of southern China; extremely high frequencies of this gene have been observed in samples of mostly Daic peoples from this region (95.2% Shui in Sandu, Guizhou, 94.2% Zhuang in Guangxi, 91.4% Bouyei in Duyun,
Guizhou, 87.5% Miao in Guizhou, 84.0% Dai in Luxi, Yunnan) and from neighboring Laos (97.0% Laotian) and Thailand (89.9% Thai).[83] However, Gm afb1b3 is almost equally common among people in Malaysia (97.3% Kadazan on Borneo, 85.0% Malay), Indonesia (76.6% Sulawesi, 75.2% Java), the Philippines (83.6% Luzon Filipinos, 76.4% Luzon Negritos, 67.2% Mindanao Negritos), Karen people in Thailand (82.3%), Kacharis in Assam (80.9%), Cambodians (76.7%), Taiwanese aborigines (76.2%), Micronesians (88.7%), Melanesians (74.6%), and Polynesians (74.7% Cook Islands, 69.4% Hawaii).[83] The study found that the mean frequency of Gm afb1b3 was 10.6% (range 7.8% Shizunai to 13.0% Osaka) for the general Japanese population. Minimum frequencies (4.0% to 4.4%) of Gm afb1b3 were found among the native people in the Yaeyama and Miyako islands in the extreme south of Japan and among the Ainu (4.3%) in the extreme north of Japan. The author suggested that the somewhat elevated frequency of the Gm afb1b3 gene among the mainstream Japanese compared to the Sakishima islanders and the Ainu may have resulted from some admixture of the mainstream Japanese population at rates as low as 7–8% with southern Asian (from southern China or Southeast Asia as far west as Bangladesh and Nepal) populations having the Gm afb1b3 gene in high frequency.[83]
The other Gm types observed among Japanese are ag (45.8%) and axg (17.6%), which are not so useful for discerning human migrations and genetic relationships because they appear to be retained from a common ancestor of most modern humans and are found in similar proportions (with the frequency of ag being significantly greater than the frequency of axg) in many populations all over the world (aboriginal Australians and Americans, South Asians, Caucasoids, etc.).[83]
A 2017 study conducted by Fumihiko Takeuchi, Tomohiro Katsuya, Ryosuke Kimura, and Norihiro Kato compared three genetically distinct Japanese groups, Hondu (Honshu), Ryukyu and Ainu to 26 other Asian populations to analyze the shared ancestry and genetic differentiation between the Japanese people and other Asians.[84] The study revealed for the Japanese as a whole, some genetic components from all of the Central, East, Southeast and South Asian populations are prevalent in the Japanese population with the major components of ancestry profile coming from the Korean and Han Chinese clusters.[84] The major components of the Japanese Hondo cluster are similar to the Korean (87–94%), followed by Han Chinese 1 (0–8%) clusters.[84] The genetic components from the Southeast Asian (Thais, Vietnamese and Malays) and South Asian (Sinhalese and Tamils) clusters were larger for the Ryukyu cluster – Southeast Asian (4–6%) and South Asian (4–6%) – in comparison to the results found in the Hondo cluster – Southeast Asian (0–1%) and South Asian (1–2%).[84]
An independent research conducted by director Kenichi Shinoda and his team at National Museum of Nature and Science in 2018 was broadcast on NHK Science ZERO and it also verified that genetically, the modern day Japanese are extremely close to the modern day Koreans.[85]
Genetic distance measurements from a large scale genetic study from 2021 titled 'Genomic insights into the formation of human populations in East Asia', Japanese are genetically closest to Koreans with 91% of their genetic makeup being derived from the group and the remaining from the Jōmon people.[56]
Recent research suggests that the historical Ainu culture originated from a merger of the Okhotsk culture with the Satsumon culture, cultures thought to have derived from the diverse Jōmon-period cultures of the Japanese archipelago.[87][88] According to Lee and Hasegawa of the Waseda University, the direct ancestors of the later Ainu people formed during the late Jōmon period from the combination of the local but diverse population of Hokkaido, long before the arrival of contemporary Japanese people. Lee and Hasegawa suggest that the Ainu language expanded from northern Hokkaido and may have originated from a relative more recent Northeast Asian/Okhotsk population, which established themselves in northern Hokkaido and had significant impact on the formation of Hokkaido's Jōmon culture.[89][90]
Recently in 2021, it was confirmed that the Hokkaido Jōmon people formed from "Jōmon tribes of Honshu" and from "Terminal Upper-Paleolithic people" (TUP people) indigenous to Hokkaido and Paleolithic Northern Eurasia. The Honshu Jōmon groups arrived about 15,000 BC and merged with the indigenous "TUP people" to form the Hokkaido Jōmon. The Ainu in turn formed from the Hokkaido Jōmon and from the Okhotsk people.[91]
See also: Koreans |
Studies of polymorphisms in the human Y-chromosome have so far produced evidence to suggest that the Korean people have a long history as a distinct, mostly endogamous ethnic group, with successive waves of people moving to the peninsula and three major Y-chromosome haplogroups.[92] The reference population for Koreans used in Geno 2.0 Next Generation is 94% Eastern Asia and 5% Southeast Asia & Oceania.[93]
Koreans are generally considered a Northeast Asian group. Koreans are overall more similar to northeast Asians than to southeast Asians. This conclusion would be expected from the general correlation between genetic variation and geography observed for human populations, and is supported here by an examination of individual mtDNA haplogroups, genetic distances between populations derived from mtDNA or Y-chromosomal data, and the apportionment of genetic diversity between different groups of populations.[95]
Studies of classical genetic markers showed that Koreans tend to have a close genetic affinity with Mongolians among East Asians. Ancient genome comparisons revealed that the genetic makeup of Koreans can be best described as an admixture of an Northern hunter-gatherer component as well as that of a influx of rice-farming agriculturalists from the Yangtze river valley, which in turn are often linked to O2-M122 and is largely male-biased.[95] Another study concludes however that O2b*-SRY465 and O2b1-47z had an in situ origin among Northeast Asians, particularly among the prehistoric Koreans, rather than in southern China or Southeast Asia as previously envisaged. The combination of the O2b initial settlement (which became an indigenous proto-Korean component) in part with the relatively recent O3 and C3 lineages (which include a Chinese component) explains some of the main events formulating the current Y chromosome composition of the Korean population.[64] This is supported by archeological, historical and linguistic evidence, which suggest that the direct ancestors of Koreans were proto-Koreans who inhabited the northeastern region of China and the Korean Peninsula during the Neolithic (8,000-1,000 BC) and Bronze (1,500-400 BC) Ages.[64] The results from the findings in the Devil's Gate showed that the ancient populations of the area were already admixed from various sources.[96]
Linguistically, Korean shows similarities to Ainu, Nivkh and most "Altaic" languages, while those with Japanese are explained through a more recent convergence starting from the Yayoi period.[97] Those groups correlate closely to modern-day Koreans and Japanese, who form a cluster in regional comparisons, along with various Tungusic groups, such as the Ulchi, Hezhen and Oroqen. Using modern Japanese as a indicator, who show a clear signal towards modern-day populations from Taiwan, southern China, and Vietnam, the southern component of Koreans could be established.[96] A high level of genetic continuity in the region over at least the last 7700 years can be observed. Modern populations still rely on a number of hunter-gatherer-fisher practices and the cold climatic conditions likely provide an explanation for the apparent continuity and lack of major genetic turnover by exogenous farming populations, as has been documented in the case of southeast and central Europe.
One plausibility is that local hunter-gatherers progressively added food-producing practices to their original lifestyle.[96] East Asians, including Northern groups and Southern ones are closer related to another than other population groups and can trace themselves to a common ancestry from several tens of thousands of years ago.[98]
Jin Han-jun et al. (2003) said that the distribution of Y-chromosomal haplogroups shows that Koreans have a complex origin that results from genetic contributions from range expansions, most of which are from a southern-to-northern region in the continent, and genetic contributions from the northern Asian settlement.[72]
The most common Y-DNA haplogroup among present-day Koreans is Haplogroup O-M122 (O2, formerly O3), which is a very ancient (formed 34,042 [95% CI 32,387 <-> 35,708] years before present, TMRCA 24,824 [95% CI 23,125 <-> 26,538] years before present according to Karmin et al. 2015[99]) and frequently observed Y-DNA haplogroup among East and Southeast Asians in general.[100][101] Haplogroup O2-M122 has been found in approximately 41% of sampled Korean males, including 23/79 = 29.1% Korean Chinese,[74] 29/85 = 34.1% Korean (collected in Seoul by the Catholic University of Korea),[74] 16/45 = 35.6% Korean,[102] 109/300 = 36.3% unrelated Korean males obtained from the National Biobank of Korea,[103] 31/84 = 36.9% Gyeongsang,[64] 24/63 = 38.1% Gangwon,[64] 17/43 = 39.5% Korean (Korea),[65] 10/25 = 40.0% Korean (China),[65] 30/75 = 40.0% Korea,[57] 55/133 = 41.4% Daejeon,[104] 23/55 = 41.8% Korean from the KoVariome database,[105] 65/154 = 42.2% Korean,[106][107] 242/573 = 42.2% Seoul,[104] 39/90 = 43.3% Jeolla,[64] 38/87 = 43.7% Jeju,[64] 102/216 = 47.2% Seoul & Daejeon,[76] 36/72 = 50.0% Chungcheong,[64] 56/110 = 50.9% Seoul-Gyeonggi.[64]
The second most common Y-DNA haplogroup among present-day Koreans is Haplogroup O-M176 (O1b2, formerly O2b), a clade that probably has spread mainly from somewhere in the Korean Peninsula or its vicinity.[64][108] Haplogroup O1b2-M176 has been found in approximately 31% of sampled Korean males (including 31/154 = 20.1% Korean,[107] 58/216 = 26.9% Seoul & Daejeon,[76] 12/43 = 27.9% Koreans in Korea,[65] 7/25 = 28.0% Koreans in China,[65] 31/110 = 28.2% Seoul-Gyeonggi,[64] 25/84 = 29.8% Gyeongsang,[64] 22/72 = 30.6% Chungcheong,[64] 28/90 = 31.1% Jeolla,[64] 14/45 = 31.1% O-M175(xO1a-M119, O2a1-M95, O3-M122) Korean,[102] 181/573 = 31.6% Seoul,[104] 28/87 = 32.2% Jeju,[64] 28/85 = 32.9% Korean (collected in Seoul by the Catholic University of Korea),[74] 45/133 = 33.8% Daejeon,[104] 28/75 = 37.3% Korea,[57] 113/300 = 37.7% unrelated Korean males obtained from the National Biobank of Korea,[103] 25/63 = 39.7% Gangwon,[64] 38/79 = 48.1% Korean Chinese[74]).[72][65][69][109][76]
Korean males also exhibit a moderate frequency (approximately 15%) of Haplogroup C-M217. About 2% of Korean males belong to Haplogroup D-M174 (0/216 = 0.0% DE-YAP,[76] 3/300 = 1.0% DE-M145,[103] 1/68 = 1.5% DE-YAP(xE-SRY4064),[65] 8/506 = 1.6% D1b-M55,[64] 3/154 = 1.9% DE, 18/706 = 2.55% D-M174,[110] 5/164 = 3.0% D-M174,[74] 1/75 D1b*-P37.1(xD1b1-M116.1) + 2/75 D1b1a-M125(xD1b1a1-P42) = 3/75 = 4.0% D1b-P37.1,[69] 3/45 = 6.7% D-M174[111]). The D1b-M55 subclade has been found with maximal frequency in a small sample (n=16) of the Ainu people of Japan, and is generally frequent throughout the Japanese Archipelago.[63] Other haplogroups that have been found less commonly in samples of Korean males are Y-DNA haplogroup N-M231 (approx. 4%), haplogroup O-M119 (approx. 3%), haplogroup O-M268(xM176) (approx. 2%), haplogroup Q-M242 and Haplogroup R1 (approx. 2% total), J, Y*(xA, C, DE, J, K), L, C-RPS4Y(xM105, M38, M217), and C-M105.[64][65]
He Miao et al. (2009) created an artificial combination of equal parts of the Y-chromosomes of the HapMap samples of Han Chinese in Beijing and Japanese in Tokyo. The study said that this artificial combination resembled five populations which included Koreans in South Korea and Koreans in China.[112]
Studies of Korean mitochondrial DNA lineages have shown that there is a high frequency of Haplogroup D4, ranging from approximately 23% (11/48) among ethnic Koreans in Arun Banner, Inner Mongolia[113] to approximately 32% (33/103) among Koreans from South Korea.[73][114] Haplogroup D4 is the modal mtDNA haplogroup among northern East Asians (Japanese, Ryukyuans, Koreans, Manchus, Tuoba Xianbei, Oroqens, Manchurian Evenks, Daurs, Mongols, northern Han Chinese, Tibetans) in general, with a peak frequency among Japanese and Ryukyuans in Japan. Haplogroup B, which occurs very frequently in many populations of Southeast Asia, Polynesia, and the Americas, is found in approximately 10% (5/48 ethnic Koreans from Arun Banner, Inner Mongolia) to 20% (21/103 Koreans from South Korea) of Koreans.[113][114] Haplogroup A has been detected in approximately 7% (7/103 Koreans from South Korea) to 15% (7/48 ethnic Koreans from Arun Banner, Inner Mongolia) of Koreans.[113][114][73] Haplogroup A is the most common mtDNA haplogroup among the Chukchi, Eskimo, Na-Dene, and many Amerind ethnic groups of North and Central America.
The other half of the Korean mtDNA pool consists of an assortment of various haplogroups, each found with relatively low frequency, such as G, N9, Y, F, D5, M7, M8, M9, M10, M11, R11, C, and Z.
Hwan Young Lee et al. (2006) studied a sample of 694 Koreans and found the following mtDNA distribution: 32.56% (226/694) D (including 188/694 = 27.09% D4, 37/694 = 5.33% D5, and 1/694 = 0.14% D6a), 14.84% (103/694) B (including 23/694 = 3.31% B4a, 22/694 = 3.17% B4b, 20/694 = 2.88% B5b, 12/694 = 1.73% B4c, 12/694 = 1.73% B4*, 10/694 = 1.44% B5a, 2/694 = 0.29% B4d, and 2/694 = 0.29% B4f), 9.65% (67/694) G (including 27/694 = 3.89% G2a, 18/694 = 2.59% G1a, 11/694 = 1.59% G3, 9/694 = 1.30% G*, 1/694 = 0.14% G2c, and 1/694 = 0.14% G4), 8.79% (61/694) A (including 23/694 = 3.31% A5a, 3/694 = 0.43% A5b, 3/694 = 0.43% A5c, 25/694 = 3.60% A4, and 7/694 = 1.01% A(xA4, A5)), 8.36% (58/694) F, 8.21% (57/694) N9 (including 18/694 = 2.59% N9a2a, 11/694 = 1.59% N9a2*, 11/694 = 1.59% N9a1, 9/694 = 1.30% Y1b, 4/694 = 0.58% N9a*, 2/694 = 0.29% N9b, and 2/694 = 0.29% Y2), 7.78% (54/694) M7 (including 25/694 = 3.60% M7b, 20/694 = 2.88% M7c, and 9/694 = 1.30% M7a), 4.76% (33/694) M8'CZ (including 17/694 = 2.45% C, 7/694 = 1.01% M8a, 7/694 = 1.01% Z, and 2/694 = 0.29% pre-Z), 1.87% (13/694) M9, 1.73% (12/694) M10, 0.72% (5/694) M11, 0.29% (2/694) R11, 0.14% (1/694) R9b, 0.14% (1/694) M12, and 0.14% (1/694) M*.[115]
Derenko et al. (2007) examined a sample of 103 Koreans from South Korea and found that they belonged to the following mtDNA haplogroups: 39.8% D (including 32.0% D4 and 7.8% D5), 20.4% B (including 12.6% B4 and 7.8% B5), 9.7% M7, 6.8% A (including 3.9% A5 and 2.9% A4(xA2, A8)), 6.8% G (including 2.9% G2a, 1.9% G1, and 1.9% G3), 4.9% F1, 4.9% M8a2, 2.9% N9a, 1.9% M9a, 1.0% C(xC1, C4, C5), and 1.0% Y.[114]
A study of the mtDNA of 708 Koreans sampled from six regions of South Korea (134 from Seoul-Gyeonggi, 118 from Jeolla, 117 from Chungcheong, 114 from Gangwon, 113 from Jeju, and 112 from Gyeongsang) found that they belonged to haplogroup D (35.5%, including 14.7% D4(xD4a, D4b), 7.8% D4a, 6.5% D5, 6.4% D4b, and 0.14% D(xD4, D5)), haplogroup B (14.8%, including 11.0% B4 and 3.8% B5), haplogroup A (8.3%), haplogroup M7 (7.6%), haplogroup F (7.1%), haplogroup M8'CZ (6.5%), haplogroup G (6.1%), haplogroup N9a (5.2%), haplogroup Y (3.8%), haplogroup M9 (2.7%), haplogroup M10 (1.6%), haplogroup M11 (0.42%), haplogroup N(xN9, Y, A, F, B4, B5) (0.28%), and haplogroup N9(xN9a) (0.14%).[116]
A study of 1094 individuals in the Korean Genome Project found that they belonged to haplogroup D (34.19%), haplogroup B (13.89%), haplogroup M(xC, D, G, Z) (13.8%), haplogroup A (8.32%), haplogroup G (8.23%), haplogroup F (7.86%), haplogroup N(xA, B, F, R, Y) (5.76%), haplogroup C (3.02%), haplogroup R(xB, F) (2.01%), haplogroup Y (1.74%), and haplogroup Z (1.19%).[117] The individuals sampled for the Korean Genome Project are mostly from the Ulsan metropolitan region.[117]
Jin Han-jun et al. (1999) said that based on genetic studies of classic genetic markers of protein and nuclear DNA, Koreans tend to be closely genetically related to Mongols among East Asians, which is supported by the following studies: Goedde et al. (1987); Saha & Tay (1992); Hong et al. (1993); and Nei & Roychoudhury (1993). The study said that the mtDNA 9‐bp deletion frequency in the intergenic COII/tRNALys region of Mongols (5.1%) is lower than that of Chinese (14.2%), Japanese (14.3%) and Koreans (15.5%). The study said that these 9‐bp deletion frequencies suggest that Koreans are closely related to Japanese and Chinese and that Koreans are not so closely related to Mongols. The study said that the homogeneity in the 9-bp deletion frequencies among Chinese (14.2%), Japanese (14.3%), and Koreans (15.5%), only spanning from a low of 14.2% for Chinese to a high of 15.5% for Koreans, indicates that very few mtDNA are differentiated in these three populations. The study said that the 9‐bp deletion frequencies for Vietnamese (23.2%) and Indonesians (25.0%), which are the two populations constituting Mongoloid Southeast Asians in the study, are relatively high frequencies when compared to the 9-bp deletion frequencies for Mongols(5.1%), Chinese (14.2%), Japanese (14.3%) and Koreans (15.5%), which are the four populations constituting Northeast Asians in the study. The study said that these 9-bp deletion frequencies are consistent with earlier surveys which showed that 9-bp deletion frequencies increase going from Japan to mainland Asia to the Malay Peninsula, which is supported by the following studies: Horai et al. (1987); Hertzberg et al. (1989); Stoneking & Wilson (1989); Horai (1991); Ballinger et al. (1992); Hanihara et al. (1992); and Chen et al. (1995). The study said that Cavalli-Sforza's chord genetic distance (4D), from Cavalli-Sforza & Bodmer (1971), which is based on the allele frequencies of the intergenic COII/tRNALys region, showed that Koreans are more genetically related to Japanese than Koreans are genetically related to the other East Asian populations which were surveyed. The Cavalli-Sforza's chord genetic distance (4D) between Koreans and other East Asian populations in the study, from least to greatest, are as follows: Korean to Japanese (0.0019), Korean to Chinese (0.0141), Korean to Vietnamese (0.0265), Korean to Indonesian (0.0316) and Korean to Mongols (0.0403). The study said that the close genetic affinity between present-day Koreans and Japanese is expected due to the Yayoi migration from China and the Korean Peninsula to Japan which began about 2,300 years ago, a migration that is supported by the following studies: Chard (1974); Hanihara (1991); Hammer & Horai (1995); Horai et al. (1996); Omoto & Saitou (1997). The study said that Horai et al. (1996) detected mtDNA D-loop variation which supports the idea that a large amount of maternal lineages came into Japan from immigrants from the Korean Peninsula after the Yayoi period.[118]
Wook et al. (2000) said that Chu et al. (1998) found that phylogeny which was based on 30 microsatellites indicated that Korean people were closely related to Chinese people from Manchuria and Yunnan, but Kim Wook et al. (2000) found that the high incidence of the DXYS156Y-null variant in northeast Chinese implied that it is possible to exclude these northeastern Chinese populations from being sources which are significant in Korean people. The phylogenetic analysis done by Wook et al. (2000) indicated that Japanese people are genetically closer to Korean people than Japanese people are genetically related to any of the following peoples: Mongolians, Chinese, Vietnamese, Indonesians, Filipinos, and Thais. The study said that mainland Japanese having Koreans as their closest genetic population is consistent with the following previous studies: Hammer and Horai (1995); Horai et al. (1996); and Kim et al. (1998). The study found that Koreans are more genetically homogenous than the Japanese, and the study said that this might be due to different sizes of the founding populations and range expansions. The study said that the moderate mean Y-chromosome haplotype diversity value for Koreans might be the result of migrations from East Asia that had a homogenizing influence.[119]
Kim Jong-jin et al. (2005) did a study about the genetic relationships among East Asians based on allele frequencies, particularly focusing on how close Chinese, Japanese and Koreans are genetically related to each other. Most Koreans were hard to distinguish from Japanese, and the study was not able to clearly distinguish Koreans and Japanese. Koreans and Japanese clustered together in the principal component analysis and the best least-squares tree. The study said that "[c]ommon ancestry and/or extensive gene flow" historically between Koreans and Japanese appears to be "likely" and results in a lot of difficulty finding population-specific alleles that could assist in differentiating Koreans and Japanese.[120]
Jung Jongsun et al. (2010) used the following Korean samples for a study: Southeast Korean (sample regions: Gyeongju, Goryeong and Ulsan), Middle West Korean (sample regions: Jecheon, Yeoncheon, Cheonan and Pyeongchang) and Southwest Korean (sample regions: Gimje, Naju and Jeju). Due to political reasons, the study said that it did not use North Korean samples, but the study said that the "historical migration event of Baekje from Goguryeo Empire (BC 37–AD 668) in Northern Korea imply that Northern lineages remain in South Korea." The study said that the "Northern people of the Goguryeo Empire" are closely related to Mongolians, and the study said that this group of people ruled most of Southwest Korea. The study said that "some of the royal families and their subjects in the Goguryeo Empire moved to this region and formed the Baekje Empire in BC 18–22." Southwest Koreans are closer to Mongolians in the study's genome map than the other two Korean regions in the study are close to Mongolians. Southwest Koreans also display genetic connections with the HapMap sample of Japanese in Tokyo, and, in the neighbor joining tree, the nodes for Southwest Korea is close to Japan. In the study's Korea-China-Japan genome map, some Southwest Korean samples overlap with samples from Japan. The study said that the fairly close relationship, in both the study's genetic structure analysis and genome map, of the Jeju Southwest Korean sample and the HapMap sample of Japanese in Tokyo, Japan, has made the evolutionary relationship of Chinese, Japanese and Koreans become clearer. Southeast Koreans display some genetic similarities with people of Kobe, Japan, which indicates that there might have been links between these regions. The study said that it is possible that outliers in the Gyeongju sample, one of the sampled Southeast Korean regions, and outliers in the Kobe, Japan, sample both have Siberian lineage due to Southeast Koreans having connections with Siberian lineages with respect to grave patterns and culture. The overall result for the study's Korea-Japan-China genome map indicates that some signals for Siberia remain in Southeast Korea. In contrast to the Gyeongju sample, the Goryeong and Ulsan samples, which are both Southeast Korean samples, displayed average signals for the Korean Peninsula. The study said that Middle West Korea was a melting pot in the Korean Peninsula with people traveling from North to South, South to North, and people traveling from East China, including from the Shandong Peninsula. Western Chinese, which included those in the Shandong Peninsula, travelled across the Yellow Sea, and these Western Chinese lived and traded in both China and Korea. In the study's genome map, Middle West Koreans are close to the HapMap sample of Han Chinese in Beijing and, in the neighbor joining tree, the nodes for Middle West Korea is close to China. The overall result for the study's Korea-Japan-China genome map indicates that Middle West Korea displays an average signal for South Korea. Chinese people are located between Korean and Vietnamese people in the study's genome map.[121][122]
Kim Young-jin and Jin Han-jun (2013) said that principal component analysis had Korean HapMap samples clustering with neighboring East Asian populations which were geographically nearby them such as the Chinese and Japanese. The study said that Koreans are genetically closely related to Japanese in comparison to Koreans' genetic relatedness to other East Asians which included the following East and Southeast Asian peoples: Tujia, Miao, Daur, She, Mongols, Naxi, Cambodians, Oroqen, Yakuts, Yi, Southern Han Chinese, Northern Han Chinese, Hezhen, Xibo, Lahu, Dai and Tu. The study said that the close genetic relatedness of Koreans to Japanese has been reported in the following previous studies: Kivisild et al. (2002); Jin et al. (2003); Jin et al. (2009); and Underhill and Kivisild (2007). The study said that Jung et al. (2010) said that there is a genetic substructure in Koreans, but the study said that it found Korean HapMap individuals to be highly genetically similar. The study said that Jin et al. (2009) found that Koreans from different populations are not different in a significant way which indicates that Koreans are genetically homogenous. The study said that the affinity of Koreans is predominately Southeast Asian with an estimated admixture of 79% Southeast Asian and 21% Northeast Asian Koreans, but the study said that this does not mean that Koreans are heterogenous, because all of the Koreans which were analyzed uniformly displayed a dual pattern of Northeast Asian and Southeast Asian origins. The study said that Koreans and Japanese displayed no observable difference between each other in their proportion of Southeast Asian and Northeast Asian admixture. The study said the 79% Southeast Asian and 21% Northeast Asian admixture estimate for Koreans is consistent with the interpretation of Jin et al. (2009) that Koreans descend from a Northeast Asian population which was subsequently followed by a male-centric migration from the southern region of Asia which changed both the autosomal composition and Y-chromosomes in the Korean population.
Veronika Siska et al. (2017) said that the Ulchi people are genetically closest in the study's panel to the human remains from the Devil's Gate Cave which are dated to about 7,700 years ago. Modern Korean and Japanese, the Oroqen people and the Hezhen people display a high affinity to the human remains from Devil's Gate Cave. Considering the geographic distance of Amerindians from Devil's Gate Cave, Amerindians are unusually genetically close to the human remains from Devil's Gate Cave. Korean genomes display similar traits to Japanese genomes on genome-wide SNP data. In an admixture analysis, when the genes of Devil's Gate are made into a unique genetic component, this new Devil's Gate genetic component is highest in peoples of the Amur Basin, including Ulchi, and makes up more than 50% of Koreans and Japanese. It also has a sporadic distribution among other East Asians, Central Asians and Southeast Asians.[123]
Hideo Matsumoto, professor emeritus at Osaka Medical College, tested Gm types, genetic markers of immunoglobulin G, of Korean populations for a 2009 study. The Korean populations were populations in Jeju Island, Busan, Gwangju, Kongsan, Jeonju, Wonju, the Kannung of South Korea and a Korean population in Yanji. Matsumoto said that the Gm ab3st gene is a marker for northern Mongoloid possibly originating in Siberia and found at high frequencies across northeast Asia and Tibet. Matsumoto said that the average frequency of Gm ab3st for Koreans was 14.5% which was intermediate between an average frequency of 26% for general Japanese and a frequency of 11.7% which was for a Han Chinese population in Beijing. Matsumoto said that Gm afb1b3 is a southern marker gene possibly originating in southern China and found at high frequencies across Southeast Asia, southern China, Taiwan, Sri Lanka, Bangladesh, Nepal, Assam, and parts of the Pacific. However, given the result that the Okinawans being genetically most northern among the Japanese with the highest frequency of the Gm ab3st gene which is assigned to be northern, the term northern and southern used in his study is controversial. Matsumoto said that the average frequency of Gm afb1b3 for Koreans was 14.7% which was intermediate between a frequency of 10.6% for general Japanese and a frequency of 24.1% for Beijing Han Chinese. Matsumoto said that Koreans displayed the northern Mongoloid pattern, but Matsumoto said that Koreans displayed a higher frequency of the southern marker gene, Gm afb1b3, than the Japanese. Matsumoto said that "Japanese and Korean populations were originally identical or extremely close to each other", and Matsumoto said, "It seemed to be during the formation of the contemporary Korean population that such a Gm pattern intermediate between Japanese and the northern Han in China emerged." Matsumoto said that the different Gm patterns between Japanese and Koreans most likely came about from frequent inflows of Chinese and/or northern populations into the Korean Peninsula.[124]
See also: Mongolic peoples |
The Mongols are an ethnic group in northern China, Mongolia, parts of Siberia and Western Asia. They are believed to be the descendants of the Xianbei and the proto-Mongols. The former term includes the Mongols proper (also known as the Khalkha Mongols), Oirats, the Kalmyk people and the Southern Mongols. The latter comprises the Abaga Mongols, Abaganar, Aohans, Baarins, Gorlos Mongols, Jalaids, Jaruud, Khishigten, Khuuchid, Muumyangan and Onnigud. The Daur people are descendants of the para-Mongolic Khitan people.[125] Mongolians are also related to the Manchurians.
The majority of Mongolians belong to the y-DNA Haplogroup C-M217. Haplogroup C-M217 among the Mongols is characterized by a diversity that dates back to the very origin of haplogroup C-M217 (TMRCA 33,900 [95% CI 31,300 <-> 36,500] ybp[126]) and very shallow diversity in each of the frequently observed subclades: C-M504 (TMRCA 2,900 [95% CI 2,200 <-> 3,700] ybp), C-M86 (TMRCA 3,700 [95% CI 3,000 <-> 4,500] ybp[126]), C-M407 (TMRCA 4,400 [95% CI 3,500 <-> 5,200] ybp[126]), and C-F1756 (TMRCA 5,000 [95% CI 4,200 <-> 5,800] ybp[126]). Of these four subclades, C-M407 is phylogenetically extremely divergent from the others, and is more closely related to subclades of C-M217 that are found among present-day Chinese, Koreans, Japanese, and other East and Southeast Asians; however, among Mongols, C-M407 is found most frequently toward the north (among Barghuts[127] and Buryats[128] as well as the neighboring Khamnigans and Soyots[129]) and toward the west (among Dorbet Kalmyks[130][131]).
Haplogroup O-M175 and Haplogroup N-M231 are found at medium rates among present-day Mongols. The subclades of Haplogroup O-M175 that have been observed among Mongols tend to be similar to those found among Han Chinese, whereas the subclades of Haplogroup N-M231 that have been observed among Mongols tend to be similar to those found among Nenets, Nganasans, Khakasses, and Tuvans (N-B478) on the one hand or those found among Chukchi, Koryaks, and Asian Eskimos on the other (N-B197).[132] However, N-M2118, a subclade which is most often noted for its extremely high frequency among present-day populations of Yakutia, has been observed in 21.4% (6/28) of a sample of Kalmyk Khoshut,[131] and N-B525 is also widely observed among Mongols with low frequency. In addition, some members of a wide variety of other Y-DNA haplogroups have been found among present-day Mongols, including Haplogroup Q-M242, Haplogroup R-M207 (Haplogroup R1b-M478, Haplogroup R1b-M269, Haplogroup R1a-M17, Haplogroup R2a-M124), Haplogroup D-M174, Haplogroup J2a-M410, Haplogroup J1-Page8, Haplogroup G1-M285, and Haplogroup I2a2-M436.[133]
The maternal haplogroups are diverse but similar to other northern Asian populations, including Haplogroup D, Haplogroup C, Haplogroup B, and Haplogroup A, which are shared among indigenous American and Asian populations.[134]
Derenko et al. (2007) tested a sample of 47 Mongolians from Ulaanbaatar and found that they were 8/47 = 17.0% haplogroup C (including 4/47 = 8.5% C4, 3/47 = 6.4% C(xC1, C4, C5), and 1/47 = 2.1% C5), 7/47 = 14.9% haplogroup B (including 5/47 = 10.6% B4 and 2/47 = 4.3% B5), 6/47 = 12.8% haplogroup A(xA2, A5, A8), 5/47 = 10.6% haplogroup D4(xD2, D3), 5/47 = 10.6% haplogroup G (including 4/47 = 8.5% G2a and 1/47 = 2.1% G1), 4/47 = 8.5% haplogroup U (including 3/47 = 6.4% haplogroup U4 and 1/47 = 2.1% haplogroup K), 3/47 = 6.4% haplogroup F1, 2/47 = 4.3% haplogroup M8a2, 1/47 = 2.1% haplogroup M7, 1/47 = 2.1% haplogroup M9a, 1/47 = 2.1% haplogroup M10, 1/47 = 2.1% haplogroup Z, 1/47 = 2.1% haplogroup M*, 1/47 = 2.1% haplogroup N9a, and 1/47 = 2.1% haplogroup N1.[114]
Jin et al. (2009) tested another sample of 47 Mongolians from Ulaanbaatar and found that they were 10/47 = 21.3% haplogroup C, 8/47 = 17.0% haplogroup G (including 5/47 = 10.6% G2a, 2/47 = 4.3% G(xG1a, G2, G3), and 1/47 = 2.1% G3), 7/47 = 14.9% haplogroup F (including 3/47 = 6.4% F1a, 3/47 = 6.4% F1b, and 1/47 = 2.1% F2a), 6/47 = 12.8% haplogroup D4 (including 5/47 = 10.6% D4(xD4a, D4b) and 1/47 = 2.1% D4b), 4/47 = 8.5% haplogroup B (including 2/47 = 4.3% B4(xB4a, B4b, B4c), 1/47 = 2.1% B4a, and 1/47 = 2.1% B5b), 2/47 = 4.3% haplogroup A(xA5), 2/47 = 4.3% haplogroup M7 (including 1/47 = 2.1% M7b(xM7b1, M7b2) and 1/47 = 2.1% M7c(xM7c1)), 2/47 = 4.3% haplogroup N9a, 1/47 = 2.1% haplogroup M*, 1/47 = 2.1% haplogroup M8a, 1/47 = 2.1% haplogroup U5a, 1/47 = 2.1% haplogroup Y1, 1/47 = 2.1% haplogroup pre-Z, and 1/47 = 2.1% haplogroup Z.[107]
Bai et al. (2018) tested a sample of 35 Khalkha from Ulaanbaatar, Mongolia and found that they were 10/35 = 28.6% haplogroup M80'D, 6/35 = 17.1% haplogroup R11'B, 4/35 = 11.4% haplogroup C/Z, 4/35 = 11.4% haplogroup U/K, 2/35 = 5.7% haplogroup M12'G, 2/35 = 5.7% haplogroup R9'F, 2/35 = 5.7% haplogroup N9/Y, 2/35 = 5.7% haplogroup N/A, 1/35 = 2.9% haplogroup M7, 1/35 = 2.9% haplogroup M9, and 1/35 = 2.9% haplogroup R0'HV/H.[135]
West Eurasian mtDNA haplogroups Haplogroup HV, Haplogroup U, Haplogroup K, Haplogroup I, Haplogroup J, represents 14% in western Xingjang Mongolian, 10% in Mongolia, 8.4% in central Inner Mongolian samples, 2% in eastern Xin Barage Zuoqi County samples.[136] West Eurasian mtDNA haplogroups haplogroup U/K, haplogroup R0'HV/H represents 14.3% in Khalkha.
See also: Han Chinese |
A 2018 study calculated pairwise FST (a measure of genetic difference) based on genome-wide SNPs, among the Han Chinese (Northern Han from Beijing and Southern Han from Hunan, Jiangsu and Fujian provinces), Japanese and Korean populations sampled. It found that the smallest FST value was between Northern Han Chinese (Beijing) (CHB) and Southern Han (Hunan and Fujian etc.) Chinese (CHS) (FST[CHB-CHS] = 0.0014), while CHB and Korean (KOR) (FST[CHB-KOR] = 0.0026) and between KOR and Japanese (JPT) (FST[JPT-KOR] = 0.0033). Generally, pairwise FST between Han Chinese, Japanese and Korean (0.0026~ 0.0090) are greater than that within Han Chinese (0.0014). These results suggested Han Chinese, Japanese and Korean are different in terms of genetic make-up, and the differences among the three groups are much larger than that between northern and southern Han Chinese.[137]
Another study shows that the northern and southern Han Chinese are genetically closest to each other and it finds that the genetic characteristics of present-day northern Han Chinese were already formed prior to three thousand years ago in the Central Plain area.[138]
A recent genetic study on the remains of people (~4000 years BP) from the Mogou site in the Gansu-Qinghai (or Ganqing) region of China revealed more information on the genetic contributions of these ancient Di-Qiang people to the ancestors of the Northern Han. It was deduced that 3300–3800 years ago some Mogou people had merged into the ancestral Han population, resulting in the Mogou people being similar to some northern Han in sharing up to ~33% paternal (O3a) and ~70% maternal (D, A, F, M10) haplogroups. The mixture rate was possibly 13-18%.[139]
The estimated contribution of northern Han to southern Han is substantial in both paternal and maternal lineages and a geographic cline exists for mtDNA. As a result, the northern Han are the primary contributors to the gene pool of the southern Han. However, it is noteworthy that the expansion process was dominated by males, as is shown by a greater contribution to the Y-chromosome than the mtDNA from northern Han to southern Han. These genetic observations are in line with historical records of continuous and large migratory waves of northern China inhabitants escaping warfare and famine, to southern China. Aside from these large migratory waves, other smaller southward migrations occurred during almost all periods in the past two millennia.[101] A study by the Chinese Academy of Sciences into the gene frequency data of Han subpopulations and ethnic minorities in China, showed that Han subpopulations in different regions are also genetically quite close to the local ethnic minorities, meaning that in many cases, blood of ethnic minorities had mixed into Han, while at the same time, the blood of Han had also mixed into the local ethnic minorities.[140]
A recent, and to date the most extensive, genome-wide association study of the Han population, shows that geographic-genetic stratification from north to south has occurred and centrally placed populations act as the conduit for outlying ones.[141] Ultimately, with the exception in some ethnolinguistic branches of the Han Chinese, such as Pinghua, there is "coherent genetic structure" (homogeneity) in all Han Chinese.[142]
Y-chromosome haplogroup O2-M122 is a common DNA marker in Han Chinese, as it appeared in China in prehistoric times. It is found in more than 50% of Chinese males, with frequencies tending to be high toward the east of the country (30/101 = 29.7% Guangxi Pinghua Han,[142] 13/40 = 32.5% Guangdong Han,[57] 11/30 = 36.7% Lanzhou Han,[65] 26/60 = 43.3% Yunnan Han,[64] 251/565 = 44.4% Zhaotong Han,[143] 15/32 = 46.9% Yili Han,[65] 23/49 = 46.9% Lanzhou Han,[144][145] 32/65 = 49.2% South China Han,[146] 18/35 = 51.4% Meixian Han,[65] 22/42 = 52.4% Northern Han,[74] 43/82 = 52.4% Northern Han,[147] 18/34 = 52.9% Chengdu Han,[65] 154/280 = 55.0% Southern Han,[147] 27/49 = 55.1% Northern Han,[63] 73/129 = 56.6% North China Han,[146] 49/84 = 58.3% Taiwan Han,[57] 35/60 = 58.3% Taiwan Minnan,[148] 99/167 = 59.3% East China Han,[146] 33/55 = 60.0% Fujian Han,[148] 157/258 = 60.9% Taiwan Han,[148] 13/21 = 61.9% Taiwan Han,[63] 189/305 = 62.0% Zibo Han,[143] 23/35 = 65.7% Harbin Han,[65] 29/44 = 65.9% Northern Han,[57] 23/34 = 67.6% Taiwan Hakka,[148] 35/51 = 68.6% Beijing Han[64]).[149][150] Other Y-DNA haplogroups that have been found with notable frequency in samples of Han Chinese include O-P203 (15/165 = 9.1%, 47/361 = 13.0%), C-M217 (10/168 = 6.0%, 27/361 = 7.5%, 187/1730 = 10.8%, 20/166 = 12.0%), N-M231 (6/166 = 3.6%, 18/361 = 5.0%, 117/1729 = 6.8%, 17/165 = 10.3%), O-M268(xM95, M176) (54/1147 = 4.7%,[151] 8/168 = 4.8%, 23/361 = 6.4%, 12/166 = 7.2%), and Q-M242 (2/168 = 1.2%, 49/1729 = 2.8%, 12/361 = 3.3%, 48/1147 = 4.2%[151]). However, the mitochondrial DNA (mtDNA) of Han Chinese increases in diversity as one looks from northern to southern China, which suggests that male migrants from northern China married with women from local peoples after arriving in modern-day Guangdong, Shanghai, Nanjing, Fujian, and other regions of southern China.[101][152] Despite this, tests comparing the genetic profiles of northern Han, southern Han and southern natives determined that haplogroups O1b-M110, O2a1-M88 and O3d-M7, which are prevalent in southern natives, were only observed in some southern Han (4% on average), but not in northern Han. Therefore, this proves that the male contribution of southern natives in southern Han is limited, assuming that the frequency distribution of Y lineages in southern natives represents that before the expansion of Han culture that started two-thousand years ago.[101] In contrast, there are consistent strong genetic similarities in the Y chromosome haplogroup distribution between the southern and northern Chinese population, and the result of principal component analysis indicates almost all Han populations form a tight cluster in their Y chromosome. However, other research has also shown that the paternal lineages Y-DNA O-M119,[153] O-P201,[154] O-P203[154] and O-M95[155] are found in both southern Han Chinese and South Chinese minorities, but more commonly in the latter. In fact, these paternal markers are in turn less frequent in northern Han Chinese.[156][157]
The mitochondrial-DNA haplogroups of the Han Chinese can be classified into the northern East Asian-dominating haplogroups, including A, C, D, G, M8, M9, and Z, and the southern East Asian-dominating haplogroups, including B, F, M7, N*, and R.[101] These haplogroups account for 52.7% and 33.85% of those in the Northern Han, respectively. Therefore, haplogroups D is the most (Haplogroups D is the most(Haplogroup D is the modal mtDNA haplogroup among northern East Asians (Manchus, Tuoba Xianbei, Oroqens, Manchurian Evenks). Among these haplogroups, D, B, F, and A were predominant in the Northern Han, with frequencies of 25.77%, 11.54%, 11.54%, and 8.08%, respectively. However, in the Southern Han, the northern and southern East Asian-dominating haplogroups accounted for 35.62% and 51.91%, respectively. The frequencies of haplogroups D, B, F, and A reached 15.68%, 20.85%, 16.29%, and 5.63%, respectively.[138][158][159][160][161]
See also: Tibetan people |
Modern Tibetan populations are genetically most similar to other modern East Asian populations.[162] They also show more genetic affinity for modern Central Asian than modern Siberian populations.[162]
A 2016 study found that the Tibetan gene pool diverged from that of Han Chinese around 15,000 years ago, which can be largely attributed to post-LGM (Last Glacial Maximum) arrivals. Analysis of around 200 contemporary populations showed that Tibetans share ancestry with populations from East Asia (~82%), Central Asia and Siberia (~11%), South Asia (~6%), and western Eurasia and Oceania (~1%). These results support that Tibetans arose from a mixture of multiple ancestral gene pools but that their origins are much more complicated and ancient than previously suspected.[162]
A study in 2010 suggested that the majority of the Tibetan gene pool may have diverged from the Zang around 15,000 years ago.[163] However, there are possibilities of much earlier human inhabitation of Tibet,[164][165] and these early residents may have contributed to the modern Tibetan gene pool.[166]
The date of divergence between Tibetans and Sherpas was estimated to have taken place around 11,000 to 7,000 years ago.[162]
After modern Oceanic populations, modern Tibetan populations show the highest rate of allele sharing with archaic hominins at over 6%.[162] Modern Tibetans show genetic affinities to three archaic populations: Denisovans, Neanderthals, and an unidentified archaic population.[162]
In comparison to modern Han populations, modern Tibetans show greater genetic affinity to Denisovans; however, both the Han and Tibetans have similar ratios of genetic affinity to general Neanderthal populations.[162]
Modern Tibetans were identified as the modern population that has the most alleles in common with Ust'-Ishim man.[162]
The distribution of Haplogroup D-M174 (subclade Haplogroup D-Z27276) is found among nearly all the populations of Central Asia and Northeast Asia south of the Russian border, although generally at a low frequency of 2% or less. A dramatic spike in the frequency of D-M174 occurs as one approaches the Tibetan Plateau. D-M174 is also found at high frequencies among Japanese people, but it fades into low frequencies in Korea and China proper between Japan and Tibet. The claim that the Navajo people and Tibetans are related, while discussed among linguists since Edward Sapir, has not found support in genetic studies. Some light has been shed on their origins, however, by one genetic study in which it was indicated that Tibetan Y-chromosomes had multiple origins, one from Central Asia and the other from East Asia.[167]
See also: Turkic peoples and Genetic studies on Turkish people |
The Turkic peoples are a collection of ethno-linguistic groups of Central-, Eastern-, Northern- and Western-Asia as well as parts of Europe and North Africa. They speak related languages belonging to the Turkic language family.
Proposals for the homeland of the Turkic peoples and their language are far-ranging, from the Transcaspian steppe to Northeastern Asia (Manchuria).[168]
According to Yunusbayev, genetic evidence points to an origin in the region near South Siberia and Mongolia as the "Inner Asian Homeland" of the Turkic ethnicity.[169]
More recent genetic and archeologic studies suggest that the Turkic peoples were descended from a Transeurasian agricultural community based in northeast China, which is to be associated with the Xinglongwa culture and the succeeding Hongshan culture.[170] The East Asian agricultural origin of the Turkic peoples has been corroborated in multiple recent studies. Around 2,200 BC, due to the desertification of northeast China, the agricultural ancestors of the Turkic peoples probably migrated westwards into Mongolia, where they adopted a pastoral lifestyle.[171][172][173]
Linguistic and genetic evidence strongly suggests an early presence of Turkic peoples in eastern Mongolia.[174][175] Genetic studies have shown that the early Turkic peoples were of predominantly East Asian origins with medieval Turkic samples being more heterogeneous, and that Turkic culture and language was spread westwards through elite dominance.[176][173] The genetic evidence suggests that the Turkification of Central Asia was carried out by East Asian dominant minorities migrating out of Mongolia.[177]
Common Y-DNA haplogroups in Turkic peoples are Haplogroup N-M231 (found with especially high frequency among Turkic peoples living in present-day Russia), Haplogroup C-M217 (especially in Central Asia and, in particular, Kazakhstan), Haplogroup Q-M242 (especially in Southern Siberia and among Turkmens and the Qangly tribe of Kazakhs), and Haplogroup O-M175 (especially among Turkic peoples living in present-day China and the Naiman tribe of Kazakhs). Some groups also have Haplogroup R1b (notably frequent among the Teleuts and Kumandins of Southern Siberia, the Bashkirs of the Southern Ural region of Russia, and the Qypshaq tribe of Kazakhs), Haplogroup R1a (notably frequent among the Kyrgyz, Altaians, and several other Turkic peoples living in present-day Russia), Haplogroup J-M172 (especially frequent among Uyghurs, Uzbeks, Azerbaijanis, and Turkish people), and Haplogroup D-M174 (especially among Yugurs, but also observed regularly with low frequency among Uyghurs, Southern Altaians, Nogais, Kazakhs, and Uzbeks).[178][179]
See also: History of Central Asia § Medieval |
The genetic evidence suggests that the Turkification of Central Asia was carried out by East Asian dominant minorities migrating out of Mongolia.[180] According to a recent study, the Turkic Central Asian populations, such as Kyrgyz, Kazakhs, Uzbeks, and Turkmens share more of their gene pool with various East Asian and Siberian populations than with West Asian or European populations. The study further suggests that both migration and linguistic assimilation helped to spread the Turkic languages in Eurasia.[175]
Genetic data suggests that Siberia was populated during the Terminal Upper-Paleolithic (36+-1,5ka) period from a distinct Paleolithic population migrating through Central Asia into Northern Siberia. This population is known as Ancient North Eurasians or Ancient North Siberians.
Between 30,000 to 25,000 years ago, the ancestors of both Paleo-Siberians and Native Americans originated from admixture between Ancient North Eurasians/Siberians and an Ancient East Asian lineage.[181][182] Ancestral Native Americans (or Ancient Beringians) later migrated towards the Beringian region, became isolated from other populations, and subsequently populated the Americas. Further geneflow from Northeast Asia resulted in the modern distribution of "Neo-Siberians" (associated with 'Altaic speakers') through the merger of Paleo-Siberians with Northeast Asians.[183][184][185]
Overall, while Northern Asians cluster closely to East Asians, they are shifted into a distinct position. "Analyses of all 122 populations confirm many known relationships and show that most populations from North Asia form a cluster distinct from all other groups. Refinement of analyses on smaller subsets of populations reinforces the distinctiveness of North Asia and shows that the North Asia cluster identifies a region that is ancestral to Native Americans."[186]
Multiple studies suggests that all Native Americans ultimately descended from a single founding population that initially diverged from" Ancestral Beringians" which shared a common origin with Paleo-Siberians from the merger of Ancient North Eurasians and a Basal-East Asian source population in Mainland Southeast Asia around 36,000 years ago, at the same time at which the proper Jōmon people split from Basal-East Asians, either together or during a separate expansion wave. The basal northern and southern Native American branches, to which all other Indigenous peoples belong, diverged around 16,000 years ago, although earlier dates were also proposed.[187][86] An indigenous American sample from 16,000BC in Idaho, which is craniometrically similar to modern Native Americans, was found to have been closely related to Paleosiberians, confirming that Ancestral Native Americans split from an ancient Siberian source population somewhere in northeastern Siberia. Genetic data on samples with alleged "Paleo-Indian" morphology turned out to be closely related to contemporary Native Americans, disproving a hypothetical earlier migration into the Americas. The scientists suggest that variation within Native American morphology is just that, the natural variation which have arisen during the formation of Ancestral Native Americans. Signals of a hypothetical "population Y", if not a false positive, are likely explained through a now extinct population from East Asia (eg. Tianyuan man, which contributed low amounts of ancestry to the Ancestral Native American gene pool in Asia, and perhaps also towards other Asian and Oceanian populations.[188][187][189][190][191]
Main article: Genetic history of South Asia |
Genomic studies have described the genetic landscape of South Asia as a composite of West-Eurasian and East Asian exogenous components that mixed with the indigenous[193] South Asian groups (termed Ancient Ancestral South Indians ("AASI"), who are genetically isolated from other populations around 45-40kya, and distantly related to the Andamanese people, Aboriginal Australians and East Asians) to create modern-day South Asians.[194][195] The East Asian-related ancestry component forms the major ancestral component among Sino-Tibetan speaking populations in the Northeast India including Himalayan foothills, and is generally distributed throughout the rest of the Indian subcontinent at lower frequency but found substantially among Austroasiatic-speaking groups, as well as among Sinhalese and Bengalis.[196][197][198][199][200][201][202]
According to a genetic research (2015) including linguistic analyses, suggests an East Asian origin for proto-Austroasiatic groups, which first migrated to Southeast Asia and later into India.[203] According to Ness, there are three broad theories on the origins of the Austroasiatic speakers, namely northeastern India, central or southern China, or southeast Asia.[204] Multiple researches indicate that the Austroasiatic populations in India are derived from (mostly male dominated) migrations from Southeast Asia during the Holocene.[205][202][206][203][207][208] According to Van Driem (2007), "...the mitochondrial picture indicates that the Munda maternal lineage derives from the earliest human settlers on the Subcontinent, whilst the predominant Y chromosome haplogroup argues for a Southeast Asian paternal homeland for Austroasiatic language communities in India."[205]: 7
According to Chaubey et al. (2011), "Austroasiatic speakers in India today are derived from dispersal from Southeast Asia, followed by extensive sex-specific admixture with local Indian populations."[202] According to Zhang et al. (2015), Austroasiatic (male) migrations from southeast Asia into India took place after the lates Glacial maximum, circa 4000 years ago.[203] According to Arunkumar et al. (2015), Y-chromosomal haplogroup O2a1-M95, which is typical for Austroasiatic speaking peoples, clearly decreases from Laos to east India, with "a serial decrease in expansion time from east to west," namely "5.7 ± 0.3 Kya in Laos, 5.2 ± 0.6 in Northeast India, and 4.3 ± 0.2 in East India." This suggests "a late Neolithic east to west spread of the lineage O2a1-M95 from Laos."[207][209] According to Riccio et al. (2011), the Munda people are likely descended from Austroasiatic migrants from southeast Asia.[206][210] According to Ness, the Khasi probably migrated into India in the first millennium BCE.[204]
According to Yelmen et al. 2019, the two main components of Indian genetic variation; the South Asian populations that "separated from East Asian and Andamanese populations" form one of the deepest splits among non-african groups compared to the West Eurasian component because of "40,000 years of independent evolution".[211]
Geneflow from Southeast Asians (particularly Austroasiatic groups) to South Asian peoples is associated with the introduction of rice-agriculture to South Asia. There is significant cultural, linguistic, and political Mon-Khmer (Austroasiatic) influence on early India, which can also be observed by the presence of Austroasiatic loanwords within Indo-Aryan languages.[212][213]
A 2020 genetic study about Southeast Asian populations, found that mostly all Southeast Asians are closely related to East Asians and have mostly "East Asian-related" ancestry.[214][18]
Ancient remains of hunter-gatherers in Maritime Southeast Asia, such as one Holocene hunter-gatherer from South Sulawesi, had ancestry from both, the South-Eurasian lineage (represented by Papuans and Aboriginal Australians), and the East-Eurasian lineage (represented by East Asians and or Andamanese Onge). The hunter-gatherer individual had approximately ~50% "Basal-East Asian" ancestry, and was positioned in between modern East Asians and Papuans of Oceania. The authors concluded that East Asian-related ancestry expanded from Mainland Southeast Asia into Maritime Southeast Asia much earlier than previously suggested, as early as 25,000BC, long before the expansion of Austroasiatic and Austronesian groups.[215]
A 2022 genetic study confirmed the close link between East Asians and Southeast Asians, which the authors term "East/Southeast Asian" (ESEA) populations, and also found a low but consistent proportion of South Asian-associated "SAS ancestry" (best samplified by modern Bengalis) among specific Mainland Southeast Asian (MESA) ethnic groups (~2–16% as inferred by qpAdm), likely as a result of cultural diffision; mainly of South Asian merchants spreading Hinduism and Buddhism among the Indianized kingdoms of Southeast Asia. The authors however caution that Bengalis harbor significant East Asian ancestry, which may affect the estimation of shared haplotypes. Overall, the geneflow event is estimated to have happened between 500–1000 YBP.[216]
Genetic studies have revealed that Aboriginal Australians largely descended from an Eastern Eurasian population wave, and are most closely related to other Oceanians, such as Melanesians such as Papuans and also show affinity to other Asian and Pacific populations, such as Negritos, Ancient Ancestral South Indians and East Asian peoples. Phylogenetic data suggests that an early initial eastern non-African lineage (ENA) or East-Eurasian meta-population trifurcated, and gave rise to Australasians (Oceanians), the Ancient Ancestral South Indians/Andamanese, and the East/Southeast Asian lineage including the ancestors of Native Americans.[195]
Phylogenetic data suggests that an early initial Eastern Eurasian (eastern non-African ENA) meta-population trifurcated, and gave rise to Australasians (Oceanians), the AASI, as well as East/Southeast Asians, although Papuans may have also received some geneflow from an earlier group (xOoA),[195][23] around 2%,[217] next to additional archaic admixture in the Sahul region. Vallini et al. 2022 study confirmed the position of Papuans, who belong to the broader Melanesian peoples of the Pacific Islands, as a sister lineage of East Asians: Papuan ancestry can be modelled either as an almost unadmixed sister lineage of East Asians (with potential minimal ingressions from basal Out-of-Africa populations) or as an approximately even mixture of East Asians and a basal East-Eurasian lineage that had split prior to the expansion of East-Eurasian ancestry into East Asia and Oceania.[23]