Genetic studies on the Sinhalese are part of population genetics, investigating the origins of the Sinhalese population.
All studies agree that the Sinhalese have a significant genetic relationship with the Bengalis and South Indian Tamils, and that Sri Lankan Tamils and Sinhalese are also significantly related. This is supported by a genetic distance study, which showed small genetic distances between the Sinhalese and the Bengali, Tamil, and Keralite volunteers.
An Alu polymorphism analysis by Mastana S (2007) using Sinhalese, Tamil, Bengali, Gujarati (Patel), and Punjabi as parental populations found the following proportions of genetic contribution. The Sinhalese sample comprised 121 individuals. [2]
Statistical Method | Bengali | Tamil | North Western |
---|---|---|---|
Point Estimate | 57.49% | 42.5% | - |
Maximum Likelihood Method | 88.07% | - | - |
Using Tamil, Bengali and North West as parental population | 50-66% | 11-30% | 20-23% |

Parental population | Bengali | Tamil | Gujarati | Punjabi |
---|---|---|---|---|
Using Tamil and Bengali as parental population | 70.03% | 29.97% | - | - |
Using Tamil, Bengali and Gujarati as parental population | 71.82% | 16.38% | 11.82% | - |
Using Bengali, Gujarati and Punjabi as parental population | 82.09% | - | 15.39% | 2.52% |
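
As a quick consistency check on the parental-population rows above, the following sketch sums the reported contributions for each model and picks out the largest contributor. The percentages are copied from the table; the names `MODELS`, `total_contribution` and `largest_contributor` are illustrative and do not come from the cited study.

```python
# Admixture estimates for the Sinhalese, copied from the table above (percent).
# A parental population that was not part of a model is simply left out.
MODELS = {
    "Tamil + Bengali": {"Bengali": 70.03, "Tamil": 29.97},
    "Tamil + Bengali + Gujarati": {"Bengali": 71.82, "Tamil": 16.38, "Gujarati": 11.82},
    "Bengali + Gujarati + Punjabi": {"Bengali": 82.09, "Gujarati": 15.39, "Punjabi": 2.52},
}


def total_contribution(proportions):
    """Sum of all parental contributions in one model, rounded to 2 decimals."""
    return round(sum(proportions.values()), 2)


def largest_contributor(proportions):
    """Name of the parental population with the highest estimated contribution."""
    return max(proportions, key=proportions.get)


for name, proportions in MODELS.items():
    print(
        f"{name}: total = {total_contribution(proportions):.2f}%, "
        f"largest = {largest_contributor(proportions)}"
    )
```

Each model should total roughly 100%, with small deviations attributable to rounding in the published figures.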
Analysis of X chromosome STRs by Perera et al. (2021) found the Sinhalese (as well as Sri Lankan Tamils and Sri Lankan Muslims) to be more closely related to Bengalis than to the Indian Tamils of Sri Lanka. [3]
Genetic distance analysis by Kirk (1976) found the Sinhalese to be closer to the Bengalis than they are to populations in Gujarat or the Punjab. [4]
Allele frequencies at D1S80 (a locus commonly used for genetic fingerprinting) are also similar between the Sinhalese and Bengalis, suggesting the two groups are closely related. [5]
The frequency of the MTHFR 677T allele in the Sinhalese (13%) is also similar to that in West Bengalis (17%). [6] [7]
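
To give a sense of how allele frequencies translate into a genetic distance, here is a minimal sketch that applies Nei's standard genetic distance to the two MTHFR 677T frequencies quoted above. This is an assumption for illustration only: the text does not say which distance measure the cited studies used, and real analyses average over many loci rather than a single biallelic marker. The function name `nei_standard_distance` is hypothetical.

```python
import math


def nei_standard_distance(freqs_x, freqs_y):
    """Nei's standard genetic distance D = -ln(I) for a single locus.

    freqs_x and freqs_y list the allele frequencies of the same alleles,
    in the same order, in two populations. I is the normalized identity
    Jxy / sqrt(Jx * Jy).
    """
    j_xy = sum(x * y for x, y in zip(freqs_x, freqs_y))
    j_x = sum(x * x for x in freqs_x)
    j_y = sum(y * y for y in freqs_y)
    return -math.log(j_xy / math.sqrt(j_x * j_y))


# MTHFR 677T frequencies from the text: 13% (Sinhalese), 17% (West Bengalis).
# The second allele class is everything that is not 677T.
sinhalese = [0.13, 0.87]
west_bengali = [0.17, 0.83]

distance = nei_standard_distance(sinhalese, west_bengali)
print(f"Single-locus Nei distance: {distance:.5f}")
```

A value close to zero indicates similar allele frequencies at this locus; on its own it says little about overall population relatedness.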
A genetic admixture study by Kshatriya (1995) found the Sinhalese to have a higher contribution from Indian Tamils (69.86% ± 0.61) than from the Bengalis (25.41% ± 0.51). [8]
Genetic distance analysis by Roychoudhury AK et al. (1985) suggested the Sinhalese are more closely related to South and West Indian populations than to the Bengalis. [9]
Genetic distance analysis by Kirk (1976) suggested the Sinhalese are closer to the Tamils and Keralites of South India than they are to the populations in Gujarat or the Punjab. [4]
A 2023 study by Singh et al., using higher-resolution markers than previous studies, found higher gene flow to the Sinhalese from South India than from North India, with the Sinhalese sharing the highest identity by descent with Tamils, especially Piramalai Kallars, compared with the other Indian populations studied. The study also found heightened sharing with the Maratha of India, consistent with a West Eurasian contribution. This excess sharing of segments suggests common roots of Sinhala with the Marāṭhā, corroborating the linguistic hypothesis of Lazarus Geiger, Ralph Lilley Turner, and George van Driem. The total Sinhalese sample size used was 9 individuals. [10]
An Alu polymorphism analysis by Mastana S (2007) found a North West Indian contribution (20-23%). [11]
Analysis of X chromosome STRs by Perera et al. (2011) showed that the Sinhalese, Sri Lankan Tamils, Moors and Indian Tamils of Sri Lanka share affinities with the Bhil (an Indigenous group) of North West India. [12]
A study of genetic variation in the FUT2 gene in the Sinhalese and Sri Lankan Tamil populations found similar genetic backgrounds for both ethnic groups, with little gene flow from other neighbouring Asian population groups. [13] Studies have also found no significant difference in blood group, blood genetic markers (Saha, 1988) and single-nucleotide polymorphisms between the Sinhalese and other ethnic groups in Sri Lanka. [14] [15] [16] Another study has also found "no significant genetic variation among the major ethnic groups in Sri Lanka". [17] This is further supported by a study which found very similar frequencies of the alleles MTHFR 677T, F2 20210A and F5 1691A in Indian Tamil, Sinhalese, Sri Lankan Tamil, and Sri Lankan Moor populations. [7]
A 1985 genetic distance study by Roychoudhury AK and Nei M showed that the Sinhalese, along with four Indian subcontinent populations from Punjab, Gujarat, Andhra Pradesh, and Bangladesh, were closer to Afghans and Iranians than to the neighbouring East and Southeast Asian groups represented by the Bhutanese, Malays, Bataks of northern Sumatra, and the Chinese. [9]
Genetic markers of immunoglobulin among the Sinhalese show high frequencies of afb1b3, which has its origins in the Yunnan and Guangxi provinces of southern China. [18] It is also found at high frequencies among Odias, certain Nepali and Northeast Indian populations, southern Han Chinese, Southeast Asians and certain Austronesian populations of the Pacific Islands. [18] At a lower frequency, ab3st is also found among the Sinhalese; it is generally found at higher frequencies among northern Han Chinese, Tibetan, Mongolian, Korean and Japanese populations. [18] The Transferrin TF*Dchi allele, which is common among East Asian and Native American populations, is also found among the Sinhalese. [9] HumDN1*4 and HumDN1*5 are the predominant DNase I genes among the Sinhalese and are also the predominant genes among southern Chinese ethnic groups and the Tamang people of Nepal. [19] A 1988 study by N. Saha showed that the high GC*1F and low GC*1S frequencies among the Sinhalese are comparable to those of the Chinese, Japanese, Koreans, Thais, Malays, Vietnamese, Laotians and Tibetans. [20] Hemoglobin E, a variant of normal hemoglobin which originated in and is prevalent among populations in Southeast Asia, is also common among the Sinhalese and can reach a frequency of up to 40% in Sri Lanka. [21]
Ranaweera et al. (2014) found the most common mtDNA haplogroups in the Sinhalese to be Haplogroup M, Haplogroup U (U7a), Haplogroup R and Haplogroup G (G3a1′2). [22] [23]
Haplogroup M represents the dispersal of modern humans around 60,000 years ago along the southern Asian coastline, following a southern coastal route across Arabia and India to reach Australia shortly after. [24]
Haplogroup U7 is considered a West Eurasian–specific mtDNA haplogroup, believed to have originated in the Black Sea area approximately 30,000 years ago. In South Asia, U7 occurs at a frequency of about 12% in Gujarat, around 2% across India as a whole, and 5% in Pakistan. In the Vedda people of Sri Lanka it reaches its highest frequency of 13.33% (subclade U7a). It is speculated that large-scale immigration carried these mitochondrial haplogroups into India. [25]
Kivisild et al. (2003) found that the most common Y-chromosome DNA haplogroups in the Sinhalese are, in descending order of frequency, Haplogroup R2, Haplogroup L, Haplogroup R1a and Haplogroup F. [26]
Population | n | C | E | F | G | H | I | J | K | L | N | O | P | Q | R | R1 | R1a | R1b | R2 | T | Others | Reference |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sinhalese | 39 | 0 | 0 | 10.3% | 0 | 10.3% | 0 | 10.3% | 0 | 18% | 0 | 0 | 0 | 0 | 0 | 0 | 12.8% | 0 | 38.5% | 0 | | Kivisild2003 [26] |
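
Because the sample is small (n = 39), the percentages in the table correspond to whole-number counts of individuals. The sketch below, with illustrative names (`FREQ_PERCENT`, `implied_counts`) that are not from the cited study, recovers those implied counts by rounding and ranks the haplogroups. The counts are inferred from the published percentages, not reported directly in the text.

```python
# Sample size and non-zero haplogroup frequencies (percent) from the table above.
N_SAMPLED = 39
FREQ_PERCENT = {
    "F": 10.3,
    "H": 10.3,
    "J": 10.3,
    "L": 18.0,
    "R1a": 12.8,
    "R2": 38.5,
}


def implied_counts(freq_percent, n):
    """Convert percentage frequencies to the nearest whole number of individuals."""
    return {hg: round(pct / 100 * n) for hg, pct in freq_percent.items()}


counts = implied_counts(FREQ_PERCENT, N_SAMPLED)

# Rank haplogroups from most to least frequent (ties keep table order).
ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)

for haplogroup, count in ranked:
    print(f"{haplogroup}: about {count} of {N_SAMPLED} individuals")
print("Total implied individuals:", sum(counts.values()))
```

If the implied counts add up to the sample size, the percentages are internally consistent. With so few individuals, each person shifts a frequency by roughly 2.6 percentage points, so the ranking of the less common haplogroups should be read with caution.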
Chaubey states that "considerable number of maternal lineages of Sri Lanka is shared with India, more precisely with southern part of India." [27]