The hypothesized migration routes of the ancestors of the Sinhalese and other ethnic groups into Sri Lanka.
The hypothesized migration routes of the ancestors of the Sinhalese and other ethnic groups into Sri Lanka.

Genetic studies on the Sinhalese is part of population genetics investigating the origins of the Sinhalese population.

All studies agree that there is a significant relationship between the Sinhalese and the Bengalis and Tamils, and that there is a significant genetic relationship between Sri Lankan Tamils and Sinhalese, them being closer to each other than other South Asian populations. This is also supported by a genetic distance study, which showed low differences in genetic distance between the Sinhalese and the Bengali, Tamil, and Keralite volunteers.[1]

According to a study published in 2021 using 16 X-chromosomal short tandem repeat markers (STRs), there was no genetic subdivision detected between Sinhalese, Moors and Sri Lankan Tamils while Indian Tamils were having a subtle but statistically significant difference. The observed close relationship between Moors and Sinhalese maybe explained by the matrimonial bonds made by Moor males with Sinhalese females during their original settlement in Sri Lanka. Further, the phylogram generated for the four main ethnic groups of Sri Lanka was suggestive of an Indian origin for Moors compared to the Arabic origin speculated by some.

Relationship to Bengalis

Genetic26526 admixture of Sinhalese by Dr. Saha Papiha
Genetic26526 admixture of Sinhalese by Dr. Saha Papiha

An Alu polymorphism analysis of Sinhalese from Colombo by Dr Sarabjit Mastanain in 2007 using Tamil, Bengali, Gujarati (Patel), and Punjabi as parental populations found different proportions of genetic contribution:[2]

Statistical Method Bengali Tamil North Western
Point Estimate 57.49% 42.5% -
Maximum Likelihood Method 88.07% - -
Using Tamil, Bengali and North West as parental population 50-66% 11-30% 20-23%
Parental population Bengali Tamil Gujarati Punjabi
Using Tamil and Bengali as parental population 70.03% 29.97% -
Using Tamil, Bengali and Gujarati as parental population 71.82% 16.38% 11.82%
Using Bengali, Gujarati and Punjabi as parental population 82.09% - 15.39% 2.52%

D1S80 allele frequency (a popular allele for genetic fingerprinting) is also similar between the Sinhalese and Bengalis, suggesting the two groups are closely related.[3] The Sinhalese also have similar frequencies of the allele MTHFR 677T (13%) to West Bengalis (17%).[4][5]

A test for Y-chromosome DNA haplogroups conducted by Dr Toomas Kivisild on Sinhalese of Sri Lanka has shown that 23% of the subjects were R1a1a (R-SRY1532) positive.[6] Also in the same test 24.1% of the subjects were R2 positive as subclades of Haplogroup P (92R7).[6] Haplogroup R2 is also found in a considerable percentage among Bengali of India. Sample size used was 87 subjects.

Genetic distance of Sinhalese to other ethnic groups, according to an Alu Polymorphism analysis.
Genetic distance of Sinhalese to other ethnic groups, according to an Alu Polymorphism analysis.

A study in 2007 found similar frequencies of the allele HLA-A*02 in sinhalese (7.4%) and North Indian subjects (6.7%). HLA-A*02 is a rare allele which has a relatively high frequency in North Indian populations and is considered to be a novel allele among the North Indian population. This suggests possible North Indian origin of the Sinhalese.[7]

Relationship to Tamils

Main article: Genetic studies on Sri Lankan Tamils

Genetic admixture of Sinhalese by Dr. Gautam K. Kshatriya
Genetic admixture of Sinhalese by Dr. Gautam K. Kshatriya

Another study by GK Kshatriya conducted in 1995 assessing the 'Genetic affinities of Sri Lankan Populations' found a large genetic contribution from the Tamils of South India, as well as from the Bengali and Vedda populations.[8]

Parental population Tamil Bengali Vedda
Using Tamil, Bengali and Vedda as parental population 69.86% 25.41% 4.73%

Dr. Sarabjit Mastanain finding states cophenetic correlation was 0.8956 and it indicates Sinhalese & Tamil as native population. Also, it reflects on genetic distance among five populations of Sri Lanka as per given below eigenvector plot of the R-matrix.[9]

Genetic distance






(5 populations of Sri Lanka)

Relationship to other ethnic groups in Sri Lanka

A study looking at genetic variation of the FUT2 gene in the Sinhalese and Sri Lankan Tamil population, found similar genetic backgrounds for both ethnic groups, with little genetic flow from other neighbouring Asian population groups.[10] Studies have also found no significant difference with regards to blood group, blood genetic markers and single-nucleotide polymorphism between the Sinhalese and other ethnic groups in Sri Lanka.[11][12][13] Another study has also found "no significant genetic variation among the major ethnic groups in Sri Lanka".[14] This is further supported by a study which found very similar frequencies of alleles MTHFR 677T, F2 20210A & F5 1691A in Indian Tamil, Sinhalese, Sri Lankan Tamil, and Sri Lankan Moor populations.[5]

A genetic study carried out in 2015 by Lian dang et. on origin of Malay people and other populations of Sri Lanka involving 200 Sinhalese people, 103 Tamil people of Sri Lankan origin, 200 Tamil people of Indian origin and 35 Burgher people calculated the averaged genetic makeup across individuals of each population,[15] which show substantially higher amount of Central Asian ancestry and low South Asian ancestry among Sinhalese compared to both Tamil groups.

Relationship to East and Southeast Asians

Genetic studies show that the Sinhalese have received some genetic flow from neighboring populations in East Asia and Southeast Asia, such as from the ethnically diverse and disparate Tibeto-Burman peoples and Austro-Asiatic peoples,[16] which is due to their close genetic links to Northeast India.[17][18][19] A 1985 study conducted by Roychoudhury AK and Nei M, indicated the values of genetic distance showed that the Sinhalese people were slightly closer to Mongoloid populations due to gene exchange in the past.[20][21] In regards to comparisons of root and canal morphology of Sri Lankan mandibular molars, it showed that they were further away from Mongoloid populations.[22] Among haplogroups found in East Asian populations, a lower frequency of East Asian mtDNA haplogroup, G has been found among the populations of Sri Lanka alongside haplogroup D in conjunction with the main mtDNA haplogroup of Sri Lanka's ethnic groups, haplogroup M.[23] In regards to Y-DNA, Haplogroup C-M130 is found at low to moderate frequencies in Sri Lanka.[24]

Genetic markers of immunoglobulin among the Sinhalese show high frequencies of afb1b3 which has its origins in the Yunnan and Guangxi provinces of southern China.[25] It is also found at high frequencies among Odias, certain Nepali and Northeast Indian, southern Han Chinese, Southeast Asian and certain Austronesian populations of the Pacific Islands.[25] At a lower frequency, ab3st is also found among the Sinhalese and is generally found at higher frequencies among northern Han Chinese, Tibetan, Mongolian, Korean and Japanese populations.[25] The Transferrin TF*Dchi allele which is common among East Asian and Native American populations is also found among the Sinhalese.[20] HumDN1*4 and HumDN1*5 are the predominant DNase I genes among the Sinhalese and are also the predominant genes among southern Chinese ethnic groups and the Tamang people of Nepal.[26] A 1988 study conducted by N. Saha, showed the high GC*1F and low GC*1S frequencies among the Sinhalese are comparable to those of the Chinese, Japanese, Koreans, Thais, Malays, Vietnamese, Laotians and Tibetans.[27] A 1998 study conducted by D.E. Hawkey showed dental morphology of the Sinhalese is closely related to those of the Austro-Asiatic populations of East and Northeast India.[16] Hemoglobin E a variant of normal hemoglobin, which originated in and is prevalent among populations in Southeast Asia, is also common among the Sinhalese and can reach up to 40% in Sri Lanka.[28]

Relationship to other populations in regards to X-STR loci

A 2021 study focusing on 16 studied X-STR loci, compared four Sri Lankan ethnicities (Sinhalese, Sri Lankan Tamils, Indian Tamils, Moors) with 14 other world populations (Bhil India, Bangladesh, Malaysia, Thailand, China, Japan, Taiwan, Germany, Italy, Sweden, Denmark, North Portugal, Somalia, and Ivory Coast) with eight X chromosome based STR markers using a multidimensional scaling plot (MDS plot), it revealed that Sri Lankans were clustered together not only with South Asians like Indians and Bangladeshis, but also with Europeans. However, allelic distribution of many X-STR loci in Sri Lankan ethnic groups differ from European, Southeast Asian, East Asian and African populations and are most similar to the two Indian populations and Bangladeshi population included in the study.[29]


  1. ^ Kirk RL (July 1976). "The legend of Prince Vijaya – a study of Sinhalese origins". American Journal of Physical Anthropology. 45 (1): 91–99. doi:10.1002/ajpa.1330450112.
  2. ^ Mastana S (2007). "Molecular anthropology: population and forensic genetic applications" (PDF). Anthropologist Special. 3: 373–383.
  3. ^ Surinder Singh Papiha (1999). Genomic Diversity: Applications in Human Population Genetics. London: Springer. 7.
  4. ^ Mukhopadhyay K, Dutta S, Das Bhomik A (January 2007). "MTHFR gene polymorphisms analyzed in population from Kolkata, West Bengal". Indian Journal of Human Genetics. 13 (1): 38. doi:10.4103/0971-6866.32035. PMC 3168154. PMID 21957342.
  5. ^ a b Dissanayake VH, Weerasekera LY, Gammulla CG, Jayasekara RW (October 2009). "Prevalence of genetic thrombophilic polymorphisms in the Sri Lankan population--implications for association study design and clinical genetic testing services". Experimental and Molecular Pathology. 87 (2): 159–62. doi:10.1016/j.yexmp.2009.07.002. PMID 19591822.
  6. ^ a b Kivisild T, Rootsi S, Metspalu M, Metspalu E, Parik J, Kaldma K, et al. (2003). "The Genetics of Language and Farming Spread in India" (PDF). In Bellwood P, Renfrew C (eds.). Examining the farming/language dispersal hypothesis. Cambridge, United Kingdom: McDonald Institute for Archaeological Research. pp. 215–222.
  7. ^ Malavige GN, Rostron T, Seneviratne SL, Fernando S, Sivayogan S, Wijewickrama A, Ogg GS (October 2007). "HLA analysis of Sri Lankan Sinhalese predicts North Indian origin". International Journal of Immunogenetics. 34 (5): 313–5. doi:10.1111/j.1744-313X.2007.00698.x. PMID 17845299. S2CID 13210660.
  8. ^ Kshatriya GK (December 1995). "Genetic affinities of Sri Lankan populations". Human Biology. American Association of Anthropological Genetics. 67 (6): 843–66. PMID 8543296.
  9. ^ Mastana, Sarabjit (November 1996). "Genetic variation in Sri Lanka" (PDF). Scientific Reports: 26–27.
  10. ^ Soejima M, Koda Y (December 2005). "Denaturing high-performance liquid chromatography-based genotyping and genetic variation of FUT2 in Sri Lanka". Transfusion. 45 (12): 1934–9. doi:10.1111/j.1537-2995.2005.00651.x. PMID 16371047. S2CID 10401001.
  11. ^ Saha N (June 1988). "Blood genetic markers in Sri Lankan populations--reappraisal of the legend of Prince Vijaya". American Journal of Physical Anthropology. 76 (2): 217–25. doi:10.1002/ajpa.1330760210. PMID 3166342.
  12. ^ Roberts DF, Creen CK, Abeyaratne KP (1972). "Blood Groups of the Sinhalese". Man. 7 (1): 122–127. doi:10.2307/2799860. JSTOR 2799860.
  13. ^ Dissanayake VH, Giles V, Jayasekara RW, Seneviratne HR, Kalsheker N, Broughton Pipkin F, Morgan L (April 2009). "A study of three candidate genes for pre-eclampsia in a Sinhalese population from Sri Lanka". The Journal of Obstetrics and Gynaecology Research. 35 (2): 234–42. doi:10.1111/j.1447-0756.2008.00926.x. PMID 19708171. S2CID 24958292.
  14. ^ Illeperuma RJ, Mohotti SN, De Silva TM, Fernandopulle ND, Ratnasooriya WD (June 2009). "Genetic profile of 11 autosomal STR loci among the four major ethnic groups in Sri Lanka". Forensic Science International. Genetics. 3 (3): e105-6. doi:10.1016/j.fsigen.2008.10.002. PMID 19414153.
  15. ^ Deng L, Hoh BP, Lu D, Saw WY, Twee-Hee Ong R, Kasturiratne A, et al. (September 2015). "Dissecting the genetic structure and admixture of four geographical Malay populations". Scientific Reports. 5 (1): 14375. Bibcode:2015NatSR...514375D. doi:10.1038/srep14375. PMC 4585825. PMID 26395220.
  16. ^ a b Petraglia MD, Allchin B (2007). The Evolution and History of Human Populations in South Asia: Inter-disciplinary Studies in Archaeology, Biological Anthropology, Linguistics and Genetics. Springer Science & Business Media. ISBN 978-1-4020-5562-1.[page needed]
  17. ^ Soejima M, Koda Y (January 2007). "Population differences of two coding SNPs in pigmentation-related genes SLC24A5 and SLC45A2". International Journal of Legal Medicine. 121 (1): 36–9. doi:10.1007/s00414-006-0112-z. PMID 16847698. S2CID 11192076.
  18. ^ Kivisild T, Rootsi S, Metspalu M, Mastana S, Kaldma K, Parik J, et al. (February 2003). "The genetic heritage of the earliest settlers persists both in Indian tribal and caste populations". American Journal of Human Genetics. 72 (2): 313–32. doi:10.1086/346068. PMC 379225. PMID 12536373.
  19. ^ Sengupta S, Zhivotovsky LA, King R, Mehdi SQ, Edmonds CA, Chow CE, et al. (February 2006). "Polarity and temporality of high-resolution y-chromosome distributions in India identify both indigenous and exogenous expansions and reveal minor genetic influence of Central Asian pastoralists". American Journal of Human Genetics. 78 (2): 202–21. doi:10.1086/499411. PMC 1380230. PMID 16400607.
  20. ^ a b Roychoudhury AK, Nei M (1985). "Genetic relationships between Indians and their neighboring populations". Human Heredity. 35 (4): 201–6. doi:10.1159/000153545. PMID 4029959.
  21. ^ Bhasin MK (4 September 2017). "Morphology to Molecular Anthropology: Castes and Tribes of India". International Journal of Human Genetics. 9 (3–4): 145–230. doi:10.1080/09723757.2009.11886070. S2CID 53353581.
  22. ^ Peiris R, Takahashi M, Sasaki K, Kanazawa E (July 2007). "Root and canal morphology of permanent mandibular molars in a Sri Lankan population". Odontology. 95 (1): 16–23. doi:10.1007/s10266-007-0074-8. PMID 17660977. S2CID 8504778.
  23. ^ Ranaweera L, Kaewsutthi S, Win Tun A, Boonyarit H, Poolsuwan S, Lertrit P (January 2014). "Mitochondrial DNA history of Sri Lankan ethnic people: their relations within the island and with the Indian subcontinental populations". Journal of Human Genetics. 59 (1): 28–36. doi:10.1038/jhg.2013.112. PMID 24196378.
  24. ^ "Y-DNA Haplogroup C and its Subclades - 2017". International Society of Genetic Genealogy. Retrieved 31 May 2017.
  25. ^ a b c Matsumoto H (2009). "The origin of the Japanese race based on genetic markers of immunoglobulin G". Proceedings of the Japan Academy. Series B, Physical and Biological Sciences. 85 (2): 69–82. Bibcode:2009PJAB...85...69M. doi:10.2183/pjab.85.69. PMC 3524296. PMID 19212099.
  26. ^ Fujihara J, Yasuda T, Iida R, Ueki M, Sano R, Kominato Y, et al. (July 2015). "Global analysis of genetic variations in a 56-bp variable number of tandem repeat polymorphisms within the human deoxyribonuclease I gene". Legal Medicine. 17 (4): 283–6. doi:10.1016/j.legalmed.2015.01.005. PMID 25771153.
  27. ^ Malhotra R (1992). Anthropology of Development: Commemoration Volume in the Honour of Professor I.P. Singh. Mittal Publications. ISBN 978-81-7099-328-5.[page needed]
  28. ^ Kumar D (2012). Genetic Disorders of the Indian Subcontinent. Springer Science & Business Media. ISBN 978-1-4020-2231-9.[page needed]
  29. ^ Perera, Nandika; Galhena, Gayani; Ranawaka, Gaya (17 June 2021). "X-chromosomal STR based genetic polymorphisms and demographic history of Sri Lankan ethnicities and their relationship with global populations". Scientific Reports. 11: 12748. doi:10.1038/s41598-021-92314-9. ISSN 2045-2322. PMC 8211843. PMID 34140598.