• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

综合语言学、社会结构和地理学模型来模拟印度内部的遗传多样性。

Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India.

机构信息

Computational Genomics, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA.

Computer Science Department, Purdue University, West Lafayette, IN, USA.

出版信息

Mol Biol Evol. 2021 May 4;38(5):1809-1819. doi: 10.1093/molbev/msaa321.

DOI:10.1093/molbev/msaa321
PMID:33481022
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8097304/
Abstract

India represents an intricate tapestry of population substructure shaped by geography, language, culture, and social stratification. Although geography closely correlates with genetic structure in other parts of the world, the strict endogamy imposed by the Indian caste system and the large number of spoken languages add further levels of complexity to understand Indian population structure. To date, no study has attempted to model and evaluate how these factors have interacted to shape the patterns of genetic diversity within India. We merged all publicly available data from the Indian subcontinent into a data set of 891 individuals from 90 well-defined groups. Bringing together geography, genetics, and demographic factors, we developed Correlation Optimization of Genetics and Geodemographics to build a model that explains the observed population genetic substructure. We show that shared language along with social structure have been the most powerful forces in creating paths of gene flow in the subcontinent. Furthermore, we discover the ethnic groups that best capture the diverse genetic substructure using a ridge leverage score statistic. Integrating data from India with a data set of additional 1,323 individuals from 50 Eurasian populations, we find that Indo-European and Dravidian speakers of India show shared genetic drift with Europeans, whereas the Tibeto-Burman speaking tribal groups have maximum shared genetic drift with East Asians.

摘要

印度是一个人口结构复杂的国家,其人口结构受到地理、语言、文化和社会分层等多种因素的影响。尽管地理因素与世界其他地区的遗传结构密切相关,但印度种姓制度所强加的严格内婚制以及众多的语言使用情况进一步增加了理解印度人口结构的复杂性。迄今为止,尚无研究试图对这些因素如何相互作用以塑造印度内部遗传多样性模式进行建模和评估。我们将来自印度次大陆的所有公开可用数据合并到一个包含 891 个人的数据集中,这些人来自 90 个明确界定的群体。我们综合考虑了地理、遗传和人口统计学因素,开发了遗传与地理人口统计学相关性优化(Correlation Optimization of Genetics and Geodemographics),以构建一个可以解释观察到的人口遗传亚结构的模型。我们发现,共同的语言以及社会结构是在次大陆创造基因流路径的最强大力量。此外,我们还使用脊杠杆得分统计量发现了最能捕捉到不同遗传亚结构的族群。我们将来自印度的数据与来自 50 个欧亚人群的另外 1323 个人的数据进行整合,发现印度的印欧语系和达罗毗荼语系使用者与欧洲人表现出共同的遗传漂变,而讲藏缅语的部落群体与东亚人表现出最大的共同遗传漂变。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f152/8097304/4c4aed10a900/msaa321f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f152/8097304/6be74a271d6f/msaa321f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f152/8097304/706582c613bb/msaa321f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f152/8097304/4c4aed10a900/msaa321f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f152/8097304/6be74a271d6f/msaa321f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f152/8097304/706582c613bb/msaa321f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f152/8097304/4c4aed10a900/msaa321f3.jpg

相似文献

1
Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India.综合语言学、社会结构和地理学模型来模拟印度内部的遗传多样性。
Mol Biol Evol. 2021 May 4;38(5):1809-1819. doi: 10.1093/molbev/msaa321.
2
Phylogeography of mtDNA haplogroup R7 in the Indian peninsula.印度半岛线粒体DNA单倍群R7的系统地理学
BMC Evol Biol. 2008 Aug 4;8:227. doi: 10.1186/1471-2148-8-227.
3
A microsatellite study to disentangle the ambiguity of linguistic, geographic, ethnic and genetic influences on tribes of India to get a better clarity of the antiquity and peopling of South Asia.一项微卫星研究,旨在厘清语言、地理、种族和基因对印度部落的影响所存在的模糊性,以便更清晰地了解南亚的古代史和人口情况。
Am J Phys Anthropol. 2009 Aug;139(4):533-46. doi: 10.1002/ajpa.21018.
4
Genetic affinities among the lower castes and tribal groups of India: inference from Y chromosome and mitochondrial DNA.印度低种姓群体与部落群体之间的遗传亲缘关系:来自Y染色体和线粒体DNA的推断
BMC Genet. 2006 Aug 7;7:42. doi: 10.1186/1471-2156-7-42.
5
Y chromosome haplogroup distribution in Indo-European speaking tribes of Gujarat, western India.印度西部古吉拉特邦说印欧语系语言部落的Y染色体单倍群分布
PLoS One. 2014 Mar 10;9(3):e90414. doi: 10.1371/journal.pone.0090414. eCollection 2014.
6
A prehistory of Indian Y chromosomes: evaluating demic diffusion scenarios.印度Y染色体的史前史:评估人口扩散情形
Proc Natl Acad Sci U S A. 2006 Jan 24;103(4):843-8. doi: 10.1073/pnas.0507714103. Epub 2006 Jan 13.
7
The Austroasiatic Munda population from India and Its enigmatic origin: a HLA diversity study.来自印度的南亚语系蒙达人群体及其神秘起源:一项人类白细胞抗原多样性研究。
Hum Biol. 2011 Jun;83(3):405-35. doi: 10.3378/027.083.0306.
8
Microsatellite diversity reveals the interplay of language and geography in shaping genetic differentiation of diverse Proto-Australoid populations of west-central India.微卫星多样性揭示了语言和地理因素在塑造印度中西部不同原澳大利亚人种群体遗传分化过程中的相互作用。
Am J Phys Anthropol. 2006 Feb;129(2):260-7. doi: 10.1002/ajpa.20283.
9
A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals.一份针对印度次大陆的基因年表显示存在严重的性别偏向性扩散。
BMC Evol Biol. 2017 Mar 23;17(1):88. doi: 10.1186/s12862-017-0936-9.
10
Genomic congruence of Indo-European speaking tribes of western India with Dravidian-speaking populations of southern India: A study of 20 autosomal DNA markers.印度西部说印欧语系语言的部落与印度南部说达罗毗荼语系语言的人群的基因组一致性:一项关于20个常染色体DNA标记的研究。
Ann Hum Biol. 2011 Sep;38(5):583-91. doi: 10.3109/03014460.2011.577455. Epub 2011 May 11.

引用本文的文献

1
Human genetic diversity across South Asian populations: A systematic review and meta-analysis.南亚人群的人类遗传多样性:系统评价与荟萃分析。
Medicine (Baltimore). 2025 Aug 29;104(35):e44147. doi: 10.1097/MD.0000000000044147.
2
Genetic insights into forensic features and population structure of the Hajong tribe in the Indian regions of Eastern Himalaya.对喜马拉雅东部印度地区哈琼部落法医特征和种群结构的遗传学见解。
Mol Genet Genomics. 2025 Jun 5;300(1):55. doi: 10.1007/s00438-025-02262-6.
3
Populations of Latvia and Lithuania in the context of some Indo-European and non-Indo-European speaking populations of Europe and India: insights from genetic structure analysis.

本文引用的文献

1
The formation of human populations in South and Central Asia.南亚和中亚地区的人口形成。
Science. 2019 Sep 6;365(6457). doi: 10.1126/science.aat7487.
2
TeraPCA: a fast and scalable software package to study genetic variation in tera-scale genotypes.TeraPCA:一个快速且可扩展的软件包,用于研究万亿级基因型中的遗传变异。
Bioinformatics. 2019 Oct 1;35(19):3679-3683. doi: 10.1093/bioinformatics/btz157.
3
The genetic legacy of continental scale admixture in Indian Austroasiatic speakers.印度澳亚语人群中大陆规模混合的遗传遗产。
拉脱维亚和立陶宛人口与欧洲及印度一些说印欧语系和非印欧语系语言的人口情况:来自基因结构分析的见解
Front Genet. 2024 Nov 20;15:1493270. doi: 10.3389/fgene.2024.1493270. eCollection 2024.
4
Matrix sketching framework for linear mixed models in association studies.矩阵素描框架在关联研究中的线性混合模型。
Genome Res. 2024 Oct 11;34(9):1304-1311. doi: 10.1101/gr.279230.124.
5
Forensic Characterization, Genomic Variability and Ancestry Analysis of Six Populations from Odisha Using mtDNA SNPs and Autosomal STRs.利用线粒体DNA单核苷酸多态性和常染色体短串联重复序列对奥里萨邦六个群体进行法医特征分析、基因组变异性分析和血统分析。
Biochem Genet. 2024 Jul 22. doi: 10.1007/s10528-024-10887-2.
6
Editorial: The adaptive value of languages: non-linguistic causes of language diversity, volume II.社论:语言的适应性价值:语言多样性的非语言成因,第二卷。
Front Psychol. 2024 Mar 6;15:1387290. doi: 10.3389/fpsyg.2024.1387290. eCollection 2024.
7
Genomic analysis reveals geography rather than culture as the predominant factor shaping genetic variation in northern Kenyan human populations.基因组分析揭示,在塑造肯尼亚北部人类群体遗传变异方面,起主要作用的是地理因素,而非文化因素。
Am J Biol Anthropol. 2022 Jul;178(3):488-503. doi: 10.1002/ajpa.24521. Epub 2022 Apr 14.
8
Clinical and serological association of plasma 25-hydroxyvitamin D (25(OH)D) levels in lupus and the short-term effects of oral vitamin D supplementation.狼疮患者血浆 25-羟维生素 D(25(OH)D)水平的临床和血清学关联,以及口服维生素 D 补充的短期效果。
Arthritis Res Ther. 2023 Jan 3;25(1):2. doi: 10.1186/s13075-022-02976-7.
9
Pathophysiology, phenotypes and management of type 2 diabetes mellitus in Indian and Chinese populations.中国和印度人群 2 型糖尿病的病理生理学、表型和管理。
Nat Rev Endocrinol. 2022 Jul;18(7):413-432. doi: 10.1038/s41574-022-00669-4. Epub 2022 May 4.
Sci Rep. 2019 Mar 7;9(1):3818. doi: 10.1038/s41598-019-40399-8.
4
The Genetic Ancestry of Modern Indus Valley Populations from Northwest India.印度西北部现代印度河流域人群的遗传起源。
Am J Hum Genet. 2018 Dec 6;103(6):918-929. doi: 10.1016/j.ajhg.2018.10.022.
5
A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals.一份针对印度次大陆的基因年表显示存在严重的性别偏向性扩散。
BMC Evol Biol. 2017 Mar 23;17(1):88. doi: 10.1186/s12862-017-0936-9.
6
Genetics of the peloponnesean populations and the theory of extinction of the medieval peloponnesean Greeks.伯罗奔尼撒半岛人群的遗传学与中世纪伯罗奔尼撒半岛希腊人灭绝理论。
Eur J Hum Genet. 2017 May;25(5):637-645. doi: 10.1038/ejhg.2017.18. Epub 2017 Mar 8.
7
Reconstructing the population history of the largest tribe of India: the Dravidian speaking Gond.重构印度最大部落的人口历史:说达罗毗荼语的贡德人。
Eur J Hum Genet. 2017 Apr;25(4):493-498. doi: 10.1038/ejhg.2016.198. Epub 2017 Feb 1.
8
Genomic analysis of Andamanese provides insights into ancient human migration into Asia and adaptation.安达曼岛人的基因组分析提供了有关古代人类向亚洲迁移和适应的见解。
Nat Genet. 2016 Sep;48(9):1066-70. doi: 10.1038/ng.3621. Epub 2016 Jul 25.
9
Admixture, Population Structure, and F-Statistics.混合、群体结构和F统计量。
Genetics. 2016 Apr;202(4):1485-501. doi: 10.1534/genetics.115.183913. Epub 2016 Feb 8.
10
Genomic reconstruction of the history of extant populations of India reveals five distinct ancestral components and a complex structure.对印度现存人口历史的基因组重建揭示了五个不同的祖先成分和一个复杂的结构。
Proc Natl Acad Sci U S A. 2016 Feb 9;113(6):1594-9. doi: 10.1073/pnas.1513197113. Epub 2016 Jan 25.