• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 1000 基因组计划与 3D 面部图像进行异构数据集的全基因组祖先推断:实例研究。

Robust genome-wide ancestry inference for heterogeneous datasets: illustrated using the 1,000 genome project with 3D facial images.

机构信息

Medical Imaging Research Center, MIRC, University Hospitals Leuven, Leuven, Belgium.

Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium.

出版信息

Sci Rep. 2020 Jul 16;10(1):11850. doi: 10.1038/s41598-020-68259-w.

DOI:10.1038/s41598-020-68259-w
PMID:32678112
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7367291/
Abstract

Estimates of individual-level genomic ancestry are routinely used in human genetics, and related fields. The analysis of population structure and genomic ancestry can yield insights in terms of modern and ancient populations, allowing us to address questions regarding admixture, and the numbers and identities of the parental source populations. Unrecognized population structure is also an important confounder to correct for in genome-wide association studies. However, it remains challenging to work with heterogeneous datasets from multiple studies collected by different laboratories with diverse genotyping and imputation protocols. This work presents a new approach and an accompanying open-source toolbox that facilitates a robust integrative analysis for population structure and genomic ancestry estimates for heterogeneous datasets. We show robustness against individual outliers and different protocols for the projection of new samples into a reference ancestry space, and the ability to reveal and adjust for population structure in a simulated case-control admixed population. Given that visually evident and easily recognizable patterns of human facial characteristics co-vary with genomic ancestry, and based on the integration of three different sources of genome data, we generate average 3D faces to illustrate genomic ancestry variations within the 1,000 Genome project and for eight ancient-DNA profiles, respectively.

摘要

个体水平基因组起源的估计在人类遗传学和相关领域中得到了广泛应用。人口结构和基因组起源的分析可以提供有关现代和古代人口的见解,使我们能够解决关于混合、父母源群体的数量和身份的问题。未被识别的人口结构也是全基因组关联研究中需要纠正的一个重要混杂因素。然而,处理来自不同实验室、具有不同基因分型和 imputation 方案的多个研究的异质数据集仍然具有挑战性。这项工作提出了一种新的方法和一个配套的开源工具箱,用于对异质数据集进行稳健的综合分析,以估计人口结构和基因组起源。我们展示了对个体离群值和将新样本投影到参考起源空间的不同方案的稳健性,以及在模拟的病例对照混合人群中揭示和调整人口结构的能力。鉴于人类面部特征的明显和易于识别的模式与基因组起源相关,并且基于三个不同的基因组数据源的整合,我们分别生成平均的 3D 面孔,以说明 1000 基因组计划和八个古 DNA 图谱内的基因组起源变化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eb8/7367291/b7a0309eebfc/41598_2020_68259_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eb8/7367291/b7a0309eebfc/41598_2020_68259_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eb8/7367291/b7a0309eebfc/41598_2020_68259_Fig2_HTML.jpg

相似文献

1
Robust genome-wide ancestry inference for heterogeneous datasets: illustrated using the 1,000 genome project with 3D facial images.利用 1000 基因组计划与 3D 面部图像进行异构数据集的全基因组祖先推断:实例研究。
Sci Rep. 2020 Jul 16;10(1):11850. doi: 10.1038/s41598-020-68259-w.
2
Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations.全基因组关联研究在遗传背景多样化的人群中的应用:机遇、方法、陷阱和建议。
Cell. 2019 Oct 17;179(3):589-603. doi: 10.1016/j.cell.2019.08.051. Epub 2019 Oct 10.
3
Colloquium paper: genome-wide patterns of population structure and admixture among Hispanic/Latino populations.学术研讨会论文:西班牙裔/拉丁裔人群的全基因组人口结构和混合模式。
Proc Natl Acad Sci U S A. 2010 May 11;107 Suppl 2(Suppl 2):8954-61. doi: 10.1073/pnas.0914618107. Epub 2010 May 5.
4
Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.利用多个群体的等位基因频率从DNA序列数据中快速推断个体祖先。
BMC Bioinformatics. 2015 Jan 16;16:4. doi: 10.1186/s12859-014-0418-7.
5
Accurate inference of local phased ancestry of modern admixed populations.现代混合群体局部定相祖先的准确推断。
Sci Rep. 2014 Jul 23;4:5800. doi: 10.1038/srep05800.
6
Data Harmonization Guidelines to Combine Multi-platform Genomic Data from Admixed Populations and Boost Power in Genome-Wide Association Studies.数据协调准则,用于整合来自混合人群的多平台基因组数据,并提高全基因组关联研究的效能。
Curr Protoc. 2024 Jun;4(6):e1055. doi: 10.1002/cpz1.1055.
7
Rapid automated landmarking for morphometric analysis of three-dimensional facial scans.用于三维面部扫描形态计量分析的快速自动地标定位
J Anat. 2017 Apr;230(4):607-618. doi: 10.1111/joa.12576. Epub 2017 Jan 12.
8
No evidence from genome-wide data of a Khazar origin for the Ashkenazi Jews.没有全基因组数据能证明阿什肯纳兹犹太人有哈扎尔人起源的证据。
Hum Biol. 2013 Dec;85(6):859-900. doi: 10.3378/027.085.0604.
9
Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation.从个人基因组数据推断群体遗传学:遗传和混合对人类基因组变异的影响。
Am J Hum Genet. 2012 Oct 5;91(4):660-71. doi: 10.1016/j.ajhg.2012.08.025.
10
An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.基于全外显子组测序数据的西班牙裔个体祖籍信息标记面板设计用于个体祖籍估计。
BMC Genomics. 2019 Dec 30;20(Suppl 12):1007. doi: 10.1186/s12864-019-6333-6.

引用本文的文献

1
Forensic skeletal and molecular anthropology face to face: Combining expertise for identification of human remains.法医骨骼人类学与分子人类学面对面:结合专业知识鉴定人类遗骸。
Ann N Y Acad Sci. 2025 Aug;1550(1):77-107. doi: 10.1111/nyas.15398. Epub 2025 Jul 10.
2
Optimized phenotyping of complex morphological traits: enhancing discovery of common and rare genetic variants.复杂形态特征的优化表型分析:加强常见和罕见遗传变异的发现。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf090.
3
Ancestry testing of "Old Tom," a killer whale central to mutualistic interactions with human whalers.

本文引用的文献

1
Identification of individuals by trait prediction using whole-genome sequencing data.基于全基因组测序数据的特征预测进行个体识别。
Proc Natl Acad Sci U S A. 2017 Sep 19;114(38):10166-10171. doi: 10.1073/pnas.1711125114. Epub 2017 Sep 5.
2
TAF1 Variants Are Associated with Dysmorphic Features, Intellectual Disability, and Neurological Manifestations.TAF1基因变异与畸形特征、智力障碍及神经学表现相关。
Am J Hum Genet. 2015 Dec 3;97(6):922-32. doi: 10.1016/j.ajhg.2015.11.005.
3
Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness.
对“老汤姆”(一头虎鲸)的祖先进行测试,它是与人类捕鲸者互利互动的关键。
J Hered. 2023 Nov 15;114(6):598-611. doi: 10.1093/jhered/esad058.
4
Exploring regional aspects of 3D facial variation within European individuals.探讨欧洲个体三维面部变异的区域特征。
Sci Rep. 2023 Mar 6;13(1):3708. doi: 10.1038/s41598-023-30855-x.
5
Hybrid autoencoder with orthogonal latent space for robust population structure inference.具有正交潜在空间的混合自动编码器,用于稳健的群体结构推断。
Sci Rep. 2023 Feb 14;13(1):2612. doi: 10.1038/s41598-023-28759-x.
6
Principal Component Analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated.基于主成分分析(PCA)的群体遗传学研究结果存在高度偏差,必须重新评估。
Sci Rep. 2022 Aug 29;12(1):14683. doi: 10.1038/s41598-022-14395-4.
7
Genetic variants underlying differences in facial morphology in East Asian and European populations.东亚和欧洲人群面部形态差异背后的基因变异。
Nat Genet. 2022 Apr;54(4):403-411. doi: 10.1038/s41588-022-01038-7. Epub 2022 Apr 7.
在存在亲缘关系的情况下,对群体结构进行稳健推断,以进行血统预测和分层校正。
Genet Epidemiol. 2015 May;39(4):276-93. doi: 10.1002/gepi.21896. Epub 2015 Mar 23.
4
Toward DNA-based facial composites: preliminary results and validation.迈向基于DNA的面部合成画像:初步结果与验证
Forensic Sci Int Genet. 2014 Nov;13:208-16. doi: 10.1016/j.fsigen.2014.08.008. Epub 2014 Aug 20.
5
The genetic prehistory of the New World Arctic.新世界北极的遗传史前史。
Science. 2014 Aug 29;345(6200):1255832. doi: 10.1126/science.1255832.
6
Inference of population structure using dense haplotype data.利用高密度单倍型数据推断种群结构。
PLoS Genet. 2012 Jan;8(1):e1002453. doi: 10.1371/journal.pgen.1002453. Epub 2012 Jan 26.
7
Laplacian eigenfunctions learn population structure.拉普拉斯特征函数可学习群体结构。
PLoS One. 2009 Dec 1;4(12):e7928. doi: 10.1371/journal.pone.0007928.
8
Genetic structure of Europeans: a view from the North-East.欧洲人的基因结构:来自东北部的视角。
PLoS One. 2009;4(5):e5472. doi: 10.1371/journal.pone.0005472. Epub 2009 May 8.
9
A recessive skeletal dysplasia, SEMD aggrecan type, results from a missense mutation affecting the C-type lectin domain of aggrecan.一种隐性骨骼发育不良,即聚集蛋白聚糖型SEMD,是由一个影响聚集蛋白聚糖C型凝集素结构域的错义突变引起的。
Am J Hum Genet. 2009 Jan;84(1):72-9. doi: 10.1016/j.ajhg.2008.12.001. Epub 2008 Dec 24.
10
Population admixture: detection by Hardy-Weinberg test and its quantitative effects on linkage-disequilibrium methods for localizing genes underlying complex traits.群体混合:通过哈迪-温伯格检验进行检测及其对定位复杂性状潜在基因的连锁不平衡方法的定量影响。
Genetics. 2001 Feb;157(2):885-97. doi: 10.1093/genetics/157.2.885.