Robust genome-wide ancestry inference for heterogeneous datasets: illustrated using the 1,000 genome project with 3D facial images.

Affiliations

Medical Imaging Research Center, MIRC, University Hospitals Leuven, Leuven, Belgium.

Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium.

Publication information

Sci Rep. 2020 Jul 16;10(1):11850. doi: 10.1038/s41598-020-68259-w.

Abstract

Estimates of individual-level genomic ancestry are routinely used in human genetics and related fields. The analysis of population structure and genomic ancestry can yield insights into modern and ancient populations, allowing us to address questions regarding admixture and the numbers and identities of the parental source populations. Unrecognized population structure is also an important confounder to correct for in genome-wide association studies. However, it remains challenging to work with heterogeneous datasets from multiple studies collected by different laboratories with diverse genotyping and imputation protocols. This work presents a new approach and an accompanying open-source toolbox that facilitate a robust integrative analysis of population structure and genomic ancestry estimates for heterogeneous datasets. We show robustness against individual outliers and different protocols for the projection of new samples into a reference ancestry space, and the ability to reveal and adjust for population structure in a simulated case-control admixed population. Given that visually evident and easily recognizable patterns of human facial characteristics co-vary with genomic ancestry, and based on the integration of three different sources of genome data, we generate average 3D faces to illustrate genomic ancestry variations within the 1000 Genomes Project and for eight ancient-DNA profiles, respectively.
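The idea of projecting new samples into a reference ancestry space can be illustrated with a minimal sketch. It uses plain principal component analysis in NumPy and is not the robust method or the toolbox described in the abstract. The function names (`fit_reference_space`, `project`), the simulated allele frequencies, and the 0/1/2 genotype coding are all assumptions made for illustration.

```python
import numpy as np

def fit_reference_space(G_ref, n_components=2):
    """Fit ancestry axes on reference genotypes (samples x SNPs, coded 0/1/2).

    Hypothetical helper: ordinary PCA on standardized genotypes,
    not the robust estimator from the paper.
    """
    G_ref = np.asarray(G_ref, dtype=float)
    p = G_ref.mean(axis=0) / 2.0                  # reference allele frequencies
    keep = (p > 0.0) & (p < 1.0)                  # drop monomorphic SNPs
    mean = 2.0 * p[keep]                          # per-SNP reference mean
    scale = np.sqrt(2.0 * p[keep] * (1.0 - p[keep]))
    Z = (G_ref[:, keep] - mean) / scale           # standardized reference genotypes
    # Rows of Vt are the principal axes (SNP loadings).
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    return {"keep": keep, "mean": mean, "scale": scale, "axes": Vt[:n_components]}

def project(G_new, space):
    """Project samples using the reference means, scales, and axes only."""
    G_new = np.asarray(G_new, dtype=float)
    Z = (G_new[:, space["keep"]] - space["mean"]) / space["scale"]
    return Z @ space["axes"].T

# Simulated data (assumption): two populations with different allele frequencies.
rng = np.random.default_rng(0)
n_snps = 500
p_a = rng.uniform(0.1, 0.9, n_snps)
p_b = np.clip(p_a + rng.normal(0.0, 0.2, n_snps), 0.05, 0.95)

def simulate(p, n):
    """Draw n diploid genotypes per SNP from allele frequencies p."""
    return rng.binomial(2, p, size=(n, p.size))

G_ref = np.vstack([simulate(p_a, 50), simulate(p_b, 50)])   # reference panel
G_new = np.vstack([simulate(p_a, 5), simulate(p_b, 5)])     # new samples

space = fit_reference_space(G_ref)
scores_new = project(G_new, space)   # one row per new sample, one column per axis
```

Because the new samples are standardized with the reference frequencies and multiplied by the reference loadings, adding them never changes the ancestry axes themselves. Ordinary PCA of this kind is sensitive to outlying reference samples, which is the weakness the abstract says the proposed approach addresses.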

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eb8/7367291/b7a0309eebfc/41598_2020_68259_Fig2_HTML.jpg
