High level of inbreeding in final phase of 1000 Genomes Project.

Author information

Gazal Steven, Sahbatou Mourad, Babron Marie-Claude, Génin Emmanuelle, Leutenegger Anne-Louise

Affiliations

INSERM, IAME, UMR 1137, F-75018 Paris, France.

Plateforme de génomique constitutionnelle du GHU Nord, Assistance Publique des Hôpitaux de Paris (APHP), Hôpital Bichat, F-75018 Paris, France.

Publication information

Sci Rep. 2015 Dec 2;5:17453. doi: 10.1038/srep17453.

Abstract

The 1000 Genomes Project provides a unique source of whole genome sequencing data for studies of human population genetics and human diseases. The last release of this project includes more than 2,500 sequenced individuals from 26 populations. Although relationships among individuals have been investigated in some of the populations, inbreeding has never been studied. In this article, we estimated the genomic inbreeding coefficient of each individual and found an unexpectedly high level of inbreeding in the 1000 Genomes data: nearly a quarter of the individuals were inbred, and around 4% of them had inbreeding coefficients similar to or greater than those expected for first-cousin offspring. Inbred individuals were found in each of the 26 populations, with some populations showing proportions of inbred individuals above 50%. We also detected 227 previously unreported pairs of close relatives (up to and including first cousins). We therefore propose subsets of unrelated and outbred individuals for use by the scientific community. In addition, because admixed populations are present in the 1000 Genomes Project, we performed simulations to study the robustness of inbreeding coefficient estimates in the presence of admixture. We found that our multi-point approach (FSuite) was quite robust to admixture, unlike single-point methods (PLINK).
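To make the "single-point" idea concrete, here is a minimal Python sketch of a method-of-moments inbreeding estimate: the excess of observed homozygous genotypes over the number expected under Hardy-Weinberg proportions, computed marker by marker. It is an illustration under stated assumptions, not the authors' FSuite multi-point method and not PLINK's actual code; the function name `single_point_f`, the genotype coding (0/1/2 alternate-allele counts, `None` for missing), and the toy data are all hypothetical, and refinements that real tools apply are omitted. The constant 1/16 is the standard pedigree expectation for offspring of first cousins.

```python
# Expected inbreeding coefficient for the offspring of first cousins:
# two shared grandparents, each contributing (1/2)**5, so 2 * 1/32 = 1/16.
FIRST_COUSIN_F = 1.0 / 16.0


def single_point_f(genotypes, alt_allele_freqs):
    """Method-of-moments inbreeding estimate from unlinked markers.

    genotypes        -- alternate-allele counts (0, 1, 2) or None if missing
    alt_allele_freqs -- population alternate-allele frequency per marker

    Returns (observed_hom - expected_hom) / (n_markers - expected_hom),
    where expected_hom assumes Hardy-Weinberg proportions (1 - 2pq).
    """
    observed_hom = 0
    expected_hom = 0.0
    n_markers = 0
    for genotype, p in zip(genotypes, alt_allele_freqs):
        if genotype is None:
            continue  # skip missing calls
        n_markers += 1
        if genotype != 1:
            observed_hom += 1  # 0 or 2 alternate alleles is homozygous
        expected_hom += 1.0 - 2.0 * p * (1.0 - p)
    if n_markers == expected_hom:
        raise ValueError("no informative (polymorphic) markers")
    return (observed_hom - expected_hom) / (n_markers - expected_hom)


if __name__ == "__main__":
    freqs = [0.5, 0.5, 0.5, 0.5]  # toy frequencies, not real data
    f_hat = single_point_f([0, 2, 1, 1], freqs)
    print(f_hat, f_hat >= FIRST_COUSIN_F)
```

Because each marker is treated independently and the expected homozygosity depends on a single set of population allele frequencies, an estimator of this kind is sensitive to misspecified frequencies, which is one intuition for why admixture can bias it.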

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6c1/4667178/ac74d1dcdb26/srep17453-f1.jpg
