Suppr超能文献

将表型与全基因组进行匹配:从个人基因组计划社区挑战的四次迭代中学到的经验教训。

Matching phenotypes to whole genomes: Lessons learned from four iterations of the personal genome project community challenges.

作者信息

Cai Binghuang, Li Biao, Kiga Nikki, Thusberg Janita, Bergquist Timothy, Chen Yun-Ching, Niknafs Noushin, Carter Hannah, Tokheim Collin, Beleva-Guthrie Violeta, Douville Christopher, Bhattacharya Rohit, Yeo Hui Ting Grace, Fan Jean, Sengupta Sohini, Kim Dewey, Cline Melissa, Turner Tychele, Diekhans Mark, Zaucha Jan, Pal Lipika R, Cao Chen, Yu Chen-Hsin, Yin Yizhou, Carraro Marco, Giollo Manuel, Ferrari Carlo, Leonardi Emanuela, Tosatto Silvio C E, Bobe Jason, Ball Madeleine, Hoskins Roger A, Repo Susanna, Church George, Brenner Steven E, Moult John, Gough Julian, Stanke Mario, Karchin Rachel, Mooney Sean D

机构信息

Department of Biomedical Informatics & Medical Education, University of Washington School of Medicine, Seattle, Washington.

The Buck Institute for Research on Aging, Novato, California.

出版信息

Hum Mutat. 2017 Sep;38(9):1266-1276. doi: 10.1002/humu.23265. Epub 2017 Jun 19.

Abstract

The advent of next-generation sequencing has dramatically decreased the cost for whole-genome sequencing and increased the viability for its application in research and clinical care. The Personal Genome Project (PGP) provides unrestricted access to genomes of individuals and their associated phenotypes. This resource enabled the Critical Assessment of Genome Interpretation (CAGI) to create a community challenge to assess the bioinformatics community's ability to predict traits from whole genomes. In the CAGI PGP challenge, researchers were asked to predict whether an individual had a particular trait or profile based on their whole genome. Several approaches were used to assess submissions, including ROC AUC (area under receiver operating characteristic curve), probability rankings, the number of correct predictions, and statistical significance simulations. Overall, we found that prediction of individual traits is difficult, relying on a strong knowledge of trait frequency within the general population, whereas matching genomes to trait profiles relies heavily upon a small number of common traits including ancestry, blood type, and eye color. When a rare genetic disorder is present, profiles can be matched when one or more pathogenic variants are identified. Prediction accuracy has improved substantially over the last 6 years due to improved methodology and a better understanding of features.

摘要

下一代测序技术的出现极大地降低了全基因组测序的成本,并提高了其在研究和临床护理中应用的可行性。个人基因组计划(PGP)提供了对个体基因组及其相关表型的无限制访问。这一资源使得基因组解释关键评估(CAGI)能够发起一项社区挑战,以评估生物信息学社区从全基因组预测性状的能力。在CAGI PGP挑战中,研究人员被要求根据个体的全基因组预测其是否具有特定的性状或特征。使用了几种方法来评估提交的结果,包括ROC AUC(受试者操作特征曲线下的面积)、概率排名、正确预测的数量以及统计显著性模拟。总体而言,我们发现预测个体性状很困难,这依赖于对一般人群中性状频率的深入了解,而将基因组与性状特征进行匹配则严重依赖于少数常见性状,包括祖先、血型和眼睛颜色。当存在罕见的遗传疾病时,当识别出一个或多个致病变异时,就可以进行特征匹配。由于方法的改进和对特征的更好理解,在过去6年中预测准确性有了显著提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e6c/5645203/d2e0e075ba95/nihms889883f1.jpg

相似文献

2
A probabilistic model to predict clinical phenotypic traits from genome sequencing.一种从基因组测序预测临床表型特征的概率模型。
PLoS Comput Biol. 2014 Sep 4;10(9):e1003825. doi: 10.1371/journal.pcbi.1003825. eCollection 2014 Sep.

引用本文的文献

3
Genome interpretation using in silico predictors of variant impact.使用变异影响的计算机预测因子进行基因组解读。
Hum Genet. 2022 Oct;141(10):1549-1577. doi: 10.1007/s00439-022-02457-6. Epub 2022 Apr 30.

本文引用的文献

3
Ten simple rules for a community computational challenge.社区计算挑战的十条简单规则。
PLoS Comput Biol. 2015 Apr 23;11(4):e1004150. doi: 10.1371/journal.pcbi.1004150. eCollection 2015 Apr.
4
The SUPERFAMILY 1.75 database in 2014: a doubling of data.2014年的超家族1.75数据库:数据量翻倍。
Nucleic Acids Res. 2015 Jan;43(Database issue):D227-33. doi: 10.1093/nar/gku1041. Epub 2014 Nov 20.
5
A probabilistic model to predict clinical phenotypic traits from genome sequencing.一种从基因组测序预测临床表型特征的概率模型。
PLoS Comput Biol. 2014 Sep 4;10(9):e1003825. doi: 10.1371/journal.pcbi.1003825. eCollection 2014 Sep.
7
Identifying Mendelian disease genes with the variant effect scoring tool.使用变异效应评分工具鉴定孟德尔疾病基因。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2164-14-S3-S3. Epub 2013 May 28.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验