The biological coherence of human phenome databases.

Affiliation

Centre for Molecular and Biomolecular Informatics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Geert Grooteplein 26-28, 6525 GA Nijmegen, The Netherlands.

Publication information

Am J Hum Genet. 2009 Dec;85(6):801-8. doi: 10.1016/j.ajhg.2009.10.026.

DOI: 10.1016/j.ajhg.2009.10.026

PMID: 20004759

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2790572/

Abstract

Disease networks are increasingly explored as a complement to networks centered around interactions between genes and proteins. The quality of disease networks is heavily dependent on the amount and quality of phenotype information in phenotype databases of human genetic diseases. We explored which aspects of phenotype database architecture and content best reflect the underlying biology of disease. We used the OMIM-based HPO, Orphanet, and POSSUM phenotype databases for this purpose and devised a biological coherence score based on the sharing of gene ontology annotation to investigate the degree to which phenotype similarity in these databases reflects related pathobiology. Our analyses support the notion that a fine-grained phenotype ontology enhances the accuracy of phenome representation. In addition, we find that the OMIM database that is most used by the human genetics community is heavily underannotated. We show that this problem can easily be overcome by simply adding data available in the POSSUM database to improve OMIM phenotype representations in the HPO. Also, we find that the use of feature frequency estimates--currently implemented only in the Orphanet database--significantly improves the quality of the phenome representation. Our data suggest that there is much to be gained by improving human phenome databases and that some of the measures needed to achieve this are relatively easy to implement. More generally, we propose that curation and more systematic annotation of human phenome databases can greatly improve the power of the phenotype for genetic disease analysis.
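The abstract does not spell out how the biological coherence score is computed, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes that phenotype similarity is a cosine similarity over frequency-weighted feature vectors, and that a disease pair counts as coherent when the causal genes share at least one Gene Ontology term. The disease names, feature weights, GO identifiers, and function names are all hypothetical placeholders, not data or methods taken from the paper.

```python
from math import sqrt

# Hypothetical toy data: disease -> {phenotype feature: frequency weight in [0, 1]}.
# Setting every weight to 1.0 would mimic a database without frequency estimates.
PHENOTYPES = {
    "disease_A": {"short stature": 0.9, "cleft palate": 0.5},
    "disease_B": {"short stature": 0.8, "cleft palate": 0.4, "hearing loss": 0.1},
    "disease_C": {"seizures": 0.9, "ataxia": 0.7},
    "disease_D": {"seizures": 0.6, "ataxia": 0.9},
}

# Hypothetical GO annotation of each disease's causal gene.
# The identifiers are placeholders, not real annotations.
GO_TERMS = {
    "disease_A": {"GO:x1", "GO:x2"},
    "disease_B": {"GO:x1"},
    "disease_C": {"GO:x3"},
    "disease_D": {"GO:x4"},
}


def cosine(u, v):
    """Cosine similarity between two sparse feature-weight vectors."""
    dot = sum(w * v[f] for f, w in u.items() if f in v)
    norm_u = sqrt(sum(w * w for w in u.values()))
    norm_v = sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def best_match(disease, phenotypes):
    """Return the other disease whose phenotype vector is most similar."""
    others = [d for d in phenotypes if d != disease]
    return max(others, key=lambda d: cosine(phenotypes[disease], phenotypes[d]))


def coherence(phenotypes, go_terms):
    """Fraction of diseases whose best phenotype match shares a GO term (assumed score)."""
    hits = sum(
        1
        for d in phenotypes
        if go_terms[d] & go_terms[best_match(d, phenotypes)]
    )
    return hits / len(phenotypes)


score = coherence(PHENOTYPES, GO_TERMS)
```

In this toy setup, diseases A and B match each other and share a GO term, while C and D match each other without sharing one, so half of the best-match pairs count as coherent. Comparing such a score across databases, or with and without frequency weights, is one way to read the kind of comparison the abstract describes.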
