
Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

Affiliation

Department of Mathematics, Shanghai Normal University, Shanghai, China.

Publication information

BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S5. doi: 10.1186/1471-2105-14-S5-S5. Epub 2013 Apr 10.

DOI:10.1186/1471-2105-14-S5-S5

PMID:23734762

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3622672/

Abstract

BACKGROUND

Identifying gene-phenotype relationships is a fundamental challenge in human health and clinical research. Based on the observation that genes causing the same or similar phenotypes tend to be correlated with each other in the protein-protein interaction network, many network-based approaches have been proposed, each built on a different underlying model. A recent comparative study showed that diffusion-based methods achieve state-of-the-art predictive performance.

RESULTS

In this paper, we propose a new diffusion-based method for prioritizing candidate disease genes. The diffusion profile of a disease is defined as the stationary distribution of candidate genes under a random walk with restart that incorporates similarities between phenotypes. Candidate disease genes are then prioritized by comparing their diffusion profiles with that of the disease. The effectiveness of the method was demonstrated through leave-one-out cross-validation against control genes from artificial linkage intervals and against randomly chosen genes. A comparative study showed that our method achieves improved performance over some classical diffusion-based methods. To further illustrate the method, we used it to predict novel causal genes for 16 multifactorial diseases, including prostate cancer and Alzheimer's disease; the top predictions were in good agreement with literature reports.
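The general idea of ranking candidates by comparing diffusion profiles can be illustrated with a minimal sketch. It is not the authors' implementation: the toy network, the restart probability of 0.5, the use of cosine similarity as the profile comparison, and the function names `rwr` and `rank_candidates` are all assumptions made for illustration. The abstract does not state which similarity measure or parameters the paper uses, and the seed weights here only stand in for the phenotype-similarity information the method incorporates.

```python
import numpy as np

def rwr(adj, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Stationary distribution of a random walk with restart.

    adj:   symmetric (n, n) adjacency matrix with no isolated nodes.
    seeds: non-negative restart weights of length n (normalised here).
    """
    adj = np.asarray(adj, dtype=float)
    # Column-normalise so that each column is a transition distribution.
    W = adj / adj.sum(axis=0, keepdims=True)
    p0 = np.asarray(seeds, dtype=float)
    p0 = p0 / p0.sum()
    p = p0.copy()
    for _ in range(max_iter):
        # One step: walk along an edge, or jump back to the seeds.
        p_next = (1 - restart) * (W @ p) + restart * p0
        if np.abs(p_next - p).sum() < tol:
            return p_next
        p = p_next
    return p

def cosine(a, b):
    """Cosine similarity between two profiles (an assumed measure)."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def rank_candidates(adj, disease_seeds, candidates, restart=0.5):
    """Rank candidates by similarity of their profile to the disease profile."""
    adj = np.asarray(adj, dtype=float)
    disease_profile = rwr(adj, disease_seeds, restart)
    scores = {}
    for g in candidates:
        seed = np.zeros(adj.shape[0])
        seed[g] = 1.0  # the walk restarts at the candidate gene itself
        scores[g] = cosine(disease_profile, rwr(adj, seed, restart))
    order = sorted(candidates, key=lambda g: scores[g], reverse=True)
    return order, scores

# Hypothetical toy network: a triangle (0, 1, 2) attached to a chain 2-3-4-5.
toy_adj = np.zeros((6, 6))
for i, j in [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4), (4, 5)]:
    toy_adj[i, j] = toy_adj[j, i] = 1.0

# Genes 0 and 1 play the role of known disease genes; 2 and 5 are candidates.
order, scores = rank_candidates(toy_adj, [1, 1, 0, 0, 0, 0], [5, 2])
print(order, scores)
```

In this toy setting the candidate that sits next to the seed genes is ranked above the one at the far end of the chain. Comparing whole profiles, rather than reading off a single diffusion score, is what the abstract refers to as a global similarity measure.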

CONCLUSIONS

Our study indicates that integrating multiple information sources, especially phenotype similarity profile data, and introducing a global similarity measure between disease and gene diffusion profiles both help in prioritizing candidate disease genes.

AVAILABILITY

Programs and data are available upon request.





