SemFunSim: a new method for measuring disease similarity by integrating semantic and gene functional association.

Authors

Cheng Liang, Li Jie, Ju Peng, Peng Jiajie, Wang Yadong

Affiliations

Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China.

School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore.

Publication information

PLoS One. 2014 Jun 16;9(6):e99415. doi: 10.1371/journal.pone.0099415. eCollection 2014.

DOI:10.1371/journal.pone.0099415

PMID:24932637

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4059643/

Abstract

BACKGROUND

Measuring similarity between diseases plays an important role in disease-related molecular function research. Functional associations between disease-related genes and semantic associations between diseases are often used to identify pairs of similar diseases from different perspectives. Exploiting both kinds of association together to calculate disease similarity remains a challenge. A new method, SemFunSim, that integrates semantic and functional associations is therefore proposed to address this issue.

METHODS

SemFunSim is designed as follows. First, FunSim (Functional Similarity) is proposed to calculate disease similarity using disease-related gene sets in a weighted network of human gene function. Next, SemSim (Semantic Similarity) is devised to calculate disease similarity using the relationship between two diseases in the Disease Ontology. Finally, FunSim and SemSim are integrated to measure disease similarity.
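The three steps above can be illustrated with a small sketch. The abstract does not give the formulas, so everything here is an assumption made for illustration: the function names, the best-match averaging used for the functional score, the Jaccard overlap of ontology ancestors used for the semantic score, and the linear weight `alpha` used to combine them are stand-ins and are not the paper's definitions. The gene and disease identifiers are toy values.

```python
def fun_sim(genes_a, genes_b, weight):
    """Toy functional similarity: best-match average over a weighted gene network.

    weight maps frozenset({g, h}) to an association score in [0, 1] (assumed format).
    """
    def link(g, h):
        # A gene is treated as fully associated with itself.
        return 1.0 if g == h else weight.get(frozenset((g, h)), 0.0)

    def best(g, others):
        return max((link(g, h) for h in others), default=0.0)

    if not genes_a or not genes_b:
        return 0.0
    total = sum(best(g, genes_b) for g in genes_a)
    total += sum(best(h, genes_a) for h in genes_b)
    return total / (len(genes_a) + len(genes_b))


def ancestors(term, parents):
    """Return the term plus all of its ancestors in a DAG given as child -> parents."""
    seen, stack = set(), [term]
    while stack:
        t = stack.pop()
        if t not in seen:
            seen.add(t)
            stack.extend(parents.get(t, []))
    return seen


def sem_sim(term_a, term_b, parents):
    """Toy semantic similarity: Jaccard overlap of the two ancestor sets."""
    a, b = ancestors(term_a, parents), ancestors(term_b, parents)
    return len(a & b) / len(a | b)


def combine(fun, sem, alpha=0.5):
    """Assumed integration step: a simple weighted average of the two scores."""
    return alpha * fun + (1 - alpha) * sem


# Toy inputs (hypothetical identifiers, not real annotations).
weight = {frozenset(("G1", "G2")): 0.8, frozenset(("G2", "G3")): 0.9}
parents = {"D_a": ["D_mid"], "D_b": ["D_mid"], "D_mid": ["D_root"]}

fun = fun_sim({"G1", "G2"}, {"G2", "G3"}, weight)
sem = sem_sim("D_a", "D_b", parents)
score = combine(fun, sem)
print(round(fun, 4), round(sem, 4), round(score, 4))
```

Keeping the two scores as separate functions makes it easy to swap in a different functional or semantic measure, or a different integration rule, without touching the rest of the sketch.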

RESULTS

The high average AUC (area under the receiver operating characteristic curve) of 96.37% shows that SemFunSim achieves a high true positive rate and a low false positive rate. Of the top 100 pairs of similar diseases identified by SemFunSim, 79 are annotated in the Comparative Toxicogenomics Database (CTD) as being targeted by the same therapeutic compounds, whereas the other methods we compared identified 35 or fewer such pairs among their top 100. Moreover, when applying our method to diseases without annotated compounds in CTD, we could confirm many of our predicted candidate compounds from the literature. This indicates that SemFunSim is an effective method for drug repositioning.
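The two evaluation measures mentioned above, AUC and the number of validated pairs among the top-ranked ones, can be computed as in the following sketch. It is a generic illustration, not the authors' evaluation pipeline: the function names, the pairwise (Mann-Whitney) formulation of AUC, and the toy scores and labels are assumptions chosen for the example.

```python
def auc(scores, labels):
    """AUC as the probability that a positive pair outranks a negative pair.

    Ties count as half a win (the Mann-Whitney formulation of ROC AUC).
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def hits_in_top_k(scored_pairs, validated, k):
    """Count how many of the k highest-scoring pairs appear in a validated set."""
    ranked = sorted(scored_pairs, key=lambda item: item[1], reverse=True)[:k]
    return sum(1 for pair, _ in ranked if pair in validated)


# Toy data (hypothetical scores and labels, not results from the paper).
scores = [0.9, 0.8, 0.7, 0.4, 0.2]
labels = [1, 1, 0, 1, 0]
toy_auc = auc(scores, labels)

scored_pairs = [(("A", "B"), 0.9), (("A", "C"), 0.8), (("B", "C"), 0.7)]
validated = {("A", "B"), ("B", "C")}
toy_hits = hits_in_top_k(scored_pairs, validated, k=2)
print(round(toy_auc, 4), toy_hits)
```

In the setting the abstract describes, `validated` would correspond to disease pairs that share a therapeutic compound in CTD and `k` would be 100.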
