Suppr超能文献

SemFunSim:一种通过整合语义和基因功能关联来测量疾病相似性的新方法。

SemFunSim: a new method for measuring disease similarity by integrating semantic and gene functional association.

作者信息

Cheng Liang, Li Jie, Ju Peng, Peng Jiajie, Wang Yadong

机构信息

Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China.

School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore.

出版信息

PLoS One. 2014 Jun 16;9(6):e99415. doi: 10.1371/journal.pone.0099415. eCollection 2014.

Abstract

BACKGROUND

Measuring similarity between diseases plays an important role in disease-related molecular function research. Functional associations between disease-related genes and semantic associations between diseases are often used to identify pairs of similar diseases from different perspectives. Currently, it is still a challenge to exploit both of them to calculate disease similarity. Therefore, a new method (SemFunSim) that integrates semantic and functional association is proposed to address the issue.

METHODS

SemFunSim is designed as follows. First of all, FunSim (Functional similarity) is proposed to calculate disease similarity using disease-related gene sets in a weighted network of human gene function. Next, SemSim (Semantic Similarity) is devised to calculate disease similarity using the relationship between two diseases from Disease Ontology. Finally, FunSim and SemSim are integrated to measure disease similarity.

RESULTS

The high average AUC (area under the receiver operating characteristic curve) (96.37%) shows that SemFunSim achieves a high true positive rate and a low false positive rate. 79 of the top 100 pairs of similar diseases identified by SemFunSim are annotated in the Comparative Toxicogenomics Database (CTD) as being targeted by the same therapeutic compounds, while other methods we compared could identify 35 or less such pairs among the top 100. Moreover, when using our method on diseases without annotated compounds in CTD, we could confirm many of our predicted candidate compounds from literature. This indicates that SemFunSim is an effective method for drug repositioning.

摘要

背景

测量疾病之间的相似性在疾病相关分子功能研究中起着重要作用。疾病相关基因之间的功能关联以及疾病之间的语义关联常被用于从不同角度识别相似疾病对。目前,综合利用这两者来计算疾病相似性仍是一项挑战。因此,提出了一种整合语义和功能关联的新方法(SemFunSim)来解决这一问题。

方法

SemFunSim的设计如下。首先,提出FunSim(功能相似性),利用人类基因功能加权网络中的疾病相关基因集来计算疾病相似性。其次,设计SemSim(语义相似性),利用疾病本体中两种疾病之间的关系来计算疾病相似性。最后,将FunSim和SemSim整合起来测量疾病相似性。

结果

较高的平均AUC(受试者工作特征曲线下面积)(96.37%)表明SemFunSim实现了高真阳性率和低假阳性率。SemFunSim识别出的前100对相似疾病中有79对在比较毒理基因组学数据库(CTD)中被注释为受相同治疗化合物靶向,而我们比较的其他方法在前100对中只能识别出35对或更少这样的疾病对。此外,当将我们的方法应用于CTD中没有注释化合物的疾病时,我们可以从文献中确认许多我们预测的候选化合物。这表明SemFunSim是一种有效的药物重新定位方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c64e/4059643/105eb6a1eac2/pone.0099415.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验