• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

本体概念之间的析取共享信息:在基因本体中的应用

Disjunctive shared information between ontology concepts: application to Gene Ontology.

作者信息

Couto Francisco M, Silva Mário J

机构信息

Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa, Lisboa, 1749-016, Portugal.

出版信息

J Biomed Semantics. 2011 Aug 31;2:5. doi: 10.1186/2041-1480-2-5.

DOI:10.1186/2041-1480-2-5
PMID:21884591
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3200982/
Abstract

BACKGROUND

The large-scale effort in developing, maintaining and making biomedical ontologies available motivates the application of similarity measures to compare ontology concepts or, by extension, the entities described therein. A common approach, known as semantic similarity, compares ontology concepts through the information content they share in the ontology. However, different disjunctive ancestors in the ontology are frequently neglected, or not properly explored, by semantic similarity measures.

RESULTS

This paper proposes a novel method, dubbed DiShIn, that effectively exploits the multiple inheritance relationships present in many biomedical ontologies. DiShIn calculates the shared information content of two ontology concepts, based on the information content of the disjunctive common ancestors of the concepts being compared. DiShIn identifies these disjunctive ancestors through the number of distinct paths from the concepts to their common ancestors.

CONCLUSIONS

DiShIn was applied to Gene Ontology and its performance was evaluated against state-of-the-art measures using CESSM, a publicly available evaluation platform of protein similarity measures. By modifying the way traditional semantic similarity measures calculate the shared information content, DiShIn was able to obtain a statistically significant higher correlation between semantic and sequence similarity. Moreover, the incorporation of DiShIn in existing applications that exploit multiple inheritance would reduce their execution time.

摘要

背景

开发、维护和提供生物医学本体的大规模努力促使应用相似性度量来比较本体概念,或者进而比较其中描述的实体。一种常见的方法,即语义相似性,通过本体中共享的信息内容来比较本体概念。然而,本体中不同的析取祖先经常被语义相似性度量忽略或未得到恰当探究。

结果

本文提出了一种名为DiShIn的新方法,该方法有效地利用了许多生物医学本体中存在的多重继承关系。DiShIn基于被比较概念的析取共同祖先的信息内容来计算两个本体概念的共享信息内容。DiShIn通过从概念到其共同祖先的不同路径数量来识别这些析取祖先。

结论

DiShIn被应用于基因本体,并使用CESSM(一个公开可用的蛋白质相似性度量评估平台)与现有最先进的度量方法进行性能评估。通过修改传统语义相似性度量计算共享信息内容的方式,DiShIn能够在语义相似性和序列相似性之间获得统计学上显著更高的相关性。此外,将DiShIn纳入利用多重继承的现有应用中会减少其执行时间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/213e68963515/2041-1480-2-5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/5e67c5fd8992/2041-1480-2-5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/59ff49266134/2041-1480-2-5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/1a420e7c5a8a/2041-1480-2-5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/213e68963515/2041-1480-2-5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/5e67c5fd8992/2041-1480-2-5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/59ff49266134/2041-1480-2-5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/1a420e7c5a8a/2041-1480-2-5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9eb/3200982/213e68963515/2041-1480-2-5-4.jpg

相似文献

1
Disjunctive shared information between ontology concepts: application to Gene Ontology.本体概念之间的析取共享信息:在基因本体中的应用
J Biomed Semantics. 2011 Aug 31;2:5. doi: 10.1186/2041-1480-2-5.
2
Semantic similarity measurement between gene ontology terms based on exclusively inherited shared information.基于完全继承的共享信息的基因本体术语之间的语义相似性度量
Gene. 2015 Mar 1;558(1):108-17. doi: 10.1016/j.gene.2014.12.062. Epub 2014 Dec 28.
3
HESML: a real-time semantic measures library for the biomedical domain with a reproducible survey.HESML:生物医学领域的实时语义度量库,附有可重现的调查。
BMC Bioinformatics. 2022 Jan 6;23(1):23. doi: 10.1186/s12859-021-04539-0.
4
The effects of shared information on semantic calculations in the gene ontology.共享信息对基因本体中语义计算的影响。
Comput Struct Biotechnol J. 2017 Jan 30;15:195-211. doi: 10.1016/j.csbj.2017.01.009. eCollection 2017.
5
UFO: A tool for unifying biomedical ontology-based semantic similarity calculation, enrichment analysis and visualization.UFO:一种用于统一基于生物医学本体的语义相似性计算、富集分析和可视化的工具。
PLoS One. 2020 Jul 9;15(7):e0235670. doi: 10.1371/journal.pone.0235670. eCollection 2020.
6
Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study.通过多本体和分数归一化评估中文生物医学术语之间的语义相似性:一项初步研究。
J Biomed Inform. 2016 Dec;64:273-287. doi: 10.1016/j.jbi.2016.10.017. Epub 2016 Nov 1.
7
A relation based measure of semantic similarity for Gene Ontology annotations.一种基于关系的基因本体注释语义相似度度量方法。
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
8
A novel view on information content of concepts in a large ontology and a view on the structure and the quality of the ontology.关于大型本体中概念的信息内容的新观点以及关于本体的结构和质量的观点。
Int J Med Inform. 2005 Mar;74(2-4):125-32. doi: 10.1016/j.ijmedinf.2004.03.009.
9
Exploring information from the topology beneath the Gene Ontology terms to improve semantic similarity measures.探索基因本体术语之下的拓扑结构中的信息以改进语义相似性度量。
Gene. 2016 Jul 15;586(1):148-57. doi: 10.1016/j.gene.2016.04.024. Epub 2016 Apr 12.
10
Towards Semantic Sensor Data: An Ontology Approach.迈向语义传感器数据:本体论方法。
Sensors (Basel). 2019 Mar 8;19(5):1193. doi: 10.3390/s19051193.

引用本文的文献

1
Assessment of GO-Based Protein Interaction Affinities in the Large-Scale Human-Coronavirus Family Interactome.基于基因本体论的大规模人类冠状病毒家族相互作用组中蛋白质相互作用亲和力评估。
Vaccines (Basel). 2023 Feb 25;11(3):549. doi: 10.3390/vaccines11030549.
2
HiG2Vec: hierarchical representations of Gene Ontology and genes in the Poincaré ball.HiG2Vec:庞加莱球中基因本体论和基因的层次表示。
Bioinformatics. 2021 Sep 29;37(18):2971-2980. doi: 10.1093/bioinformatics/btab193.
3
STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity.

本文引用的文献

1
A shortest-path graph kernel for estimating gene product semantic similarity.一种用于估计基因产物语义相似性的最短路径图核。
J Biomed Semantics. 2011 Jul 29;2:3. doi: 10.1186/2041-1480-2-3.
2
Biomedical semantics in the Semantic Web.语义网中的生物医学语义学。
J Biomed Semantics. 2011 Mar 7;2 Suppl 1(Suppl 1):S1. doi: 10.1186/2041-1480-2-S1-S1.
3
Semantic similarity for automatic classification of chemical compounds.化合物自动分类的语义相似度。
STS-NLSP:一种基于网络的标签空间划分方法,用于利用结构和语义相似性的混合特征预测膜转运体底物的特异性
Front Bioeng Biotechnol. 2019 Nov 6;7:306. doi: 10.3389/fbioe.2019.00306. eCollection 2019.
4
Handling Big Data Scalability in Biological Domain Using Parallel and Distributed Processing: A Case of Three Biological Semantic Similarity Measures.使用并行和分布式处理处理生物领域的大数据可扩展性:三个生物语义相似性度量的案例。
Biomed Res Int. 2019 Jan 27;2019:6750296. doi: 10.1155/2019/6750296. eCollection 2019.
5
Identifying disease genes using machine learning and gene functional similarities, assessed through Gene Ontology.利用机器学习和基因功能相似性(通过基因本体论评估)识别疾病基因。
PLoS One. 2018 Dec 10;13(12):e0208626. doi: 10.1371/journal.pone.0208626. eCollection 2018.
6
GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms.GO-GO:一种改进的基因本体术语间语义相似度测量算法。
Sci Rep. 2018 Oct 10;8(1):15107. doi: 10.1038/s41598-018-33219-y.
7
Gene Ontology Enrichment Improves Performances of Functional Similarity of Genes.基因本体论富集提高了基因功能相似性的性能。
Sci Rep. 2018 Aug 14;8(1):12100. doi: 10.1038/s41598-018-30455-0.
8
Review of computational methods for virus-host protein interaction prediction: a case study on novel Ebola-human interactions.病毒-宿主蛋白相互作用预测的计算方法综述:以新型埃博拉病毒-人类相互作用为例
Brief Funct Genomics. 2018 Nov 26;17(6):381-391. doi: 10.1093/bfgp/elx026.
9
The effects of shared information on semantic calculations in the gene ontology.共享信息对基因本体中语义计算的影响。
Comput Struct Biotechnol J. 2017 Jan 30;15:195-211. doi: 10.1016/j.csbj.2017.01.009. eCollection 2017.
10
TopoICSim: a new semantic similarity measure based on gene ontology.TopoICSim:一种基于基因本体论的新语义相似性度量方法。
BMC Bioinformatics. 2016 Jul 29;17(1):296. doi: 10.1186/s12859-016-1160-0.
PLoS Comput Biol. 2010 Sep 23;6(9):e1000937. doi: 10.1371/journal.pcbi.1000937.
4
UMLS-Interface and UMLS-Similarity : open source software for measuring paths and semantic similarity.统一医学语言系统接口与统一医学语言系统相似度:用于测量路径和语义相似度的开源软件。
AMIA Annu Symp Proc. 2009 Nov 14;2009:431-5.
5
Linking human diseases to animal models using ontology-based phenotype annotation.利用基于本体的表型注释将人类疾病与动物模型联系起来。
PLoS Biol. 2009 Nov;7(11):e1000247. doi: 10.1371/journal.pbio.1000247. Epub 2009 Nov 24.
6
The Pfam protein families database.Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22. doi: 10.1093/nar/gkp985. Epub 2009 Nov 17.
7
Clinical diagnostics in human genetics with semantic similarity searches in ontologies.基于本体中语义相似性搜索的人类遗传学临床诊断
Am J Hum Genet. 2009 Oct;85(4):457-64. doi: 10.1016/j.ajhg.2009.09.003.
8
Semantic similarity in biomedical ontologies.生物医学本体中的语义相似性。
PLoS Comput Biol. 2009 Jul;5(7):e1000443. doi: 10.1371/journal.pcbi.1000443. Epub 2009 Jul 31.
9
Fast gene ontology based clustering for microarray experiments.基于快速基因本体的微阵列实验聚类。
BioData Min. 2008 Nov 21;1(1):11. doi: 10.1186/1756-0381-1-11.
10
Predicting protein function from sequence and structure.从序列和结构预测蛋白质功能。
Nat Rev Mol Cell Biol. 2007 Dec;8(12):995-1005. doi: 10.1038/nrm2281.