• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从本体到语义相似度:基于本体的语义相似度计算

From ontology to semantic similarity: calculation of ontology-based semantic similarity.

作者信息

Gan Mingxin, Dou Xue, Jiang Rui

机构信息

Dongling School of Economics and Management, University of Science and Technology Beijing, Beijing 100083, China.

出版信息

ScientificWorldJournal. 2013;2013:793091. doi: 10.1155/2013/793091. Epub 2013 Feb 28.

DOI:10.1155/2013/793091
PMID:23533360
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3603583/
Abstract

Advances in high-throughput experimental techniques in the past decade have enabled the explosive increase of omics data, while effective organization, interpretation, and exchange of these data require standard and controlled vocabularies in the domain of biological and biomedical studies. Ontologies, as abstract description systems for domain-specific knowledge composition, hence receive more and more attention in computational biology and bioinformatics. Particularly, many applications relying on domain ontologies require quantitative measures of relationships between terms in the ontologies, making it indispensable to develop computational methods for the derivation of ontology-based semantic similarity between terms. Nevertheless, with a variety of methods available, how to choose a suitable method for a specific application becomes a problem. With this understanding, we review a majority of existing methods that rely on ontologies to calculate semantic similarity between terms. We classify existing methods into five categories: methods based on semantic distance, methods based on information content, methods based on properties of terms, methods based on ontology hierarchy, and hybrid methods. We summarize characteristics of each category, with emphasis on basic notions, advantages and disadvantages of these methods. Further, we extend our review to software tools implementing these methods and applications using these methods.

摘要

在过去十年中,高通量实验技术的进步使得组学数据呈爆发式增长,而有效组织、解读和交换这些数据需要生物和生物医学研究领域的标准和受控词汇表。本体作为特定领域知识构成的抽象描述系统,因此在计算生物学和生物信息学中受到越来越多的关注。特别是,许多依赖领域本体的应用需要对本体中术语之间的关系进行定量测量,这使得开发用于推导基于本体的术语语义相似度的计算方法变得不可或缺。然而,由于有多种方法可供选择,如何为特定应用选择合适的方法就成了一个问题。基于这种认识,我们回顾了大多数现有的依赖本体来计算术语之间语义相似度的方法。我们将现有方法分为五类:基于语义距离的方法、基于信息内容的方法、基于术语属性的方法、基于本体层次结构的方法和混合方法。我们总结了每一类方法的特点,重点介绍了这些方法的基本概念、优缺点。此外,我们将综述扩展到实现这些方法的软件工具以及使用这些方法的应用。

相似文献

1
From ontology to semantic similarity: calculation of ontology-based semantic similarity.从本体到语义相似度:基于本体的语义相似度计算
ScientificWorldJournal. 2013;2013:793091. doi: 10.1155/2013/793091. Epub 2013 Feb 28.
2
UFO: A tool for unifying biomedical ontology-based semantic similarity calculation, enrichment analysis and visualization.UFO:一种用于统一基于生物医学本体的语义相似性计算、富集分析和可视化的工具。
PLoS One. 2020 Jul 9;15(7):e0235670. doi: 10.1371/journal.pone.0235670. eCollection 2020.
3
A relation based measure of semantic similarity for Gene Ontology annotations.一种基于关系的基因本体注释语义相似度度量方法。
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
4
Correlating information contents of gene ontology terms to infer semantic similarity of gene products.关联基因本体术语的信息内容以推断基因产物的语义相似性。
Comput Math Methods Med. 2014;2014:891842. doi: 10.1155/2014/891842. Epub 2014 May 22.
5
Aggregating the syntactic and semantic similarity of healthcare data towards their transformation to HL7 FHIR through ontology matching.通过本体匹配,聚合医疗保健数据的语法和语义相似性,以将其转换为 HL7 FHIR。
Int J Med Inform. 2019 Dec;132:104002. doi: 10.1016/j.ijmedinf.2019.104002. Epub 2019 Oct 5.
6
Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study.通过多本体和分数归一化评估中文生物医学术语之间的语义相似性:一项初步研究。
J Biomed Inform. 2016 Dec;64:273-287. doi: 10.1016/j.jbi.2016.10.017. Epub 2016 Nov 1.
7
A framework for unifying ontology-based semantic similarity measures: a study in the biomedical domain.基于本体的语义相似性度量的统一框架:在生物医学领域的研究。
J Biomed Inform. 2014 Apr;48:38-53. doi: 10.1016/j.jbi.2013.11.006. Epub 2013 Nov 21.
8
Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses.双向语义相似性在基因本体论中的应用,以优化生物和临床分析。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):765-74. doi: 10.1136/amiajnl-2011-000659. Epub 2012 Feb 28.
9
The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies.语义度量库和工具包:使用生物医学本体快速计算语义相似度和相关性。
Bioinformatics. 2014 Mar 1;30(5):740-2. doi: 10.1093/bioinformatics/btt581. Epub 2013 Oct 9.
10
Semantic similarity in biomedical ontologies.生物医学本体中的语义相似性。
PLoS Comput Biol. 2009 Jul;5(7):e1000443. doi: 10.1371/journal.pcbi.1000443. Epub 2009 Jul 31.

引用本文的文献

1
Pathway Analysis Interpretation in the Multi-Omic Era.多组学时代的通路分析解读
BioTech (Basel). 2025 Jul 29;14(3):58. doi: 10.3390/biotech14030058.
2
Pheno-Ranker: a toolkit for comparison of phenotypic data stored in GA4GH standards and beyond.Pheno-Ranker:用于比较存储在GA4GH标准及其他标准中的表型数据的工具包。
BMC Bioinformatics. 2024 Dec 4;25(1):373. doi: 10.1186/s12859-024-05993-2.
3
DAPNet: multi-view graph contrastive network incorporating disease clinical and molecular associations for disease progression prediction.DAPNet:一种多视图图对比网络,结合疾病临床和分子关联进行疾病进展预测。
BMC Med Inform Decis Mak. 2024 Nov 19;24(1):345. doi: 10.1186/s12911-024-02756-0.
4
Unveiling inter-embryo variability in spindle length over time: Towards quantitative phenotype analysis.揭示胚胎间纺锤体长度随时间的变化:迈向定量表型分析。
PLoS Comput Biol. 2024 Sep 5;20(9):e1012330. doi: 10.1371/journal.pcbi.1012330. eCollection 2024 Sep.
5
Establishing a Common Nutritional Vocabulary - From Food Production to Diet.建立通用营养词汇表——从食品生产到饮食
Front Nutr. 2022 Jun 21;9:928837. doi: 10.3389/fnut.2022.928837. eCollection 2022.
6
Evaluating semantic similarity methods for comparison of text-derived phenotype profiles.评估语义相似性方法以比较基于文本的表型谱。
BMC Med Inform Decis Mak. 2022 Feb 5;22(1):33. doi: 10.1186/s12911-022-01770-4.
7
Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity.否定和不确定性分层对文本衍生患者概况相似性的影响。
Front Digit Health. 2021 Dec 6;3:781227. doi: 10.3389/fdgth.2021.781227. eCollection 2021.
8
MitoPhen database: a human phenotype ontology-based approach to identify mitochondrial DNA diseases.MitoPhen 数据库:一种基于人类表型本体的方法,用于识别线粒体 DNA 疾病。
Nucleic Acids Res. 2021 Sep 27;49(17):9686-9695. doi: 10.1093/nar/gkab726.
9
Towards similarity-based differential diagnostics for common diseases.面向常见疾病的基于相似性的鉴别诊断。
Comput Biol Med. 2021 Jun;133:104360. doi: 10.1016/j.compbiomed.2021.104360. Epub 2021 Apr 1.
10
Integration of anatomy ontology data with protein-protein interaction networks improves the candidate gene prediction accuracy for anatomical entities.解剖学本体数据与蛋白质-蛋白质相互作用网络的整合提高了解剖实体候选基因预测的准确性。
BMC Bioinformatics. 2020 Oct 7;21(1):442. doi: 10.1186/s12859-020-03773-2.

本文引用的文献

1
Gene Expression Correlation and Gene Ontology-Based Similarity: An Assessment of Quantitative Relationships.基因表达相关性与基于基因本体论的相似性:定量关系评估
Proc IEEE Symp Comput Intell Bioinforma Comput Biol. 2004 Oct 7;2004:25-31. doi: 10.1109/CIBCB.2004.1393927.
2
Constructing a gene semantic similarity network for the inference of disease genes.构建用于疾病基因推断的基因语义相似性网络。
BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S2. doi: 10.1186/1752-0509-5-S2-S2. Epub 2011 Dec 14.
3
Computational tools for prioritizing candidate genes: boosting disease gene discovery.计算工具在候选基因优先级排序中的应用:提高疾病基因发现的效率。
Nat Rev Genet. 2012 Jul 3;13(8):523-36. doi: 10.1038/nrg3253.
4
DOSim: an R package for similarity between diseases based on Disease Ontology.DOSim:一个基于疾病本体论的疾病相似性的 R 包。
BMC Bioinformatics. 2011 Jun 29;12:266. doi: 10.1186/1471-2105-12-266.
5
PREDICT: a method for inferring novel drug indications with application to personalized medicine.PREDICT:一种用于推断新药物适应症的方法,适用于个性化医疗。
Mol Syst Biol. 2011 Jun 7;7:496. doi: 10.1038/msb.2011.26.
6
GOSemSim: an R package for measuring semantic similarity among GO terms and gene products.GO 语义相似度分析:用于测量 GO 术语和基因产物之间语义相似性的 R 包。
Bioinformatics. 2010 Apr 1;26(7):976-8. doi: 10.1093/bioinformatics/btq064. Epub 2010 Feb 23.
7
Semantic similarity in biomedical ontologies.生物医学本体中的语义相似性。
PLoS Comput Biol. 2009 Jul;5(7):e1000443. doi: 10.1371/journal.pcbi.1000443. Epub 2009 Jul 31.
8
Genotype-phenotype databases: challenges and solutions for the post-genomic era.基因型-表型数据库:后基因组时代的挑战与解决方案
Nat Rev Genet. 2009 Jan;10(1):9-18. doi: 10.1038/nrg2483.
9
Comparison of ontology-based semantic-similarity measures.基于本体的语义相似性度量比较。
AMIA Annu Symp Proc. 2008 Nov 6;2008:384-8.
10
The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease.人类表型本体论:一种用于注释和分析人类遗传病的工具。
Am J Hum Genet. 2008 Nov;83(5):610-5. doi: 10.1016/j.ajhg.2008.09.017. Epub 2008 Oct 23.