一种大规模本体匹配的有效方法。

An effective method of large scale ontology matching.

作者信息

Diallo Gayo

机构信息

University Bordeaux, ISPED, Centre INSERM U897, F-33000 Bordeaux, France.

出版信息

J Biomed Semantics. 2014 Oct 28;5(1):44. doi: 10.1186/2041-1480-5-44. eCollection 2014.

DOI:10.1186/2041-1480-5-44

PMID:25411633

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4236493/

Abstract

BACKGROUND

We are currently facing a proliferation of heterogeneous biomedical data sources accessible through various knowledge-based applications. These data are annotated by increasingly extensive and widely disseminated knowledge organisation systems ranging from simple terminologies and structured vocabularies to formal ontologies. In order to solve the interoperability issue, which arises due to the heterogeneity of these ontologies, an alignment task is usually performed. However, while significant effort has been made to provide tools that automatically align small ontologies containing hundreds or thousands of entities, little attention has been paid to the matching of large sized ontologies in the life sciences domain.

RESULTS

We have designed and implemented ServOMap, an effective method for large scale ontology matching. It is a fast and efficient high precision system able to perform matching of input ontologies containing hundreds of thousands of entities. The system, which was included in the 2012 and 2013 editions of the Ontology Alignment Evaluation Initiative campaign, performed very well. It was ranked among the top systems for the large ontologies matching.

CONCLUSIONS

We proposed an approach for large scale ontology matching relying on Information Retrieval (IR) techniques and the combination of lexical and machine learning contextual similarity computing for the generation of candidate mappings. It is particularly adapted to the life sciences domain as many of the ontologies in this domain benefit from synonym terms taken from the Unified Medical Language System and that can be used by our IR strategy. The ServOMap system we implemented is able to deal with hundreds of thousands entities with an efficient computation time.

摘要

背景

我们目前面临着大量通过各种基于知识的应用程序可访问的异构生物医学数据源。这些数据由越来越广泛和广泛传播的知识组织系统进行注释，范围从简单的术语和结构化词汇到形式本体。为了解决由于这些本体的异构性而产生的互操作性问题，通常会执行对齐任务。然而，尽管已经做出了巨大努力来提供能够自动对齐包含数百或数千个实体的小型本体的工具，但对于生命科学领域中大型本体的匹配却很少受到关注。

结果

我们设计并实现了ServOMap，一种用于大规模本体匹配的有效方法。它是一个快速高效的高精度系统，能够对包含数十万实体的输入本体进行匹配。该系统被纳入2012年和2013年版的本体对齐评估倡议活动中，表现非常出色。在大型本体匹配方面，它被列为顶级系统之一。

结论

我们提出了一种基于信息检索（IR）技术以及词汇和机器学习上下文相似性计算相结合的大规模本体匹配方法，用于生成候选映射。它特别适用于生命科学领域，因为该领域的许多本体受益于取自统一医学语言系统的同义词，并且可以被我们的IR策略使用。我们实现的ServOMap系统能够在高效的计算时间内处理数十万个实体。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b45a/4236493/5eeafe35b617/13326_2013_189_Fig1_HTML.jpg

相似文献

An effective method of large scale ontology matching.

J Biomed Semantics. 2014 Oct 28;5(1):44. doi: 10.1186/2041-1480-5-44. eCollection 2014.

Matching biomedical ontologies based on formal concept analysis.

J Biomed Semantics. 2018 Mar 19;9(1):11. doi: 10.1186/s13326-018-0178-9.

Matching biomedical ontologies with GCN-based feature propagation.

Math Biosci Eng. 2022 Jun 9;19(8):8479-8504. doi: 10.3934/mbe.2022394.

Aggregating the syntactic and semantic similarity of healthcare data towards their transformation to HL7 FHIR through ontology matching.

Int J Med Inform. 2019 Dec;132:104002. doi: 10.1016/j.ijmedinf.2019.104002. Epub 2019 Oct 5.

Improving the interoperability of biomedical ontologies with compound alignments.

J Biomed Semantics. 2018 Jan 9;9(1):1. doi: 10.1186/s13326-017-0171-8.

Performance assessment of ontology matching systems for FAIR data.

J Biomed Semantics. 2022 Jul 15;13(1):19. doi: 10.1186/s13326-022-00273-5.

Matching disease and phenotype ontologies in the ontology alignment evaluation initiative.

J Biomed Semantics. 2017 Dec 2;8(1):55. doi: 10.1186/s13326-017-0162-9.

Automatic background knowledge selection for matching biomedical ontologies.

PLoS One. 2014 Nov 7;9(11):e111226. doi: 10.1371/journal.pone.0111226. eCollection 2014.

Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.

Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.

Matching Biomedical Ontologies: Construction of Matching Clues and Systematic Evaluation of Different Combinations of Matchers.

JMIR Med Inform. 2021 Aug 19;9(8):e28212. doi: 10.2196/28212.

引用本文的文献

Aligning an interface terminology to the Logical Observation Identifiers Names and Codes (LOINC).

JAMIA Open. 2021 Jun 12;4(2):ooab035. doi: 10.1093/jamiaopen/ooab035. eCollection 2021 Apr.

FTRLIM: Distributed Instance Matching Framework for Large-Scale Knowledge Graph Fusion.

Entropy (Basel). 2021 May 13;23(5):602. doi: 10.3390/e23050602.

Ontological and Non-Ontological Resources for Associating Medical Dictionary for Regulatory Activities Terms to SNOMED Clinical Terms With Semantic Properties.

Front Pharmacol. 2019 Sep 10;10:975. doi: 10.3389/fphar.2019.00975. eCollection 2019.

Matching biomedical ontologies based on formal concept analysis.

J Biomed Semantics. 2018 Mar 19;9(1):11. doi: 10.1186/s13326-018-0178-9.

Tackling the challenges of matching biomedical ontologies.

J Biomed Semantics. 2018 Jan 15;9(1):4. doi: 10.1186/s13326-017-0170-9.

Experiences from the anatomy track in the ontology alignment evaluation initiative.

J Biomed Semantics. 2017 Dec 4;8(1):56. doi: 10.1186/s13326-017-0166-5.

本文引用的文献

OAE: The Ontology of Adverse Events.

J Biomed Semantics. 2014 Jul 5;5:29. doi: 10.1186/2041-1480-5-29. eCollection 2014.

Thematic series on biomedical ontologies in JBMS: challenges and new directions.

J Biomed Semantics. 2014 Mar 6;5:15. doi: 10.1186/2041-1480-5-15. eCollection 2014.

Reuse of termino-ontological resources and text corpora for building a multilingual domain ontology: an application to Alzheimer's disease.

J Biomed Inform. 2014 Apr;48:171-82. doi: 10.1016/j.jbi.2013.12.013. Epub 2013 Dec 29.

The EU-ADR Web Platform: delivering advanced pharmacovigilance tools.

Pharmacoepidemiol Drug Saf. 2013 May;22(5):459-67. doi: 10.1002/pds.3375. Epub 2012 Dec 4.

Using Medical Text Extraction, Reasoning and Mapping System (MTERMS) to process medication information in outpatient clinical notes.

AMIA Annu Symp Proc. 2011;2011:1639-48. Epub 2011 Oct 22.

Mapping Partners Master Drug Dictionary to RxNorm using an NLP-based approach.

J Biomed Inform. 2012 Aug;45(4):626-33. doi: 10.1016/j.jbi.2011.11.006. Epub 2011 Nov 28.

GOMMA: a component-based infrastructure for managing and analyzing life science ontologies and their evolution.

J Biomed Semantics. 2011 Sep 13;2:6. doi: 10.1186/2041-1480-2-6.

BioPortal: ontologies and integrated data resources at the click of a mouse.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W170-3. doi: 10.1093/nar/gkp440. Epub 2009 May 29.

An automated approach to mapping external terminologies to the UMLS.

IEEE Trans Biomed Eng. 2009 Jun;56(6):1598-605. doi: 10.1109/TBME.2009.2015651. Epub 2009 Mar 4.

The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration.

Nat Biotechnol. 2007 Nov;25(11):1251-5. doi: 10.1038/nbt1346.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种大规模本体匹配的有效方法。

An effective method of large scale ontology matching.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献