• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

概念嵌入用于测量生物医学信息本体的语义相似度。

Concept embedding to measure semantic relatedness for biomedical information ontologies.

机构信息

Department of Bio and Brain Engineering, KAIST, Daejeon, Republic of Korea.

Milner Therapeutics Institute University of Cambridge, Cambridge CB2 1TN, UK.

出版信息

J Biomed Inform. 2019 Jun;94:103182. doi: 10.1016/j.jbi.2019.103182. Epub 2019 Apr 19.

DOI:10.1016/j.jbi.2019.103182
PMID:31009761
Abstract

There have been many attempts to identify relationships among concepts corresponding to terms from biomedical information ontologies such as the Unified Medical Language System (UMLS). In particular, vector representation of such concepts using information from UMLS definition texts is widely used to measure the relatedness between two biological concepts. However, conventional relatedness measures have a limited range of applicable word coverage, which limits the performance of these models. In this paper, we propose a concept-embedding model of a UMLS semantic relatedness measure to overcome the limitations of earlier models. We obtained context texts of biological concepts that are not defined in UMLS by utilizing Wikipedia as an external knowledgebase. Concept vector representations were then derived from the context texts of the biological concepts. The degree of relatedness between two concepts was defined as the cosine similarity between corresponding concept vectors. As a result, we validated that our method provides higher coverage and better performance than the conventional method.

摘要

已经有许多尝试来识别对应于生物医学信息本体论(如统一医学语言系统(UMLS))术语的概念之间的关系。特别是,使用 UMLS 定义文本中的信息来表示这种概念的向量表示形式被广泛用于测量两个生物概念之间的相关性。然而,传统的相关性度量具有有限的适用词覆盖范围,这限制了这些模型的性能。在本文中,我们提出了一种 UMLS 语义相关性度量的概念嵌入模型,以克服早期模型的局限性。我们通过利用维基百科作为外部知识库,获得了 UMLS 中未定义的生物概念的上下文文本。然后从生物概念的上下文文本中推导出概念向量表示。两个概念之间的相关性程度被定义为对应概念向量之间的余弦相似度。结果表明,我们的方法比传统方法具有更高的覆盖率和更好的性能。

相似文献

1
Concept embedding to measure semantic relatedness for biomedical information ontologies.概念嵌入用于测量生物医学信息本体的语义相似度。
J Biomed Inform. 2019 Jun;94:103182. doi: 10.1016/j.jbi.2019.103182. Epub 2019 Apr 19.
2
Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.使用词和图嵌入来衡量统一医学语言系统概念之间的语义相关性。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1538-1546. doi: 10.1093/jamia/ocaa136.
3
A vector-based semantic relatedness measure using multiple relations within SNOMED CT and UMLS.基于向量的语义关联度量方法,利用 SNOMED CT 和 UMLS 中的多种关系。
J Biomed Inform. 2022 Jul;131:104118. doi: 10.1016/j.jbi.2022.104118. Epub 2022 Jun 9.
4
Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness.改造医学概念的向量表示以改进语义相似性和相关性的估计。
Stud Health Technol Inform. 2017;245:657-661.
5
Using ontology-based semantic similarity to facilitate the article screening process for systematic reviews.利用基于本体的语义相似性来促进系统评价的文献筛选过程。
J Biomed Inform. 2017 May;69:33-42. doi: 10.1016/j.jbi.2017.03.007. Epub 2017 Mar 14.
6
Association measures for estimating semantic similarity and relatedness between biomedical concepts.用于估计生物医学概念之间语义相似性和相关性的关联度量。
Artif Intell Med. 2019 Jan;93:1-10. doi: 10.1016/j.artmed.2018.08.006. Epub 2018 Sep 7.
7
Multi-Ontology Refined Embeddings (MORE): A hybrid multi-ontology and corpus-based semantic representation model for biomedical concepts.多本体精炼嵌入模型(MORE):一种基于混合多本体和语料库的生物医学概念语义表示模型。
J Biomed Inform. 2020 Nov;111:103581. doi: 10.1016/j.jbi.2020.103581. Epub 2020 Oct 1.
8
Evaluating semantic relations in neural word embeddings with biomedical and general domain knowledge bases.利用生物医学和一般领域知识库评估神经词汇嵌入中的语义关系。
BMC Med Inform Decis Mak. 2018 Jul 23;18(Suppl 2):65. doi: 10.1186/s12911-018-0630-x.
9
Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study.通过多本体和分数归一化评估中文生物医学术语之间的语义相似性:一项初步研究。
J Biomed Inform. 2016 Dec;64:273-287. doi: 10.1016/j.jbi.2016.10.017. Epub 2016 Nov 1.
10
Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text.评估语义相似性和关联性的度量标准,以消除生物医学文本中的术语歧义。
J Biomed Inform. 2013 Dec;46(6):1116-24. doi: 10.1016/j.jbi.2013.08.008. Epub 2013 Sep 4.

引用本文的文献

1
Large-scale transformer-based topic graphs identify thematic links between engineering and biology.基于大规模变压器的主题图识别工程学与生物学之间的主题联系。
Sci Rep. 2025 Aug 10;15(1):29256. doi: 10.1038/s41598-025-15067-9.
2
Using Structured Codes and Free-Text Notes to Measure Information Complementarity in Electronic Health Records: Feasibility and Validation Study.使用结构化编码和自由文本注释来衡量电子健康记录中的信息互补性:可行性与验证研究。
J Med Internet Res. 2025 Feb 13;27:e66910. doi: 10.2196/66910.
3
NeighBERT: Medical Entity Linking Using Relation-Induced Dense Retrieval.
NeighBERT:使用关系诱导密集检索的医学实体链接
J Healthc Inform Res. 2024 Jan 18;8(2):353-369. doi: 10.1007/s41666-023-00136-3. eCollection 2024 Jun.
4
Using language models and ontology topology to perform semantic mapping of traits between biomedical datasets.利用语言模型和本体拓扑结构对生物医学数据集之间的特征进行语义映射。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad169.
5
Improved biomedical word embeddings in the transformer era.Transformer 时代改进的生物医学词向量。
J Biomed Inform. 2021 Aug;120:103867. doi: 10.1016/j.jbi.2021.103867. Epub 2021 Jul 18.
6
Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.对研究不足的医学概念领域进行自动编码:将身体活动报告与《国际功能、残疾和健康分类》相联系。
Front Digit Health. 2021 Mar;3. doi: 10.3389/fdgth.2021.620828. Epub 2021 Mar 10.
7
Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.使用词和图嵌入来衡量统一医学语言系统概念之间的语义相关性。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1538-1546. doi: 10.1093/jamia/ocaa136.
8
An interactive retrieval system for clinical trial studies with context-dependent protocol elements.具有上下文相关协议元素的临床试验研究的交互式检索系统。
PLoS One. 2020 Sep 18;15(9):e0238290. doi: 10.1371/journal.pone.0238290. eCollection 2020.
9
BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale.生物概念向量:在大规模上创建和评估基于文献的生物医学概念嵌入。
PLoS Comput Biol. 2020 Apr 23;16(4):e1007617. doi: 10.1371/journal.pcbi.1007617. eCollection 2020 Apr.