• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness.改造医学概念的向量表示以改进语义相似性和相关性的估计。
Stud Health Technol Inform. 2017;245:657-661.
2
Concept embedding to measure semantic relatedness for biomedical information ontologies.概念嵌入用于测量生物医学信息本体的语义相似度。
J Biomed Inform. 2019 Jun;94:103182. doi: 10.1016/j.jbi.2019.103182. Epub 2019 Apr 19.
3
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
4
Corpus domain effects on distributional semantic modeling of medical terms.语料库领域对医学术语分布语义建模的影响。
Bioinformatics. 2016 Dec 1;32(23):3635-3644. doi: 10.1093/bioinformatics/btw529. Epub 2016 Aug 16.
5
tESA: a distributional measure for calculating semantic relatedness.tESA:一种用于计算语义相关性的分布度量。
J Biomed Semantics. 2016 Dec 28;7(1):67. doi: 10.1186/s13326-016-0109-6.
6
Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.使用词和图嵌入来衡量统一医学语言系统概念之间的语义相关性。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1538-1546. doi: 10.1093/jamia/ocaa136.
7
Can Unified Medical Language System-based semantic representation improve automated identification of patient safety incident reports by type and severity?基于统一医学语言系统的语义表示能否提高自动识别患者安全事件报告的类型和严重程度的能力?
J Am Med Inform Assoc. 2020 Oct 1;27(10):1502-1509. doi: 10.1093/jamia/ocaa082.
8
Association measures for estimating semantic similarity and relatedness between biomedical concepts.用于估计生物医学概念之间语义相似性和相关性的关联度量。
Artif Intell Med. 2019 Jan;93:1-10. doi: 10.1016/j.artmed.2018.08.006. Epub 2018 Sep 7.
9
Using distributional analysis to semantically classify UMLS concepts.使用分布分析对统一医学语言系统(UMLS)概念进行语义分类。
Stud Health Technol Inform. 2007;129(Pt 1):519-23.
10
Semantic similarity in the biomedical domain: an evaluation across knowledge sources.生物医学领域的语义相似度:跨知识源的评估。
BMC Bioinformatics. 2012 Oct 10;13:261. doi: 10.1186/1471-2105-13-261.

引用本文的文献

1
Improved biomedical word embeddings in the transformer era.Transformer 时代改进的生物医学词向量。
J Biomed Inform. 2021 Aug;120:103867. doi: 10.1016/j.jbi.2021.103867. Epub 2021 Jul 18.
2
Augmenting aer2vec: Enriching distributed representations of adverse event report data with orthographic and lexical information.增强 aer2vec:利用正字法和词汇信息丰富不良事件报告数据的分布式表示。
J Biomed Inform. 2021 Jul;119:103833. doi: 10.1016/j.jbi.2021.103833. Epub 2021 Jun 8.
3
Retrofitting Vector Representations of Adverse Event Reporting Data to Structured Knowledge to Improve Pharmacovigilance Signal Detection.将不良事件报告数据的向量表示形式改造为结构化知识以改进药物警戒信号检测。
AMIA Annu Symp Proc. 2021 Jan 25;2020:383-392. eCollection 2020.
4
Better synonyms for enriching biomedical search.更好的生物医学搜索丰富化的同义词。
J Am Med Inform Assoc. 2020 Dec 9;27(12):1894-1902. doi: 10.1093/jamia/ocaa151.
5
Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.使用词和图嵌入来衡量统一医学语言系统概念之间的语义相关性。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1538-1546. doi: 10.1093/jamia/ocaa136.
6
BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale.生物概念向量:在大规模上创建和评估基于文献的生物医学概念嵌入。
PLoS Comput Biol. 2020 Apr 23;16(4):e1007617. doi: 10.1371/journal.pcbi.1007617. eCollection 2020 Apr.
7
Discovering Clinical Information Models Online to Promote Interoperability of Electronic Health Records: A Feasibility Study of OpenEHR.在线发现临床信息模型以促进电子健康记录的互操作性:OpenEHR的可行性研究
J Med Internet Res. 2019 May 28;21(5):e13504. doi: 10.2196/13504.

本文引用的文献

1
Corpus domain effects on distributional semantic modeling of medical terms.语料库领域对医学术语分布语义建模的影响。
Bioinformatics. 2016 Dec 1;32(23):3635-3644. doi: 10.1093/bioinformatics/btw529. Epub 2016 Aug 16.
2
Semantic similarity in the biomedical domain: an evaluation across knowledge sources.生物医学领域的语义相似度:跨知识源的评估。
BMC Bioinformatics. 2012 Oct 10;13:261. doi: 10.1186/1471-2105-13-261.
3
Semantic Similarity and Relatedness between Clinical Terms: An Experimental Study.临床术语之间的语义相似性和相关性:一项实验研究。
AMIA Annu Symp Proc. 2010 Nov 13;2010:572-6.
4
Towards a framework for developing semantic relatedness reference standards.迈向开发语义关联参照标准的框架。
J Biomed Inform. 2011 Apr;44(2):251-65. doi: 10.1016/j.jbi.2010.10.004. Epub 2010 Oct 31.
5
Can literature analysis identify innovation drivers in drug discovery?文献分析能否识别药物研发中的创新驱动因素?
Nat Rev Drug Discov. 2009 Nov;8(11):865-78. doi: 10.1038/nrd2973.
6
Empirical distributional semantics: methods and biomedical applications.实证分布语义学:方法与生物医学应用
J Biomed Inform. 2009 Apr;42(2):390-405. doi: 10.1016/j.jbi.2009.02.002. Epub 2009 Feb 14.
7
A document clustering and ranking system for exploring MEDLINE citations.一种用于探索MEDLINE引文的文档聚类和排序系统。
J Am Med Inform Assoc. 2007 Sep-Oct;14(5):651-61. doi: 10.1197/jamia.M2215. Epub 2007 Jun 28.
8
Measures of semantic similarity and relatedness in the biomedical domain.生物医学领域中语义相似性和相关性的度量。
J Biomed Inform. 2007 Jun;40(3):288-99. doi: 10.1016/j.jbi.2006.06.004. Epub 2006 Jun 10.
9
Aligning knowledge sources in the UMLS: methods, quantitative results, and applications.统一医学语言系统中的知识源:方法、定量结果及应用
Stud Health Technol Inform. 2004;107(Pt 1):327-31.
10
Towards the development of a conceptual distance metric for the UMLS.迈向开发用于统一医学语言系统(UMLS)的概念距离度量标准。
J Biomed Inform. 2004 Apr;37(2):77-85. doi: 10.1016/j.jbi.2004.02.001.

改造医学概念的向量表示以改进语义相似性和相关性的估计。

Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness.

作者信息

Yu Zhiguo, Wallace Byron C, Johnson Todd, Cohen Trevor

机构信息

The University of Texas School of Biomedical Informatics at Houston, Houston, Texas, USA.

College of Computer and Information Science, Northeastern University, Boston, Massachusetts, USA.

出版信息

Stud Health Technol Inform. 2017;245:657-661.

PMID:29295178
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6464117/
Abstract

Estimation of semantic similarity and relatedness between biomedical concepts has utility for many informatics applications. Automated methods fall into two categories: methods based on distributional statistics drawn from text corpora, and methods using the structure of existing knowledge resources. Methods in the former category disregard taxonomic structure, while those in the latter fail to consider semantically relevant empirical information. In this paper, we present a method that retrofits distributional context vector representations of biomedical concepts using structural information from the UMLS Metathesaurus, such that the similarity between vector representations of linked concepts is augmented. We evaluated it on the UMNSRS benchmark. Our results demonstrate that retrofitting of concept vector representations leads to better correlation with human raters for both similarity and relatedness, surpassing the best results reported to date. They also demonstrate a clear improvement in performance on this reference standard for retrofitted vector representations, as compared to those without retrofitting.

摘要

生物医学概念之间语义相似性和相关性的评估对许多信息学应用都有用处。自动化方法可分为两类:基于从文本语料库中提取的分布统计信息的方法,以及使用现有知识资源结构的方法。前一类方法忽略分类结构,而后一类方法则未能考虑语义相关的经验信息。在本文中,我们提出了一种方法,该方法利用来自UMLS元词表的结构信息对生物医学概念的分布上下文向量表示进行改造,从而增强链接概念的向量表示之间的相似性。我们在UMNSRS基准上对其进行了评估。我们的结果表明,概念向量表示的改造在相似性和相关性方面都能与人类评分者有更好的相关性,超过了迄今为止报道的最佳结果。与未改造的向量表示相比,它们还表明改造后的向量表示在该参考标准上的性能有明显提高。