• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别统一医学语言系统语义关系的半监督学习

Semi-Supervised Learning to Identify UMLS Semantic Relations.

作者信息

Luo Yuan, Uzuner Ozlem

机构信息

Massachueets Institute of Technology.

Massachueets Institute of Technology ; State University of New York at Albany.

出版信息

AMIA Jt Summits Transl Sci Proc. 2014 Apr 7;2014:67-75. eCollection 2014.

PMID:25954580
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4419772/
Abstract

The UMLS Semantic Network is constructed by experts and requires periodic expert review to update. We propose and implement a semi-supervised approach for automatically identifying UMLS semantic relations from narrative text in PubMed. Our method analyzes biomedical narrative text to collect semantic entity pairs, and extracts multiple semantic, syntactic and orthographic features for the collected pairs. We experiment with seeded k-means clustering with various distance metrics. We create and annotate a ground truth corpus according to the top two levels of the UMLS semantic relation hierarchy. We evaluate our system on this corpus and characterize the learning curves of different clustering configuration. Using KL divergence consistently performs the best on the held-out test data. With full seeding, we obtain macro-averaged F-measures above 70% for clustering the top level UMLS relations (2-way), and above 50% for clustering the second level relations (7-way).

摘要

统一医学语言系统(UMLS)语义网络由专家构建,需要定期进行专家审查以更新。我们提出并实施了一种半监督方法,用于从PubMed中的叙述文本中自动识别UMLS语义关系。我们的方法分析生物医学叙述文本以收集语义实体对,并为收集到的实体对提取多种语义、句法和正字法特征。我们使用各种距离度量对种子k均值聚类进行实验。我们根据UMLS语义关系层次结构的前两个级别创建并注释了一个真值语料库。我们在这个语料库上评估我们的系统,并刻画不同聚类配置的学习曲线。使用KL散度在留出的测试数据上始终表现最佳。在完全播种的情况下,对于顶级UMLS关系(二元)聚类,我们获得的宏平均F值超过70%,对于二级关系(七元)聚类,该值超过50%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/048836b0b7a4/1861093f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/577bfafa9c42/1861093f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/220e9d9ec3fd/1861093f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/e3ec8de9c9a4/1861093f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/a64f6cad2990/1861093f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/048836b0b7a4/1861093f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/577bfafa9c42/1861093f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/220e9d9ec3fd/1861093f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/e3ec8de9c9a4/1861093f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/a64f6cad2990/1861093f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27f1/4419772/048836b0b7a4/1861093f5.jpg

相似文献

1
Semi-Supervised Learning to Identify UMLS Semantic Relations.用于识别统一医学语言系统语义关系的半监督学习
AMIA Jt Summits Transl Sci Proc. 2014 Apr 7;2014:67-75. eCollection 2014.
2
In the pursuit of a semantic similarity metric based on UMLS annotations for articles in PubMed Central Open Access.在为美国国立医学图书馆医学主题词表(UMLS)注释的基于PubMed Central开放获取文章的语义相似性度量标准的研究中。
J Biomed Inform. 2015 Oct;57:204-18. doi: 10.1016/j.jbi.2015.07.015. Epub 2015 Aug 1.
3
Auditing the Assignments of Top-Level Semantic Types in the UMLS Semantic Network to UMLS Concepts.审核统一医学语言系统(UMLS)语义网络中顶级语义类型到UMLS概念的分配情况。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2017 Nov;2017:1262-1269. doi: 10.1109/BIBM.2017.8217840. Epub 2017 Dec 18.
4
Dynamic categorization of clinical research eligibility criteria by hierarchical clustering.基于层次聚类的临床研究入选标准的动态分类。
J Biomed Inform. 2011 Dec;44(6):927-35. doi: 10.1016/j.jbi.2011.06.001. Epub 2011 Jun 12.
5
Quality Assurance of UMLS Semantic Type Assignments Using SNOMED CT Hierarchies.使用SNOMED CT层次结构对统一医学语言系统语义类型分配进行质量保证
Methods Inf Med. 2016;55(2):158-65. doi: 10.3414/ME14-01-0104. Epub 2015 Apr 30.
6
Aligning biomedical ontologies using lexical methods and the UMLS: the case of disease ontologies.使用词汇方法和统一医学语言系统对齐生物医学本体:以疾病本体为例。
Stud Health Technol Inform. 2006;124:781-6.
7
Identification of the Best Semantic Expansion to Query PubMed Through Automatic Performance Assessment of Four Search Strategies on All Medical Subject Heading Descriptors: Comparative Study.通过对所有医学主题词描述符的四种检索策略进行自动性能评估来确定查询PubMed的最佳语义扩展:比较研究
JMIR Med Inform. 2020 Jun 4;8(6):e12799. doi: 10.2196/12799.
8
A method exploiting syntactic patterns and the UMLS semantics for aligning biomedical ontologies: the case of OBO disease ontologies.一种利用句法模式和统一医学语言系统语义来对齐生物医学本体的方法:以OBO疾病本体为例。
Int J Med Inform. 2007 Dec;76 Suppl 3:S353-61. doi: 10.1016/j.ijmedinf.2007.03.004. Epub 2007 May 22.
9
Validating UMLS Semantic Type Assignments Using SNOMED CT Semantic Tags.使用SNOMED CT语义标签验证统一医学语言系统(UMLS)语义类型分配
Methods Inf Med. 2018 Feb;57(1):43-53. doi: 10.3414/ME17-01-0120. Epub 2018 Apr 5.
10
Consistency across the hierarchies of the UMLS Semantic Network and Metathesaurus.美国国立医学图书馆医学主题词表语义网络和元词表各层次之间的一致性。
J Biomed Inform. 2003 Dec;36(6):450-61. doi: 10.1016/j.jbi.2003.11.001.

引用本文的文献

1
Enhancing Medical Image Retrieval with UMLS-Integrated CNN-Based Text Indexing.利用基于统一医学语言系统(UMLS)集成的卷积神经网络(CNN)文本索引增强医学图像检索
Diagnostics (Basel). 2024 Jun 6;14(11):1204. doi: 10.3390/diagnostics14111204.
2
The impact of learning Unified Medical Language System knowledge embeddings in relation extraction from biomedical texts.学习统一医学语言系统知识嵌入对生物医学文本中关系抽取的影响。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1556-1567. doi: 10.1093/jamia/ocaa205.
3
Natural Language Processing for EHR-Based Computational Phenotyping.

本文引用的文献

1
Empirical distributional semantics: methods and biomedical applications.实证分布语义学:方法与生物医学应用
J Biomed Inform. 2009 Apr;42(2):390-405. doi: 10.1016/j.jbi.2009.02.002. Epub 2009 Feb 14.
2
An upper-level ontology for the biomedical domain.一个用于生物医学领域的上层本体。
Comp Funct Genomics. 2003;4(1):80-4. doi: 10.1002/cfg.255.
3
Exploiting semantic relations for literature-based discovery.利用语义关系进行基于文献的发现。
基于电子健康记录的自然语言处理计算表型。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jan-Feb;16(1):139-153. doi: 10.1109/TCBB.2018.2849968. Epub 2018 Jun 25.
4
Efficient Queries of Stand-off Annotations for Natural Language Processing on Electronic Medical Records.电子病历中自然语言处理的远距离标注高效查询
Biomed Inform Insights. 2016 Jul 19;8:29-38. doi: 10.4137/BII.S38916. eCollection 2016.
5
Bridging semantics and syntax with graph algorithms-state-of-the-art of extracting biomedical relations.用图算法弥合语义与句法——提取生物医学关系的研究现状
Brief Bioinform. 2017 Jan;18(1):160-178. doi: 10.1093/bib/bbw001. Epub 2016 Feb 5.
AMIA Annu Symp Proc. 2006;2006:349-53.
4
AliBaba: PubMed as a graph.阿里巴巴:作为图的PubMed。
Bioinformatics. 2006 Oct 1;22(19):2444-5. doi: 10.1093/bioinformatics/btl408. Epub 2006 Jul 26.
5
Using co-occurrence network structure to extract synonymous gene and protein names from MEDLINE abstracts.利用共现网络结构从MEDLINE摘要中提取同义基因和蛋白质名称。
BMC Bioinformatics. 2005 Apr 22;6:103. doi: 10.1186/1471-2105-6-103.
6
The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.自然语言处理中领域知识与语言结构的相互作用:解读生物医学文本中的上位命题
J Biomed Inform. 2003 Dec;36(6):462-77. doi: 10.1016/j.jbi.2003.11.003.
7
Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.生物医学文本到UMLS元词表的有效映射:MetaMap程序
Proc AMIA Symp. 2001:17-21.