Finding UMLS Metathesaurus concepts in MEDLINE.

Authors

Srinivasan Suresh, Rindflesch Thomas C, Hole William T, Aronson Alan R, Mork James G

Affiliation

National Library of Medicine, Bethesda, MD, USA.

Publication

Proc AMIA Symp. 2002:727-31.

Abstract

The entire collection of 11.5 million MEDLINE abstracts was processed to extract 549 million noun phrases using a shallow syntactic parser. English language strings in the 2002 and 2001 releases of the UMLS Metathesaurus were then matched against these phrases using flexible matching techniques. 34% of the Metathesaurus names (occurring in 30% of the concepts) were found in the titles and abstracts of articles in the literature. The matching concepts are fairly evenly chemical and non-chemical in nature and span a wide spectrum of semantic types. This paper details the approach taken and the results of the analysis.
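
The matching step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the "flexible matching techniques", so this example assumes a simple normalization (lowercasing, dropping punctuation, ignoring word order). The function names, the concept identifiers, and the sample data are all hypothetical.

```python
import re
from collections import defaultdict


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and sort the words (assumed normalization)."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return " ".join(sorted(words))


def build_index(concept_names: dict[str, list[str]]) -> dict[str, set[str]]:
    """Map each normalized concept name to the concept identifiers that carry it."""
    index: dict[str, set[str]] = defaultdict(set)
    for cui, names in concept_names.items():
        for name in names:
            index[normalize(name)].add(cui)
    return index


def match_phrases(phrases: list[str], index: dict[str, set[str]]) -> dict[str, set[str]]:
    """Return the noun phrases whose normalized form equals a normalized concept name."""
    found = {}
    for phrase in phrases:
        cuis = index.get(normalize(phrase))
        if cuis:
            found[phrase] = set(cuis)
    return found


def coverage(concept_names: dict[str, list[str]], phrases: list[str]) -> tuple[float, float]:
    """Fraction of names found in the phrases, and fraction of concepts with a found name."""
    phrase_keys = {normalize(p) for p in phrases}
    total_names = 0
    found_names = 0
    found_concepts = set()
    for cui, names in concept_names.items():
        for name in names:
            total_names += 1
            if normalize(name) in phrase_keys:
                found_names += 1
                found_concepts.add(cui)
    return found_names / total_names, len(found_concepts) / len(concept_names)


# Hypothetical sample data: placeholder identifiers, not real Metathesaurus entries.
concept_names = {
    "C0000001": ["Heart Attack", "Infarction, Myocardial"],
    "C0000002": ["Aspirin"],
}
# Hypothetical noun phrases, standing in for parser output.
phrases = ["myocardial infarction", "aspirin", "randomized trial"]

index = build_index(concept_names)
matches = match_phrases(phrases, index)
name_fraction, concept_fraction = coverage(concept_names, phrases)
```

The two values returned by `coverage` correspond to the two kinds of figure the abstract reports: the share of names found and the share of concepts that have at least one name found. A concept counts as found when any one of its names matches, so the two shares generally differ.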
