Suppr超能文献

从临床角度分析多义词概念:在统一医学语言系统(UMLS)中审核概念分类的应用

Analyzing polysemous concepts from a clinical perspective: application to auditing concept categorization in the UMLS.

作者信息

Mougin Fleur, Bodenreider Olivier, Burgun Anita

机构信息

LESIM, INSERM U593, ISPED, University of Bordeaux 2, France.

出版信息

J Biomed Inform. 2009 Jun;42(3):440-51. doi: 10.1016/j.jbi.2009.03.008. Epub 2009 Mar 18.

Abstract

OBJECTIVES

Polysemy is a frequent issue in biomedical terminologies. In the Unified Medical Language System (UMLS), polysemous terms are either represented as several independent concepts, or clustered into a single, multiply-categorized concept. The objective of this study is to analyze polysemous concepts in the UMLS through their categorization and hierarchical relations for auditing purposes.

METHODS

We used the association of a concept with multiple Semantic Groups (SGs) as a surrogate for polysemy. We first extracted multi-SG (MSG) concepts from the UMLS Metathesaurus and characterized them in terms of the combinations of SGs with which they are associated. We then clustered MSG concepts in order to identify major types of polysemy. We also analyzed the inheritance of SGs in MSG concepts. Finally, we manually reviewed the categorization of the MSG concepts for auditing purposes.

RESULTS

The 1208 MSG concepts in the Metathesaurus are associated with 30 distinct pairs of SGs. We created 75 semantically homogeneous clusters of MSG concepts, and 276 MSG concepts could not be clustered for lack of hierarchical relations. The clusters were characterized by the most frequent pairs of semantic types of their constituent MSG concepts. MSG concepts exhibit limited semantic compatibility with their parent and child concepts. A large majority of MSG concepts (92%) are adequately categorized. Examples of miscategorized concepts are presented.

CONCLUSION

This work is a systematic analysis and manual review of all concepts categorized by multiple SGs in the UMLS. The correctly-categorized MSG concepts do reflect polysemy in the UMLS Metathesaurus. The analysis of inheritance of SGs proved useful for auditing concept categorization in the UMLS.

摘要

目的

多义性是生物医学术语中常见的问题。在统一医学语言系统(UMLS)中,多义词要么表示为几个独立的概念,要么聚类为一个单一的、多重分类的概念。本研究的目的是通过多义词的分类和层次关系分析UMLS中的多义概念,以进行审核。

方法

我们将一个概念与多个语义组(SGs)的关联用作多义性的替代指标。我们首先从UMLS元词表中提取多语义组(MSG)概念,并根据与之相关联的SGs组合对其进行特征描述。然后我们对MSG概念进行聚类,以识别多义性的主要类型。我们还分析了MSG概念中SGs的继承情况。最后,我们手动审核MSG概念的分类以进行审核。

结果

元词表中的1208个MSG概念与30对不同的SGs相关联。我们创建了75个语义同质的MSG概念簇,276个MSG概念因缺乏层次关系而无法聚类。这些簇由其组成MSG概念最常见的语义类型对来表征。MSG概念与其父概念和子概念的语义兼容性有限。绝大多数MSG概念(92%)分类恰当。文中给出了分类错误概念的示例。

结论

这项工作是对UMLS中由多个SGs分类的所有概念进行系统分析和人工审核。正确分类的MSG概念确实反映了UMLS元词表中的多义性。对SGs继承情况的分析被证明有助于审核UMLS中的概念分类。

相似文献

1
Analyzing polysemous concepts from a clinical perspective: application to auditing concept categorization in the UMLS.
J Biomed Inform. 2009 Jun;42(3):440-51. doi: 10.1016/j.jbi.2009.03.008. Epub 2009 Mar 18.
3
Structural group-based auditing of missing hierarchical relationships in UMLS.
J Biomed Inform. 2009 Jun;42(3):452-67. doi: 10.1016/j.jbi.2008.08.006. Epub 2008 Aug 20.
4
Auditing concept categorizations in the UMLS.
Artif Intell Med. 2004 May;31(1):29-44. doi: 10.1016/j.artmed.2004.02.002.
5
Auditing associative relations across two knowledge sources.
J Biomed Inform. 2009 Jun;42(3):426-39. doi: 10.1016/j.jbi.2009.01.004.
6
Auditing the multiply-related concepts within the UMLS.
J Am Med Inform Assoc. 2014 Oct;21(e2):e185-93. doi: 10.1136/amiajnl-2013-002227. Epub 2014 Jan 24.
7
Quality Assurance of UMLS Semantic Type Assignments Using SNOMED CT Hierarchies.
Methods Inf Med. 2016;55(2):158-65. doi: 10.3414/ME14-01-0104. Epub 2015 Apr 30.
8
A review of auditing techniques for the Unified Medical Language System.
J Am Med Inform Assoc. 2020 Oct 1;27(10):1625-1638. doi: 10.1093/jamia/ocaa108.
9
Consistency across the hierarchies of the UMLS Semantic Network and Metathesaurus.
J Biomed Inform. 2003 Dec;36(6):450-61. doi: 10.1016/j.jbi.2003.11.001.

引用本文的文献

1
Context-Enriched Learning Models for Aligning Biomedical Vocabularies at Scale in the UMLS Metathesaurus.
Proc Int World Wide Web Conf. 2022 Apr;2022:1037-1046. doi: 10.1145/3485447.3511946. Epub 2022 Apr 25.
2
Biomedical Vocabulary Alignment at Scale in the UMLS Metathesaurus.
Proc Int World Wide Web Conf. 2021 Apr;2021:2672-2683. doi: 10.1145/3442381.3450128. Epub 2021 Apr 19.
3
A review of auditing techniques for the Unified Medical Language System.
J Am Med Inform Assoc. 2020 Oct 1;27(10):1625-1638. doi: 10.1093/jamia/ocaa108.
4
Assessing the practice of biomedical ontology evaluation: Gaps and opportunities.
J Biomed Inform. 2018 Apr;80:1-13. doi: 10.1016/j.jbi.2018.02.010. Epub 2018 Feb 17.
5
COBE: A Conjunctive Ontology Browser and Explorer for Visualizing SNOMED CT Fragments.
AMIA Annu Symp Proc. 2015 Nov 5;2015:2092-100. eCollection 2015.
7
Abstraction networks for terminologies: Supporting management of "big knowledge".
Artif Intell Med. 2015 May;64(1):1-16. doi: 10.1016/j.artmed.2015.03.005. Epub 2015 Apr 2.
8
An analysis of FMA using structural self-bisimilarity.
J Biomed Inform. 2013 Jun;46(3):497-505. doi: 10.1016/j.jbi.2013.03.005. Epub 2013 Apr 2.
9
Logic-based assessment of the compatibility of UMLS ontology sources.
J Biomed Semantics. 2011 Mar 7;2 Suppl 1(Suppl 1):S2. doi: 10.1186/2041-1480-2-S1-S2.

本文引用的文献

1
An upper-level ontology for the biomedical domain.
Comp Funct Genomics. 2003;4(1):80-4. doi: 10.1002/cfg.255.
2
Relations in biomedical ontologies.
Genome Biol. 2005;6(5):R46. doi: 10.1186/gb-2005-6-5-r46. Epub 2005 Apr 28.
3
Integrating SNOMED CT into the UMLS: an exploration of different views of synonymy and quality of editing.
J Am Med Inform Assoc. 2005 Jul-Aug;12(4):486-94. doi: 10.1197/jamia.M1767. Epub 2005 Mar 31.
4
Coping with medical polysemy in the semantic web: the role of ontologies.
Stud Health Technol Inform. 2004;107(Pt 1):416-9.
5
Auditing concept categorizations in the UMLS.
Artif Intell Med. 2004 May;31(1):29-44. doi: 10.1016/j.artmed.2004.02.002.
6
Consistency across the hierarchies of the UMLS Semantic Network and Metathesaurus.
J Biomed Inform. 2003 Dec;36(6):450-61. doi: 10.1016/j.jbi.2003.11.001.
7
Exploring semantic groups through visual approaches.
J Biomed Inform. 2003 Dec;36(6):414-32. doi: 10.1016/j.jbi.2003.11.002.
8
The Unified Medical Language System (UMLS): integrating biomedical terminology.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70. doi: 10.1093/nar/gkh061.
9
The cohesive metaschema: a higher-level abstraction of the UMLS Semantic Network.
J Biomed Inform. 2002 Jun;35(3):194-212. doi: 10.1016/s1532-0464(02)00528-2.
10
Aggregating UMLS semantic types for reducing conceptual complexity.
Stud Health Technol Inform. 2001;84(Pt 1):216-20.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验