Suppr超能文献

自动识别人类表型本体论中的缺失 IS-A 关系。

Automated Identification of Missing IS-A Relations in the Human Phenotype Ontology.

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX.

Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX.

出版信息

AMIA Annu Symp Proc. 2023 Apr 29;2022:785-794. eCollection 2022.

Abstract

Auditing the Human Phenotype Ontology (HPO) is necessary to provide accurate terminology for its use in clinical research. We investigate an approach leveraging the lexical features of concepts in HPO to identify missing IS-A relations among HPO concepts. We first model the names of HPO concepts as sets of words in lower case. Then, we generate two types of concept-pairs which have at least a single common word: (1) Linked concept-pairs generated from concept-pairs having an IS-A relation; (2) Unlinked concept-pairs generated from concept-pairs without an IS- A relation. Concept-pairs generate Derived Term Pairs (DTPs) emphasizing unique lexical information of each concept. If a linked concept-pair and an unlinked concept-pair generate the same DTP, then we suggest a potential missing IS-A relation among the unlinked concept-pair. Applying our approach to the 2022-02-14 release of HPO, we uncovered 2,516 potential missing IS-A relations in HPO. We validated 59 missing IS-A relations leveraging the Unified Medical Language System (UMLS) by mapping the concept-pair to UMLS concepts and verifying whether UMLS records an IS-A relation between the pair of concepts.

摘要

审核人类表型本体(HPO)对于在临床研究中使用它提供准确的术语是必要的。我们研究了一种利用 HPO 中概念的词汇特征来识别 HPO 概念之间缺失的 IS-A 关系的方法。我们首先将 HPO 概念的名称建模为小写单词的集合。然后,我们生成了两种类型的至少有一个共同单词的概念对:(1)从具有 IS-A 关系的概念对生成的链接概念对;(2)从没有 IS-A 关系的概念对生成的非链接概念对。概念对生成派生术语对(DTP),强调每个概念的独特词汇信息。如果链接概念对和非链接概念对生成相同的 DTP,则我们建议在非链接概念对之间存在潜在的缺失 IS-A 关系。将我们的方法应用于 2022-02-14 发布的 HPO,我们在 HPO 中发现了 2516 个潜在的缺失 IS-A 关系。我们利用统一医学语言系统(UMLS)验证了 59 个缺失的 IS-A 关系,通过将概念对映射到 UMLS 概念并验证 UMLS 是否记录了这对概念之间的 IS-A 关系,从而验证了这 59 个缺失的 IS-A 关系。

相似文献

3
Identifying Missing IS-A Relations in Orphanet Rare Disease Ontology.识别《孤儿病本体论》中缺失的“属于”关系。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:3274-3279. doi: 10.1109/bibm55620.2022.9995614. Epub 2023 Jan 2.
5
Leveraging non-lattice subgraphs for suggestion of new concepts for SNOMED CT.利用非格状子图为医学系统命名法(SNOMED CT)的新概念提供建议。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2021 Dec;2021:1805-1812. doi: 10.1109/bibm52615.2021.9669407.
8
A substring replacement approach for identifying missing IS-A relations in SNOMED CT.一种用于识别SNOMED CT中缺失的“是一种”关系的子串替换方法。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:2611-2618. doi: 10.1109/bibm55620.2022.9995595. Epub 2023 Jan 2.
10
Auditing the multiply-related concepts within the UMLS.审核 UMLS 中的多重相关概念。
J Am Med Inform Assoc. 2014 Oct;21(e2):e185-93. doi: 10.1136/amiajnl-2013-002227. Epub 2014 Jan 24.

本文引用的文献

2
Leveraging non-lattice subgraphs for suggestion of new concepts for SNOMED CT.利用非格状子图为医学系统命名法(SNOMED CT)的新概念提供建议。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2021 Dec;2021:1805-1812. doi: 10.1109/bibm52615.2021.9669407.
3
The Human Phenotype Ontology in 2021.2021 年人类表型本体论。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1207-D1217. doi: 10.1093/nar/gkaa1043.
7

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验