• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估大型生物医学术语中层次关系的粒度平衡,以提高质量。

Evaluating the granularity balance of hierarchical relationships within large biomedical terminologies towards quality improvement.

机构信息

School of Computer Science, University of South China, Hengyang, Hunan 421001, China.

School of Computer Science, University of South China, Hengyang, Hunan 421001, China.

出版信息

J Biomed Inform. 2017 Nov;75:129-137. doi: 10.1016/j.jbi.2017.10.001. Epub 2017 Oct 4.

DOI:10.1016/j.jbi.2017.10.001
PMID:28987379
Abstract

Organizing the descendants of a concept under a particular semantic relationship may be rather arbitrarily carried out during the manual creation processes of large biomedical terminologies, resulting in imbalances in relationship granularity. This work aims to propose scalable models towards systematically evaluating the granularity balance of semantic relationships. We first utilize "parallel concepts set (PCS)" and two features (the length and the strength) of the paths between PCSs to design the general evaluation models, based on which we propose eight concrete evaluation models generated by two specific types of PCSs: single concept set and symmetric concepts set. We then apply those concrete models to the IS-A relationship in FMA and SNOMED CT's Body Structure subset, as well as to the Part-Of relationship in FMA. Moreover, without loss of generality, we conduct two additional rounds of applications on the Part-Of relationship after removing length redundancies and strength redundancies sequentially. At last, we perform automatic evaluation on the imbalances detected after the final round for identifying missing concepts, misaligned relations and inconsistencies. For the IS-A relationship, 34 missing concepts, 80 misalignments and 18 redundancies in FMA as well as 28 missing concepts, 114 misalignments and 1 redundancy in SNOMED CT were uncovered. In addition, 6,801 instances of imbalances for the Part-Of relationship in FMA were also identified, including 3,246 redundancies. After removing those redundancies from FMA, the total number of Part-Of imbalances was dramatically reduced to 327, including 51 missing concepts, 294 misaligned relations, and 36 inconsistencies. Manual curation performed by the FMA project leader confirmed the effectiveness of our method in identifying curation errors. In conclusion, the granularity balance of hierarchical semantic relationship is a valuable property to check for ontology quality assurance, and the scalable evaluation models proposed in this study are effective in fulfilling this task, especially in auditing relationships with sub-hierarchies, such as the seldom evaluated Part-Of relationship.

摘要

对概念的后代进行组织可能在大型生物医学术语的手动创建过程中相当随意地进行,这导致语义关系的粒度不平衡。本工作旨在提出可扩展的模型,以系统地评估语义关系的粒度平衡。我们首先利用“平行概念集(PCS)”和 PCS 之间路径的两个特征(长度和强度)来设计通用评估模型,在此基础上,我们提出了通过两种特定类型的 PCS 生成的八个具体评估模型:单个概念集和对称概念集。然后,我们将这些具体模型应用于 FMA 的 IS-A 关系和 SNOMED CT 的身体结构子集的关系,以及 FMA 的 Part-Of 关系。此外,为了不失一般性,我们在依次删除长度冗余和强度冗余之后,对 Part-Of 关系进行了另外两轮应用。最后,我们对最后一轮检测到的不平衡情况进行自动评估,以识别缺失的概念、对齐关系和不一致性。对于 IS-A 关系,在 FMA 中发现了 34 个缺失的概念、80 个不匹配和 18 个冗余,在 SNOMED CT 中发现了 28 个缺失的概念、114 个不匹配和 1 个冗余。此外,还确定了 FMA 中 Part-Of 关系的 6801 个不平衡实例,其中包括 3246 个冗余。从 FMA 中删除这些冗余后,Part-Of 关系的总数急剧减少到 327,包括 51 个缺失的概念、294 个不匹配的关系和 36 个不一致性。FMA 项目负责人进行的手动整理证实了我们的方法在识别整理错误方面的有效性。总之,层次语义关系的粒度平衡是检查本体质量保证的一个有价值的属性,本研究提出的可扩展评估模型在完成这项任务方面非常有效,特别是在审核子层次关系时,例如很少被评估的 Part-Of 关系。

相似文献

1
Evaluating the granularity balance of hierarchical relationships within large biomedical terminologies towards quality improvement.评估大型生物医学术语中层次关系的粒度平衡,以提高质量。
J Biomed Inform. 2017 Nov;75:129-137. doi: 10.1016/j.jbi.2017.10.001. Epub 2017 Oct 4.
2
Quality Assurance of UMLS Semantic Type Assignments Using SNOMED CT Hierarchies.使用SNOMED CT层次结构对统一医学语言系统语义类型分配进行质量保证
Methods Inf Med. 2016;55(2):158-65. doi: 10.3414/ME14-01-0104. Epub 2015 Apr 30.
3
Auditing SNOMED CT hierarchical relations based on lexical features of concepts in non-lattice subgraphs.基于非格子网中概念的词汇特征来审核 SNOMED CT 层次关系。
J Biomed Inform. 2018 Feb;78:177-184. doi: 10.1016/j.jbi.2017.12.010. Epub 2017 Dec 20.
4
A transformation-based method for auditing the IS-A hierarchy of biomedical terminologies in the Unified Medical Language System.基于转换的方法审核统一医学语言系统中生物医学术语的 IS-A 层次结构。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1568-1575. doi: 10.1093/jamia/ocaa123.
5
From lexical regularities to axiomatic patterns for the quality assurance of biomedical terminologies and ontologies.从词汇规律到公理模式,保障生物医学术语和本体的质量。
J Biomed Inform. 2018 Aug;84:59-74. doi: 10.1016/j.jbi.2018.06.008. Epub 2018 Jun 14.
6
Missing lateral relationships in top-level concepts of an ontology.本体论中顶级概念中缺失横向关系。
BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 10):305. doi: 10.1186/s12911-020-01319-3.
7
Leveraging logical definitions and lexical features to detect missing IS-A relations in biomedical terminologies.利用逻辑定义和词汇特征来检测生物医学术语中缺失的 IS-A 关系。
J Biomed Semantics. 2024 May 1;15(1):6. doi: 10.1186/s13326-024-00309-y.
8
NEO: Systematic Non-Lattice Embedding of Ontologies for Comparing the Subsumption Relationship in SNOMED CT and in FMA Using MapReduce.NEO:用于使用MapReduce比较SNOMED CT和FMA中包含关系的本体的系统非格嵌入
AMIA Jt Summits Transl Sci Proc. 2015 Mar 25;2015:216-20. eCollection 2015.
9
A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships.用于无属性关系的SNOMED CT目标层次结构的部落抽象网络。
J Am Med Inform Assoc. 2015 May;22(3):628-39. doi: 10.1136/amiajnl-2014-003173. Epub 2014 Oct 20.
10
Identification of missing concepts in biomedical terminologies using sequence-based formal concept analysis.使用基于序列的形式概念分析识别生物医学术语中的缺失概念。
BMC Med Inform Decis Mak. 2021 Nov 9;21(Suppl 7):234. doi: 10.1186/s12911-021-01592-w.

引用本文的文献

1
Extending import detection algorithms for concept import from two to three biomedical terminologies.将概念导入的导入检测算法从两个扩展到三个生物医学术语。
BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 10):272. doi: 10.1186/s12911-020-01290-z.
2
Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation.基于词汇粒度和机器学习的本体术语自动构建:算法开发与验证
JMIR Med Inform. 2020 Nov 25;8(11):e22333. doi: 10.2196/22333.
3
Alternative classification of identical concepts in different terminologies: Different ways to view the world.
不同术语中相同概念的另类分类:看世界的不同方式。
J Biomed Inform. 2019 Jun;94:103193. doi: 10.1016/j.jbi.2019.103193. Epub 2019 May 7.
4
Leveraging Horizontal Density Differences between Ontologies to Identify Missing Child Concepts: A Proof of Concept.利用本体之间的水平密度差异来识别缺失的子概念:概念验证
AMIA Annu Symp Proc. 2018 Dec 5;2018:644-653. eCollection 2018.