• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

挖掘非格状子图以检测SNOMED CT中缺失的层次关系和概念。

Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT.

作者信息

Cui Licong, Zhu Wei, Tao Shiqiang, Case James T, Bodenreider Olivier, Zhang Guo-Qiang

机构信息

Department of Computer Science, University of Kentucky, Lexington, KY, USA.

Institute for Biomedical Informatics, University of Kentucky.

出版信息

J Am Med Inform Assoc. 2017 Jul 1;24(4):788-798. doi: 10.1093/jamia/ocw175.

DOI:10.1093/jamia/ocw175
PMID:28339775
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6080685/
Abstract

OBJECTIVE

Quality assurance of large ontological systems such as SNOMED CT is an indispensable part of the terminology management lifecycle. We introduce a hybrid structural-lexical method for scalable and systematic discovery of missing hierarchical relations and concepts in SNOMED CT.

MATERIAL AND METHODS

All non-lattice subgraphs (the structural part) in SNOMED CT are exhaustively extracted using a scalable MapReduce algorithm. Four lexical patterns (the lexical part) are identified among the extracted non-lattice subgraphs. Non-lattice subgraphs exhibiting such lexical patterns are often indicative of missing hierarchical relations or concepts. Each lexical pattern is associated with a potential specific type of error.

RESULTS

Applying the structural-lexical method to SNOMED CT (September 2015 US edition), we found 6801 non-lattice subgraphs that matched these lexical patterns, of which 2046 were amenable to visual inspection. We evaluated a random sample of 100 small subgraphs, of which 59 were reviewed in detail by domain experts. All the subgraphs reviewed contained errors confirmed by the experts. The most frequent type of error was missing is-a relations due to incomplete or inconsistent modeling of the concepts.

CONCLUSIONS

Our hybrid structural-lexical method is innovative and proved effective not only in detecting errors in SNOMED CT, but also in suggesting remediation for these errors.

摘要

目的

诸如SNOMED CT这样的大型本体系统的质量保证是术语管理生命周期中不可或缺的一部分。我们引入一种混合结构-词汇方法,用于可扩展且系统地发现SNOMED CT中缺失的层次关系和概念。

材料与方法

使用可扩展的MapReduce算法详尽提取SNOMED CT中的所有非格点子图(结构部分)。在提取的非格点子图中识别出四种词汇模式(词汇部分)。呈现此类词汇模式的非格点子图通常表明存在缺失的层次关系或概念。每种词汇模式都与一种潜在的特定错误类型相关联。

结果

将结构-词汇方法应用于SNOMED CT(2015年9月美国版),我们发现6801个与这些词汇模式匹配的非格点子图,其中2046个适合目视检查。我们评估了100个小子图的随机样本,其中59个由领域专家进行了详细审查。所有审查的子图都包含专家确认的错误。最常见的错误类型是由于概念建模不完整或不一致导致的缺失“是一个”关系。

结论

我们的混合结构-词汇方法具有创新性,不仅在检测SNOMED CT中的错误方面有效,而且在为这些错误提出补救措施方面也有效。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/abf33f862a26/ocw175f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/5a1302e42ed8/ocw175f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/626d92692f4a/ocw175f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/12d1d4a1c037/ocw175f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/0d3d705db84a/ocw175f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/28451be1ac3f/ocw175f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/abf33f862a26/ocw175f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/5a1302e42ed8/ocw175f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/626d92692f4a/ocw175f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/12d1d4a1c037/ocw175f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/0d3d705db84a/ocw175f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/28451be1ac3f/ocw175f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f97/6080685/abf33f862a26/ocw175f6.jpg

相似文献

1
Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT.挖掘非格状子图以检测SNOMED CT中缺失的层次关系和概念。
J Am Med Inform Assoc. 2017 Jul 1;24(4):788-798. doi: 10.1093/jamia/ocw175.
2
Auditing SNOMED CT hierarchical relations based on lexical features of concepts in non-lattice subgraphs.基于非格子网中概念的词汇特征来审核 SNOMED CT 层次关系。
J Biomed Inform. 2018 Feb;78:177-184. doi: 10.1016/j.jbi.2017.12.010. Epub 2017 Dec 20.
3
Quality Assurance of NCI Thesaurus by Mining Structural-Lexical Patterns.通过挖掘结构-词汇模式对美国国立癌症研究所叙词表进行质量保证
AMIA Annu Symp Proc. 2018 Apr 16;2017:364-373. eCollection 2017.
4
Leveraging non-lattice subgraphs for suggestion of new concepts for SNOMED CT.利用非格状子图为医学系统命名法(SNOMED CT)的新概念提供建议。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2021 Dec;2021:1805-1812. doi: 10.1109/bibm52615.2021.9669407.
5
Leveraging logical definitions and lexical features to detect missing IS-A relations in biomedical terminologies.利用逻辑定义和词汇特征来检测生物医学术语中缺失的 IS-A 关系。
J Biomed Semantics. 2024 May 1;15(1):6. doi: 10.1186/s13326-024-00309-y.
6
A lexical-based approach for exhaustive detection of missing hierarchical IS-A relations in SNOMED CT.基于词汇的方法,用于全面检测 SNOMED CT 中缺失的层次 IS-A 关系。
AMIA Annu Symp Proc. 2021 Jan 25;2020:1392-1401. eCollection 2020.
7
Web-based Interactive Visualization of Non-Lattice Subgraphs (WINS) in SNOMED CT.基于网络的SNOMED CT中非晶格子图的交互式可视化(WINS)
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:740-749. eCollection 2020.
8
A deep learning approach to identify missing is-a relations in SNOMED CT.一种用于识别 SNOMED CT 中缺失的 is-a 关系的深度学习方法。
J Am Med Inform Assoc. 2023 Feb 16;30(3):475-484. doi: 10.1093/jamia/ocac248.
9
Leveraging Non-lattice Subgraphs to Audit Hierarchical Relations in NCI Thesaurus.利用非格状子图审核美国国立癌症研究所叙词表中的层次关系。
AMIA Annu Symp Proc. 2020 Mar 4;2019:982-991. eCollection 2019.
10
Can SNOMED CT Changes Be Used as a Surrogate Standard for Evaluating the Performance of Its Auditing Methods?SNOMED CT的变更能否用作评估其审核方法性能的替代标准?
AMIA Annu Symp Proc. 2018 Apr 16;2017:1903-1912. eCollection 2017.

引用本文的文献

1
Quantitatively assessing the impact of the quality of SNOMED CT subtype hierarchy on cohort queries.定量评估SNOMED CT亚型层次结构质量对队列查询的影响。
J Am Med Inform Assoc. 2025 Jan 1;32(1):89-96. doi: 10.1093/jamia/ocae272.
2
Are ICD codes reliable for observational studies? Assessing coding consistency for data quality.国际疾病分类代码用于观察性研究是否可靠?评估数据质量的编码一致性。
Digit Health. 2024 Oct 29;10:20552076241297056. doi: 10.1177/20552076241297056. eCollection 2024 Jan-Dec.
3
Leveraging logical definitions and lexical features to detect missing IS-A relations in biomedical terminologies.

本文引用的文献

1
MaPLE: A MapReduce Pipeline for Lattice-based Evaluation and Its Application to SNOMED CT.MaPLE:一种用于基于格的评估的MapReduce管道及其在SNOMED CT中的应用。
Proc IEEE Int Conf Big Data. 2014 Oct;2014:754-759. doi: 10.1109/BigData.2014.7004301.
2
Using SPARQL to Test for Lattices: application to quality assurance in biomedical ontologies.使用SPARQL测试格:在生物医学本体论质量保证中的应用。
Semant Web ISWC. 2010;6497:273-288. doi: 10.1007/978-3-642-17749-1_18.
3
Scalable quality assurance for large SNOMED CT hierarchies using subject-based subtaxonomies.
利用逻辑定义和词汇特征来检测生物医学术语中缺失的 IS-A 关系。
J Biomed Semantics. 2024 May 1;15(1):6. doi: 10.1186/s13326-024-00309-y.
4
Concept Coverage Analysis of Ophthalmic Infections and Trauma among the Standardized Medical Terminologies SNOMED-CT, ICD-10-CM, and ICD-11.标准化医学术语SNOMED-CT、ICD-10-CM和ICD-11中眼科感染与创伤的概念覆盖分析
Ophthalmol Sci. 2023 May 25;3(4):100337. doi: 10.1016/j.xops.2023.100337. eCollection 2023 Dec.
5
Logical definition-based identification of potential missing concepts in SNOMED CT.基于逻辑定义的 SNOMED CT 中潜在缺失概念的识别。
BMC Med Inform Decis Mak. 2023 May 9;23(Suppl 1):87. doi: 10.1186/s12911-023-02183-7.
6
Automated Identification of Missing IS-A Relations in the Human Phenotype Ontology.自动识别人类表型本体论中的缺失 IS-A 关系。
AMIA Annu Symp Proc. 2023 Apr 29;2022:785-794. eCollection 2022.
7
Identifying Missing IS-A Relations in Orphanet Rare Disease Ontology.识别《孤儿病本体论》中缺失的“属于”关系。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:3274-3279. doi: 10.1109/bibm55620.2022.9995614. Epub 2023 Jan 2.
8
A substring replacement approach for identifying missing IS-A relations in SNOMED CT.一种用于识别SNOMED CT中缺失的“是一种”关系的子串替换方法。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:2611-2618. doi: 10.1109/bibm55620.2022.9995595. Epub 2023 Jan 2.
9
A deep learning approach to identify missing is-a relations in SNOMED CT.一种用于识别 SNOMED CT 中缺失的 is-a 关系的深度学习方法。
J Am Med Inform Assoc. 2023 Feb 16;30(3):475-484. doi: 10.1093/jamia/ocac248.
10
Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs.利用获取的术语对识别疫苗本体中缺失的层次关系。
J Biomed Semantics. 2022 Aug 13;13(1):22. doi: 10.1186/s13326-022-00276-2.
使用基于主题的子分类法对大型SNOMED CT层次结构进行可扩展的质量保证。
J Am Med Inform Assoc. 2015 May;22(3):507-18. doi: 10.1136/amiajnl-2014-003151. Epub 2014 Oct 21.
4
A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships.用于无属性关系的SNOMED CT目标层次结构的部落抽象网络。
J Am Med Inform Assoc. 2015 May;22(3):628-39. doi: 10.1136/amiajnl-2014-003173. Epub 2014 Oct 20.
5
Metrics for assessing the quality of value sets in clinical quality measures.用于评估临床质量指标中价值集质量的指标。
AMIA Annu Symp Proc. 2013 Nov 16;2013:1497-505. eCollection 2013.
6
Contrasting lexical similarity and formal definitions in SNOMED CT: consistency and implications.SNOMED CT中词汇相似性与形式定义的对比:一致性及影响
J Biomed Inform. 2014 Feb;47:192-8. doi: 10.1016/j.jbi.2013.11.003. Epub 2013 Nov 15.
7
Literature review of SNOMED CT use.SNOMED CT 使用文献回顾。
J Am Med Inform Assoc. 2014 Feb;21(e1):e11-9. doi: 10.1136/amiajnl-2013-001636. Epub 2013 Jul 4.
8
Lexically suggest, logically define: quality assurance of the use of qualifiers and expected results of post-coordination in SNOMED CT.从词汇上提示,从逻辑上定义:SNOMED CT 中限定词的使用和后协调预期结果的质量保证。
J Biomed Inform. 2012 Apr;45(2):199-209. doi: 10.1016/j.jbi.2011.10.002. Epub 2011 Oct 14.
9
Auditing complex concepts of SNOMED using a refined hierarchical abstraction network.使用改进的分层抽象网络审核 SNOMED 的复杂概念。
J Biomed Inform. 2012 Feb;45(1):1-14. doi: 10.1016/j.jbi.2011.08.016. Epub 2011 Sep 1.
10
Abstraction of complex concepts with a refined partial-area taxonomy of SNOMED.采用 SNOMED 的精细化局部区域分类法对复杂概念进行抽象。
J Biomed Inform. 2012 Feb;45(1):15-29. doi: 10.1016/j.jbi.2011.08.013. Epub 2011 Aug 25.