• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统医学术语命名法(SNOMED CT)在医疗保健中处理自由文本的应用:系统范围综述。

Use of the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) for Processing Free Text in Health Care: Systematic Scoping Review.

机构信息

Division of Medical Information Sciences, Geneva University Hospitals, Geneva, Switzerland.

Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland.

出版信息

J Med Internet Res. 2021 Jan 26;23(1):e24594. doi: 10.2196/24594.

DOI:10.2196/24594
PMID:33496673
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7872838/
Abstract

BACKGROUND

Interoperability and secondary use of data is a challenge in health care. Specifically, the reuse of clinical free text remains an unresolved problem. The Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) has become the universal language of health care and presents characteristics of a natural language. Its use to represent clinical free text could constitute a solution to improve interoperability.

OBJECTIVE

Although the use of SNOMED and SNOMED CT has already been reviewed, its specific use in processing and representing unstructured data such as clinical free text has not. This review aims to better understand SNOMED CT's use for representing free text in medicine.

METHODS

A scoping review was performed on the topic by searching MEDLINE, Embase, and Web of Science for publications featuring free-text processing and SNOMED CT. A recursive reference review was conducted to broaden the scope of research. The review covered the type of processed data, the targeted language, the goal of the terminology binding, the method used and, when appropriate, the specific software used.

RESULTS

In total, 76 publications were selected for an extensive study. The language targeted by publications was 91% (n=69) English. The most frequent types of documents for which the terminology was used are complementary exam reports (n=18, 24%) and narrative notes (n=16, 21%). Mapping to SNOMED CT was the final goal of the research in 21% (n=16) of publications and a part of the final goal in 33% (n=25). The main objectives of mapping are information extraction (n=44, 39%), feature in a classification task (n=26, 23%), and data normalization (n=23, 20%). The method used was rule-based in 70% (n=53) of publications, hybrid in 11% (n=8), and machine learning in 5% (n=4). In total, 12 different software packages were used to map text to SNOMED CT concepts, the most frequent being Medtex, Mayo Clinic Vocabulary Server, and Medical Text Extraction Reasoning and Mapping System. Full terminology was used in 64% (n=49) of publications, whereas only a subset was used in 30% (n=23) of publications. Postcoordination was proposed in 17% (n=13) of publications, and only 5% (n=4) of publications specifically mentioned the use of the compositional grammar.

CONCLUSIONS

SNOMED CT has been largely used to represent free-text data, most frequently with rule-based approaches, in English. However, currently, there is no easy solution for mapping free text to this terminology and to perform automatic postcoordination. Most solutions conceive SNOMED CT as a simple terminology rather than as a compositional bag of ontologies. Since 2012, the number of publications on this subject per year has decreased. However, the need for formal semantic representation of free text in health care is high, and automatic encoding into a compositional ontology could be a solution.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/01dfcdae5036/jmir_v23i1e24594_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/77e05522e8a0/jmir_v23i1e24594_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/38e7bd8a8d2b/jmir_v23i1e24594_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/264f434029b9/jmir_v23i1e24594_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/01dfcdae5036/jmir_v23i1e24594_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/77e05522e8a0/jmir_v23i1e24594_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/38e7bd8a8d2b/jmir_v23i1e24594_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/264f434029b9/jmir_v23i1e24594_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab9/7872838/01dfcdae5036/jmir_v23i1e24594_fig4.jpg
摘要

背景

互操作性和数据的二次利用是医疗保健领域的一个挑战。具体来说,临床自由文本的再利用仍然是一个未解决的问题。医学系统命名法临床术语(SNOMED CT)已成为医疗保健的通用语言,并具有自然语言的特征。将其用于表示临床自由文本可以构成提高互操作性的解决方案。

目的

尽管已经对 SNOMED 和 SNOMED CT 的使用进行了审查,但尚未对其在处理和表示临床自由文本等非结构化数据方面的具体用途进行审查。本综述旨在更好地了解 SNOMED CT 在医学中表示自由文本的用途。

方法

通过在 MEDLINE、Embase 和 Web of Science 中搜索有关自由文本处理和 SNOMED CT 的出版物,对该主题进行了范围界定审查。通过递归参考审查扩大了研究范围。该审查涵盖了处理数据的类型、目标语言、术语绑定的目标、使用的方法以及在适当情况下使用的特定软件。

结果

共选择了 76 篇论文进行深入研究。出版物的目标语言为 91%(n=69)英语。术语最常使用的文档类型是补充检查报告(n=18,24%)和叙述性笔记(n=16,21%)。在 21%(n=16)的出版物中,将术语映射到 SNOMED CT 是最终目标,在 33%(n=25)的出版物中是最终目标的一部分。映射的主要目标是信息提取(n=44,39%)、分类任务中的特征(n=26,23%)和数据标准化(n=23,20%)。使用的方法在 70%(n=53)的出版物中是基于规则的,在 11%(n=8)的出版物中是混合的,在 5%(n=4)的出版物中是机器学习的。总共使用了 12 个不同的软件包将文本映射到 SNOMED CT 概念,最常用的是 Medtex、Mayo Clinic Vocabulary Server 和 Medical Text Extraction Reasoning and Mapping System。在 64%(n=49)的出版物中使用了完整的术语,而在 30%(n=23)的出版物中仅使用了术语的一个子集。在 17%(n=13)的出版物中提出了后置协调,只有 5%(n=4)的出版物专门提到了使用组合语法。

结论

SNOMED CT 已被广泛用于表示自由文本数据,最常使用基于规则的方法,主要是英语。然而,目前,将自由文本映射到此术语并执行自动后置协调还没有简单的解决方案。大多数解决方案将 SNOMED CT 视为简单的术语,而不是组合的本体袋。自 2012 年以来,每年关于该主题的出版物数量有所减少。然而,医疗保健中对自由文本的正式语义表示的需求很高,自动编码为组合本体可能是一种解决方案。

相似文献

1
Use of the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) for Processing Free Text in Health Care: Systematic Scoping Review.系统医学术语命名法(SNOMED CT)在医疗保健中处理自由文本的应用:系统范围综述。
J Med Internet Res. 2021 Jan 26;23(1):e24594. doi: 10.2196/24594.
2
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.开发和评估 RapTAT:一种用于从医学叙述中映射短语概念的机器学习系统。
J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.
3
Supporting SNOMED CT postcoordination with knowledge graph embeddings.利用知识图谱嵌入技术支持SNOMED CT后置协调。
J Biomed Inform. 2023 Mar;139:104297. doi: 10.1016/j.jbi.2023.104297. Epub 2023 Feb 1.
4
Quantitative analysis of manual annotation of clinical text samples.临床文本样本的人工标注定量分析。
Int J Med Inform. 2019 Mar;123:37-48. doi: 10.1016/j.ijmedinf.2018.12.011. Epub 2018 Dec 31.
5
Use of SNOMED CT in Large Language Models: Scoping Review.SNOMED CT 在大语言模型中的应用:范围综述。
JMIR Med Inform. 2024 Oct 7;12:e62924. doi: 10.2196/62924.
6
PCEtoFHIR: Decomposition of Postcoordinated SNOMED CT Expressions for Storage as HL7 FHIR Resources.PCEtoFHIR:用于存储为 HL7 FHIR 资源的后协调 SNOMED CT 表达式的分解。
JMIR Med Inform. 2024 Sep 17;12:e57853. doi: 10.2196/57853.
7
A semi-automatic semantic method for mapping SNOMED CT concepts to VCM Icons.一种将SNOMED CT概念映射到VCM图标集的半自动语义方法。
Stud Health Technol Inform. 2013;192:42-6.
8
Automatic Annotation of French Medical Narratives with SNOMED CT Concepts.使用SNOMED CT概念对法语医学叙述进行自动标注
Stud Health Technol Inform. 2018;247:710-714.
9
A computational linguistics motivated mapping of ICPC-2 PLUS to SNOMED CT.一种基于计算语言学的国际初级保健分类标准第二版增强版(ICPC-2 PLUS)到医学系统命名法临床术语(SNOMED CT)的映射。
BMC Med Inform Decis Mak. 2008 Oct 27;8 Suppl 1(Suppl 1):S5. doi: 10.1186/1472-6947-8-S1-S5.
10
Automatic SNOMED CT coding of Chinese clinical terms via attention-based semantic matching.通过基于注意力的语义匹配对中文临床术语进行自动SNOMED CT编码。
Int J Med Inform. 2022 Mar;159:104676. doi: 10.1016/j.ijmedinf.2021.104676. Epub 2021 Dec 28.

引用本文的文献

1
Exploring the consistency, quality and challenges in manual and automated coding of free-text diagnoses from hospital outpatient letters.探索医院门诊信件中自由文本诊断的人工编码和自动编码的一致性、质量及挑战。
PLoS One. 2025 Aug 25;20(8):e0328108. doi: 10.1371/journal.pone.0328108. eCollection 2025.
2
Using Real Electronic Health Records in Undergraduate Education: Roundtable Discussion.在本科教育中使用真实电子健康记录:圆桌讨论
JMIR Form Res. 2025 Jun 12;9:e60789. doi: 10.2196/60789.
3
A Novel Framework to Assess Clinical Information in Digital Health Technologies: Cross-Sectional Survey Study.

本文引用的文献

1
Computer-Assisted Diagnostic Coding: Effectiveness of an NLP-based approach using SNOMED CT to ICD-10 mappings.计算机辅助诊断编码:一种基于自然语言处理的方法利用SNOMED CT到ICD-10映射的有效性。
AMIA Annu Symp Proc. 2018 Dec 5;2018:807-816. eCollection 2018.
2
Comparison of MetaMap and cTAKES for entity extraction in clinical notes.比较 MetaMap 和 cTAKES 在临床记录中的实体提取。
BMC Med Inform Decis Mak. 2018 Sep 14;18(Suppl 3):74. doi: 10.1186/s12911-018-0654-2.
3
Automatic Normalization of Anatomical Phrases in Radiology Reports Using Unsupervised Learning.
一种评估数字健康技术中临床信息的新框架:横断面调查研究。
JMIR Med Inform. 2025 May 30;13:e58125. doi: 10.2196/58125.
4
Enhancing Physician-Patient Communication in Oncology Using GPT-4 Through Simplified Radiology Reports: Multicenter Quantitative Study.通过简化放射学报告利用GPT-4加强肿瘤学领域医患沟通:多中心定量研究
J Med Internet Res. 2025 Apr 17;27:e63786. doi: 10.2196/63786.
5
Semantics in action: a guide for representing clinical data elements with SNOMED CT.实践中的语义学:使用SNOMED CT表示临床数据元素的指南。
J Biomed Semantics. 2025 Mar 27;16(1):7. doi: 10.1186/s13326-025-00326-5.
6
Prevalence and risk factors associated with metabolic dysfunction-associated steatohepatitis in patients with infection: A population-based study.感染患者中与代谢功能障碍相关脂肪性肝炎相关的患病率及危险因素:一项基于人群的研究。
World J Hepatol. 2024 Oct 27;16(10):1169-1176. doi: 10.4254/wjh.v16.i10.1169.
7
Longitudinal Multimodal Transformer Integrating Imaging and Latent Clinical Signatures From Routine EHRs for Pulmonary Nodule Classification.纵向多模态变压器集成来自常规电子健康记录的成像和潜在临床特征用于肺结节分类
Med Image Comput Comput Assist Interv. 2023 Oct;14221:649-659. doi: 10.1007/978-3-031-43895-0_61. Epub 2023 Oct 1.
8
Central venous access device terminologies, complications, and reason for removal in oncology: a scoping review.肿瘤学中中央静脉通路装置的术语、并发症和移除原因:范围综述。
BMC Cancer. 2024 Apr 19;24(1):498. doi: 10.1186/s12885-024-12099-8.
9
Bibliometric Analysis of Development Trends and Research Hotspots in the Study of Data Mining in Nursing Based on CiteSpace.基于CiteSpace的护理领域数据挖掘研究发展趋势与研究热点的文献计量分析
J Multidiscip Healthc. 2024 Apr 10;17:1561-1575. doi: 10.2147/JMDH.S459079. eCollection 2024.
10
Curating retrospective multimodal and longitudinal data for community cohorts at risk for lung cancer.为有肺癌风险的社区队列整理回顾性多模态和纵向数据。
Cancer Biomark. 2025 Jan;42(1):CBM230340. doi: 10.3233/CBM-230340. Epub 2024 Mar 7.
使用无监督学习对放射学报告中的解剖短语进行自动归一化。
J Digit Imaging. 2019 Feb;32(1):6-18. doi: 10.1007/s10278-018-0116-5.
4
Knowledge discovery for Deep Phenotyping serious mental illness from Electronic Mental Health records.从电子心理健康记录中进行严重精神疾病深度表型分析的知识发现。
F1000Res. 2018 Feb 21;7:210. doi: 10.12688/f1000research.13830.2. eCollection 2018.
5
Extracting cancer mortality statistics from death certificates: A hybrid machine learning and rule-based approach for common and rare cancers.从死亡证明中提取癌症死亡率统计数据:一种用于常见和罕见癌症的混合机器学习和基于规则的方法。
Artif Intell Med. 2018 Jul;89:1-9. doi: 10.1016/j.artmed.2018.04.011. Epub 2018 May 10.
6
Classification of forensic autopsy reports through conceptual graph-based document representation model.基于概念图的文档表示模型对法医解剖报告的分类。
J Biomed Inform. 2018 Jun;82:88-105. doi: 10.1016/j.jbi.2018.04.013. Epub 2018 May 5.
7
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
8
Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes.用中文常用同义词丰富国际临床术语表,并在医生记录中进行概念识别。
BMC Med Inform Decis Mak. 2017 May 2;17(1):54. doi: 10.1186/s12911-017-0455-z.
9
Tool-supported Interactive Correction and Semantic Annotation of Narrative Clinical Reports.叙事性临床报告的工具支持交互式校正与语义标注
Methods Inf Med. 2017 May 18;56(3):217-229. doi: 10.3414/ME16-01-0083. Epub 2017 Apr 28.
10
HTP-NLP: A New NLP System for High Throughput Phenotyping.HTP-NLP:一种用于高通量表型分析的新型自然语言处理系统。
Stud Health Technol Inform. 2017;235:276-280.