• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自然语言处理技术在真实世界的希伯来语自由文本电子病历中识别糖尿病相关并发症。

Identifying Diabetes Related-Complications in a Real-World Free-Text Electronic Medical Records in Hebrew Using Natural Language Processing Techniques.

作者信息

Saban Mor, Lutski Miri, Zucker Inbar, Uziel Moshe, Ben-Moshe Dror, Israel Ariel, Vinker Shlomo, Golan-Cohen Avivit, Laufer Izhar, Green Ilan, Eldor Roy, Merzon Eugene

机构信息

Nursing Department, School of Health Professions, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel.

The Israel Center for Disease Control, Ministry of Health, Ramat Gan, Israel.

出版信息

J Diabetes Sci Technol. 2024 Jan 30:19322968241228555. doi: 10.1177/19322968241228555.

DOI:10.1177/19322968241228555
PMID:38288672
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11571488/
Abstract

BACKGROUND

Studies have demonstrated that 50% to 80% of patients do not receive an International Classification of Diseases (ICD) code assigned to their medical encounter or condition. For these patients, their clinical information is mostly recorded as unstructured free-text narrative data in the medical record without standardized coding or extraction of structured data elements. Leumit Health Services (LHS) in collaboration with the Israeli Ministry of Health (MoH) conducted this study using electronic medical records (EMRs) to systematically extract meaningful clinical information about people with diabetes from the unstructured free-text notes.

OBJECTIVES

To develop and validate natural language processing (NLP) algorithms to identify diabetes-related complications in the free-text medical records of patients who have LHS membership.

METHODS

The study data included 2.3 million records of 41 469 patients with diabetes aged 35 or older between the years 2012 and 2017. The diabetes related complications included cardiovascular disease, diabetic neuropathy, nephropathy, retinopathy, diabetic foot, cognitive impairments, mood disorders and hypoglycemia. A vocabulary list of terms was determined and adjudicated by two physicians who are experienced in diabetes care board certified diabetes specialist in endocrinology or family medicine. Two independent registered nurses with PhDs reviewed the free-text medical records. Both rule-based and machine learning techniques were used for the NLP algorithm development. Precision, recall, and -score were calculated to compare the performance of (1) the NLP algorithm with the reviewers' comments and (2) the ICD codes with the reviewers' comments for each complication.

RESULTS

The NLP algorithm versus the reviewers (gold standard) achieved an overall good performance with a mean -score of 86%. This was better than the ICD codes which achieved a mean -score of only 51%.

CONCLUSION

NLP algorithms and machine learning processes may enable more accurate identification of diabetes complications in EMR data.

摘要

背景

研究表明,50%至80%的患者在就医时未被分配国际疾病分类(ICD)代码。对于这些患者,他们的临床信息大多以非结构化的自由文本叙述数据形式记录在病历中,没有进行标准化编码或提取结构化数据元素。Leumit健康服务机构(LHS)与以色列卫生部(MoH)合作开展了这项研究,利用电子病历(EMR)从非结构化的自由文本记录中系统地提取有关糖尿病患者的有意义的临床信息。

目的

开发并验证自然语言处理(NLP)算法,以识别LHS会员患者的自由文本病历中的糖尿病相关并发症。

方法

研究数据包括2012年至2017年间41469名35岁及以上糖尿病患者的230万条记录。糖尿病相关并发症包括心血管疾病、糖尿病神经病变、肾病、视网膜病变、糖尿病足、认知障碍、情绪障碍和低血糖。由两名在内分泌学或家庭医学领域具有糖尿病护理经验且获得糖尿病专科委员会认证的医生确定并裁定术语词汇表。两名拥有博士学位的独立注册护士对自由文本病历进行了审查。基于规则和机器学习技术均用于NLP算法开发。计算精度、召回率和F1分数,以比较(1)NLP算法与审查员评论的性能,以及(2)每种并发症的ICD代码与审查员评论的性能。

结果

NLP算法与审查员(金标准)相比总体表现良好,平均F1分数为86%。这优于ICD代码,ICD代码的平均F1分数仅为51%。

结论

NLP算法和机器学习过程可能有助于更准确地识别EMR数据中的糖尿病并发症。

相似文献

1
Identifying Diabetes Related-Complications in a Real-World Free-Text Electronic Medical Records in Hebrew Using Natural Language Processing Techniques.使用自然语言处理技术在真实世界的希伯来语自由文本电子病历中识别糖尿病相关并发症。
J Diabetes Sci Technol. 2024 Jan 30:19322968241228555. doi: 10.1177/19322968241228555.
2
Machine Learning and Natural Language Processing in Mental Health: Systematic Review.机器学习和自然语言处理在心理健康中的应用:系统综述。
J Med Internet Res. 2021 May 4;23(5):e15708. doi: 10.2196/15708.
3
Extracting information from the text of electronic medical records to improve case detection: a systematic review.从电子病历文本中提取信息以改善病例检测:一项系统综述
J Am Med Inform Assoc. 2016 Sep;23(5):1007-15. doi: 10.1093/jamia/ocv180. Epub 2016 Feb 5.
4
Clinical judgement by primary care physicians for the diagnosis of all-cause dementia or cognitive impairment in symptomatic people.初级保健医生对有症状人群进行全因痴呆或认知障碍诊断的临床判断。
Cochrane Database Syst Rev. 2022 Jun 16;6(6):CD012558. doi: 10.1002/14651858.CD012558.pub2.
5
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
The experience of adults who choose watchful waiting or active surveillance as an approach to medical treatment: a qualitative systematic review.选择观察等待或主动监测作为治疗方法的成年人的经历:一项定性系统评价。
JBI Database System Rev Implement Rep. 2016 Feb;14(2):174-255. doi: 10.11124/jbisrir-2016-2270.
9
The potential of Generative Pre-trained Transformer 4 (GPT-4) to analyse medical notes in three different languages: a retrospective model-evaluation study.生成式预训练变换器4(GPT-4)分析三种不同语言医学笔记的潜力:一项回顾性模型评估研究。
Lancet Digit Health. 2025 Jan;7(1):e35-e43. doi: 10.1016/S2589-7500(24)00246-2.
10
Education support services for improving school engagement and academic performance of children and adolescents with a chronic health condition.改善患有慢性病的儿童和青少年的学校参与度和学业成绩的教育支持服务。
Cochrane Database Syst Rev. 2023 Feb 8;2(2):CD011538. doi: 10.1002/14651858.CD011538.pub2.

引用本文的文献

1
Artificial Intelligence-Assisted Clinical Decision-Making: A Perspective on Advancing Personalized Precision Medicine for Elderly Diabetes Patients.人工智能辅助临床决策:推进老年糖尿病患者个性化精准医疗的视角
J Multidiscip Healthc. 2025 Aug 4;18:4643-4651. doi: 10.2147/JMDH.S529190. eCollection 2025.
2
Using natural language processing to identify patterns associated with depression, anxiety, and stress symptoms during the COVID-19 pandemic.利用自然语言处理技术识别在2019冠状病毒病大流行期间与抑郁、焦虑和压力症状相关的模式。
J Affect Disord. 2025 May 1;376:113-121. doi: 10.1016/j.jad.2025.01.139. Epub 2025 Jan 31.
3
Artificial Intelligence to Diagnose Complications of Diabetes.人工智能用于诊断糖尿病并发症。
J Diabetes Sci Technol. 2025 Jan;19(1):246-264. doi: 10.1177/19322968241287773. Epub 2024 Nov 22.
4
DR-GPT: A large language model for medical report analysis of diabetic retinopathy patients.DR-GPT:一种用于分析糖尿病视网膜病变患者医学报告的大型语言模型。
PLoS One. 2024 Oct 11;19(10):e0297706. doi: 10.1371/journal.pone.0297706. eCollection 2024.

本文引用的文献

1
Improving severity classification of Hebrew PET-CT pathology reports using test-time augmentation.利用测试时增强提高希伯来语 PET-CT 病理报告的严重程度分类。
J Biomed Inform. 2024 Jan;149:104577. doi: 10.1016/j.jbi.2023.104577. Epub 2023 Dec 13.
2
Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods.使用基于转换器的自然语言处理方法识别与糖尿病视网膜病变相关的临床概念及其属性。
BMC Med Inform Decis Mak. 2022 Sep 27;22(Suppl 3):255. doi: 10.1186/s12911-022-01996-2.
3
Specialists, Scientists, and Sentiments: Word2Vec and Doc2Vec in Analysis of Scientific and Medical Texts.专家、科学家与情感:科学和医学文本分析中的词向量模型(Word2Vec)与文档向量模型(Doc2Vec)
SN Comput Sci. 2021;2(5):414. doi: 10.1007/s42979-021-00807-1. Epub 2021 Aug 15.
4
Structuring clinical text with AI: Old versus new natural language processing techniques evaluated on eight common cardiovascular diseases.利用人工智能构建临床文本:在八种常见心血管疾病上对新旧自然语言处理技术进行评估
Patterns (N Y). 2021 Jun 17;2(7):100289. doi: 10.1016/j.patter.2021.100289. eCollection 2021 Jul 9.
5
Using Natural Language Processing to Measure and Improve Quality of Diabetes Care: A Systematic Review.使用自然语言处理技术衡量和改善糖尿病护理质量:系统评价。
J Diabetes Sci Technol. 2021 May;15(3):553-560. doi: 10.1177/19322968211000831. Epub 2021 Mar 19.
6
Detecting Miscoded Diabetes Diagnosis Codes in Electronic Health Records for Quality Improvement: Temporal Deep Learning Approach.通过时间深度学习方法检测电子健康记录中编码错误的糖尿病诊断代码以提高质量
JMIR Med Inform. 2020 Dec 17;8(12):e22649. doi: 10.2196/22649.
7
Clinical Tractor: A Framework for Automatic Natural Language Understanding of Clinical Practice Guidelines.临床牵引器:一种用于临床实践指南自动自然语言理解的框架。
AMIA Annu Symp Proc. 2020 Mar 4;2019:784-793. eCollection 2019.
8
Enhancing ICD-Code-Based Case Definition for Heart Failure Using Electronic Medical Record Data.利用电子病历数据增强基于ICD编码的心力衰竭病例定义
J Card Fail. 2020 Jul;26(7):610-617. doi: 10.1016/j.cardfail.2020.04.003. Epub 2020 Apr 15.
9
Physician experts in diabetes are natural team leaders for managing diabetic patients with foot complications. A position statement from the Italian diabetic foot study group.糖尿病医学专家是管理足部并发症糖尿病患者的天然团队领导者。意大利糖尿病足研究小组的立场声明。
Nutr Metab Cardiovasc Dis. 2020 Feb 10;30(2):167-178. doi: 10.1016/j.numecd.2019.11.009. Epub 2019 Nov 26.
10
Diabetes as a cardiovascular risk factor: An overview of global trends of macro and micro vascular complications.糖尿病作为心血管风险因素:宏观和微观血管并发症全球趋势概述。
Eur J Prev Cardiol. 2019 Dec;26(2_suppl):25-32. doi: 10.1177/2047487319878371. Epub 2019 Nov 13.