• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自然语言处理技术从医学记录中对神经学结果进行分类。

Classification of neurologic outcomes from medical notes using natural language processing.

作者信息

Fernandes Marta B, Valizadeh Navid, Alabsi Haitham S, Quadri Syed A, Tesh Ryan A, Bucklin Abigail A, Sun Haoqi, Jain Aayushee, Brenner Laura N, Ye Elissa, Ge Wendong, Collens Sarah I, Lin Stacie, Das Sudeshna, Robbins Gregory K, Zafar Sahar F, Mukerji Shibani S, Westover M Brandon

机构信息

Department of Neurology, Massachusetts General Hospital (MGH), Boston, MA, United States.

Harvard Medical School, Boston, MA, United States.

出版信息

Expert Syst Appl. 2023 Mar 15;214. doi: 10.1016/j.eswa.2022.119171. Epub 2022 Nov 6.

DOI:10.1016/j.eswa.2022.119171
PMID:36865787
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9974159/
Abstract

Neurologic disability level at hospital discharge is an important outcome in many clinical research studies. Outside of clinical trials, neurologic outcomes must typically be extracted by labor intensive manual review of clinical notes in the electronic health record (EHR). To overcome this challenge, we set out to develop a natural language processing (NLP) approach that automatically reads clinical notes to determine neurologic outcomes, to make it possible to conduct larger scale neurologic outcomes studies. We obtained 7314 notes from 3632 patients hospitalized at two large Boston hospitals between January 2012 and June 2020, including discharge summaries (3485), occupational therapy (1472) and physical therapy (2357) notes. Fourteen clinical experts reviewed notes to assign scores on the Glasgow Outcome Scale (GOS) with 4 classes, namely 'good recovery', 'moderate disability', 'severe disability', and 'death' and on the Modified Rankin Scale (mRS), with 7 classes, namely 'no symptoms', 'no significant disability', 'slight disability', 'moderate disability', 'moderately severe disability', 'severe disability', and 'death'. For 428 patients' notes, 2 experts scored the cases generating interrater reliability estimates for GOS and mRS. After preprocessing and extracting features from the notes, we trained a multiclass logistic regression model using LASSO regularization and 5-fold cross validation for hyperparameter tuning. The model performed well on the test set, achieving a micro average area under the receiver operating characteristic and F-score of 0.94 (95% CI 0.93-0.95) and 0.77 (0.75-0.80) for GOS, and 0.90 (0.89-0.91) and 0.59 (0.57-0.62) for mRS, respectively. Our work demonstrates that an NLP algorithm can accurately assign neurologic outcomes based on free text clinical notes. This algorithm increases the scale of research on neurological outcomes that is possible with EHR data.

摘要

出院时的神经功能障碍水平是许多临床研究中的一项重要结果。在临床试验之外,神经功能结局通常必须通过对电子健康记录(EHR)中的临床记录进行劳动强度大的人工审查来提取。为了克服这一挑战,我们着手开发一种自然语言处理(NLP)方法,该方法可以自动读取临床记录以确定神经功能结局,从而使开展更大规模的神经功能结局研究成为可能。我们从2012年1月至2020年6月期间在波士顿两家大型医院住院的3632名患者中获取了7314份记录,包括出院小结(3485份)、职业治疗记录(1472份)和物理治疗记录(2357份)。14名临床专家对记录进行审查,以在格拉斯哥预后量表(GOS)上进行评分,该量表有4个类别,即“良好恢复”、“中度残疾”、“重度残疾”和“死亡”;并在改良Rankin量表(mRS)上进行评分,该量表有7个类别,即“无症状”、“无明显残疾”、“轻度残疾”、“中度残疾”、“中度重度残疾”、“重度残疾”和“死亡”。对于428名患者的记录,由2名专家对病例进行评分,得出GOS和mRS的评分者间信度估计值。在对记录进行预处理和特征提取后,我们使用LASSO正则化和5折交叉验证进行超参数调整,训练了一个多类逻辑回归模型。该模型在测试集上表现良好,GOS的受试者工作特征曲线下的微平均面积和F值分别为0.94(95%CI 0.93 - 0.95)和0.77(0.75 - 0.80),mRS的分别为0.90(0.89 - 0.91)和0.59(0.57 - 0.62)。我们的工作表明,一种NLP算法可以根据自由文本临床记录准确地确定神经功能结局。该算法扩大了利用EHR数据进行神经功能结局研究的规模。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/b9766be9d745/nihms-1861832-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/d32729253290/nihms-1861832-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/200eafffa2e2/nihms-1861832-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/79a7e1184ba5/nihms-1861832-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/b9766be9d745/nihms-1861832-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/d32729253290/nihms-1861832-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/200eafffa2e2/nihms-1861832-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/79a7e1184ba5/nihms-1861832-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9656/9974159/b9766be9d745/nihms-1861832-f0004.jpg

相似文献

1
Classification of neurologic outcomes from medical notes using natural language processing.使用自然语言处理技术从医学记录中对神经学结果进行分类。
Expert Syst Appl. 2023 Mar 15;214. doi: 10.1016/j.eswa.2022.119171. Epub 2022 Nov 6.
2
Classification of the Disposition of Patients Hospitalized with COVID-19: Reading Discharge Summaries Using Natural Language Processing.COVID-19住院患者处置情况分类:使用自然语言处理技术阅读出院小结
JMIR Med Inform. 2021 Feb 10;9(2):e25457. doi: 10.2196/25457.
3
Comparing Natural Language Processing and Structured Medical Data to Develop a Computable Phenotype for Patients Hospitalized Due to COVID-19: Retrospective Analysis.比较自然语言处理和结构化医学数据以开发COVID-19住院患者的可计算表型:回顾性分析
JMIR Med Inform. 2023 Aug 22;11:e46267. doi: 10.2196/46267.
4
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
5
Finding Important Terms for Patients in Their Electronic Health Records: A Learning-to-Rank Approach Using Expert Annotations.在患者电子健康记录中查找重要术语:一种使用专家注释的排序学习方法。
JMIR Med Inform. 2016 Nov 30;4(4):e40. doi: 10.2196/medinform.6373.
6
Assessment of Natural Language Processing of Electronic Health Records to Measure Goals-of-Care Discussions as a Clinical Trial Outcome.评估电子健康记录中的自然语言处理以衡量作为临床试验结局的照护目标讨论。
JAMA Netw Open. 2023 Mar 1;6(3):e231204. doi: 10.1001/jamanetworkopen.2023.1204.
7
Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing.使用自然语言处理对住院小结中的阿片类药物使用障碍特征进行分类。
Front Public Health. 2022 May 9;10:850619. doi: 10.3389/fpubh.2022.850619. eCollection 2022.
8
Natural language processing of admission notes to predict severe maternal morbidity during the delivery encounter.入院记录的自然语言处理预测分娩时严重产妇发病率。
Am J Obstet Gynecol. 2022 Sep;227(3):511.e1-511.e8. doi: 10.1016/j.ajog.2022.04.008. Epub 2022 Apr 14.
9
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
10
Challenges of Developing a Natural Language Processing Method With Electronic Health Records to Identify Persons With Chronic Mobility Disability.开发一种使用电子健康记录识别慢性移动障碍患者的自然语言处理方法所面临的挑战。
Arch Phys Med Rehabil. 2020 Oct;101(10):1739-1746. doi: 10.1016/j.apmr.2020.04.024. Epub 2020 May 21.

引用本文的文献

1
A Machine Learning Approach for Identifying People With Neuroinfectious Diseases in Electronic Health Records: Algorithm Development and Validation.一种用于在电子健康记录中识别神经感染性疾病患者的机器学习方法:算法开发与验证
JMIR Med Inform. 2025 Aug 29;13:e63157. doi: 10.2196/63157.
2
Enhanced effective convolutional attention network with squeeze-and-excitation inception module for multi-label clinical document classification.基于挤压激励 inception 模块的增强型有效卷积注意力网络用于多标签临床文档分类
Sci Rep. 2025 May 16;15(1):16988. doi: 10.1038/s41598-025-98719-0.
3
Assessment of the Modified Rankin Scale in Electronic Health Records with a Fine-tuned Large Language Model.

本文引用的文献

1
Deep Transfer Learning Across Cancer Registries for Information Extraction from Pathology Reports.跨癌症登记处的深度迁移学习用于从病理报告中提取信息
IEEE EMBS Int Conf Biomed Health Inform. 2019 May;2019. doi: 10.1109/bhi.2019.8834586. Epub 2019 Sep 12.
2
Natural language processing of admission notes to predict severe maternal morbidity during the delivery encounter.入院记录的自然语言处理预测分娩时严重产妇发病率。
Am J Obstet Gynecol. 2022 Sep;227(3):511.e1-511.e8. doi: 10.1016/j.ajog.2022.04.008. Epub 2022 Apr 14.
3
Mixed-methods evaluation of three natural language processing modeling approaches for measuring documented goals-of-care discussions in the electronic health record.
使用微调的大语言模型评估电子健康记录中的改良Rankin量表。
medRxiv. 2025 May 2:2025.04.30.25326777. doi: 10.1101/2025.04.30.25326777.
4
Using Natural Language Processing and Machine Learning to classify the status of kidney allograft in Electronic Medical Records written in Spanish.使用自然语言处理和机器学习对西班牙语电子病历中同种异体肾移植的状态进行分类。
PLoS One. 2025 May 8;20(5):e0322587. doi: 10.1371/journal.pone.0322587. eCollection 2025.
5
Automated extraction of post-stroke functional outcomes from unstructured electronic health records.从非结构化电子健康记录中自动提取中风后功能结局
Eur Stroke J. 2025 Jan 22:23969873251314340. doi: 10.1177/23969873251314340.
6
Advancing equity in breast cancer care: natural language processing for analysing treatment outcomes in under-represented populations.推进乳腺癌护理中的公平性:自然语言处理分析代表性不足人群的治疗结果。
BMJ Health Care Inform. 2024 Jul 1;31(1):e100966. doi: 10.1136/bmjhci-2023-100966.
7
Large Language Models in Neurology Research and Future Practice.大语言模型在神经病学研究和未来实践中的应用。
Neurology. 2023 Dec 4;101(23):1058-1067. doi: 10.1212/WNL.0000000000207967.
8
AI and machine learning in resuscitation: Ongoing research, new concepts, and key challenges.复苏中的人工智能与机器学习:正在进行的研究、新概念及关键挑战。
Resusc Plus. 2023 Jul 28;15:100435. doi: 10.1016/j.resplu.2023.100435. eCollection 2023 Sep.
混合方法评估三种自然语言处理建模方法在电子健康记录中测量记录的目标关怀讨论的效果。
J Pain Symptom Manage. 2022 Jun;63(6):e713-e723. doi: 10.1016/j.jpainsymman.2022.02.006. Epub 2022 Feb 16.
4
Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm.通过头部CT报告的自然语言处理识别颅内占位效应:CTIME算法
Am J Emerg Med. 2022 Jan;51:388-392. doi: 10.1016/j.ajem.2021.11.001. Epub 2021 Nov 9.
5
Structuring clinical text with AI: Old versus new natural language processing techniques evaluated on eight common cardiovascular diseases.利用人工智能构建临床文本:在八种常见心血管疾病上对新旧自然语言处理技术进行评估
Patterns (N Y). 2021 Jun 17;2(7):100289. doi: 10.1016/j.patter.2021.100289. eCollection 2021 Jul 9.
6
Automated Annotation of Epileptiform Burden and Its Association with Outcomes.癫痫样负荷的自动标注及其与结局的关系。
Ann Neurol. 2021 Aug;90(2):300-311. doi: 10.1002/ana.26161. Epub 2021 Jul 20.
7
Early prediction of diagnostic-related groups and estimation of hospital cost by processing clinical notes.通过处理临床记录对诊断相关分组进行早期预测并估算医院成本。
NPJ Digit Med. 2021 Jul 1;4(1):103. doi: 10.1038/s41746-021-00474-9.
8
Extracting Drug Names and Associated Attributes From Discharge Summaries: Text Mining Study.从出院小结中提取药物名称及相关属性:文本挖掘研究
JMIR Med Inform. 2021 May 5;9(5):e24678. doi: 10.2196/24678.
9
Selection of Clinical Text Features for Classifying Suicide Attempts.临床文本特征选择用于自杀未遂分类。
AMIA Annu Symp Proc. 2021 Jan 25;2020:273-282. eCollection 2020.
10
Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.对研究不足的医学概念领域进行自动编码:将身体活动报告与《国际功能、残疾和健康分类》相联系。
Front Digit Health. 2021 Mar;3. doi: 10.3389/fdgth.2021.620828. Epub 2021 Mar 10.