使用基于自然语言处理的脑磁共振成像放射学报告机器学习预测卒中结局

Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.

作者信息

Heo Tak Sung, Kim Yu Seop, Choi Jeong Myeong, Jeong Yeong Seok, Seo Soo Young, Lee Jun Ho, Jeon Jin Pyeong, Kim Chulho

机构信息

Department of Convergence Software, Hallym University, Chuncheon 24252, Korea.

Department of Otorhinolaryngology and Head and Neck Surgery, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea.

出版信息

J Pers Med. 2020 Dec 16;10(4):286. doi: 10.3390/jpm10040286.

DOI:10.3390/jpm10040286

PMID:33339385

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7766032/

Abstract

Brain magnetic resonance imaging (MRI) is useful for predicting the outcome of patients with acute ischemic stroke (AIS). Although deep learning (DL) using brain MRI with certain image biomarkers has shown satisfactory results in predicting poor outcomes, no study has assessed the usefulness of natural language processing (NLP)-based machine learning (ML) algorithms using brain MRI free-text reports of AIS patients. Therefore, we aimed to assess whether NLP-based ML algorithms using brain MRI text reports could predict poor outcomes in AIS patients. This study included only English text reports of brain MRIs examined during admission of AIS patients. Poor outcome was defined as a modified Rankin Scale score of 3-6, and the data were captured by trained nurses and physicians. We only included MRI text report of the first MRI scan during the admission. The text dataset was randomly divided into a training and test dataset with a 7:3 ratio. Text was vectorized to word, sentence, and document levels. In the word level approach, which did not consider the sequence of words, and the "bag-of-words" model was used to reflect the number of repetitions of text token. The "sent2vec" method was used in the sensation-level approach considering the sequence of words, and the word embedding was used in the document level approach. In addition to conventional ML algorithms, DL algorithms such as the convolutional neural network (CNN), long short-term memory, and multilayer perceptron were used to predict poor outcomes using 5-fold cross-validation and grid search techniques. The performance of each ML classifier was compared with the area under the receiver operating characteristic (AUROC) curve. Among 1840 subjects with AIS, 645 patients (35.1%) had a poor outcome 3 months after the stroke onset. Random forest was the best classifier (0.782 of AUROC) using a word-level approach. Overall, the document-level approach exhibited better performance than did the word- or sentence-level approaches. Among all the ML classifiers, the multi-CNN algorithm demonstrated the best classification performance (0.805), followed by the CNN (0.799) algorithm. When predicting future clinical outcomes using NLP-based ML of radiology free-text reports of brain MRI, DL algorithms showed superior performance over the other ML algorithms. In particular, the prediction of poor outcomes in document-level NLP DL was improved more by multi-CNN and CNN than by recurrent neural network-based algorithms. NLP-based DL algorithms can be used as an important digital marker for unstructured electronic health record data DL prediction.

摘要

脑磁共振成像（MRI）有助于预测急性缺血性卒中（AIS）患者的预后。尽管利用带有特定图像生物标志物的脑MRI进行深度学习（DL）在预测不良预后方面已显示出令人满意的结果，但尚无研究评估基于自然语言处理（NLP）的机器学习（ML）算法对AIS患者脑MRI自由文本报告的实用性。因此，我们旨在评估基于NLP的使用脑MRI文本报告的ML算法能否预测AIS患者的不良预后。本研究仅纳入了AIS患者入院期间所做脑MRI的英文文本报告。不良预后定义为改良Rankin量表评分为3 - 6分，数据由经过培训的护士和医生收集。我们仅纳入了入院期间首次MRI扫描的文本报告。文本数据集以7:3的比例随机分为训练集和测试集。文本在词、句子和文档层面进行向量化。在不考虑词序的词层面方法中，使用“词袋”模型来反映文本标记的重复次数。在考虑词序的句子层面方法中使用“sent2vec”方法，在文档层面方法中使用词嵌入。除了传统的ML算法外，还使用了卷积神经网络（CNN）、长短期记忆网络和多层感知器等DL算法，通过5折交叉验证和网格搜索技术来预测不良预后。将每个ML分类器的性能与受试者工作特征（AUROC）曲线下面积进行比较。在1840例AIS患者中，645例（35.1%）在卒中发病3个月后出现不良预后。随机森林是在词层面方法中表现最佳的分类器（AUROC为0.782）。总体而言，文档层面方法的表现优于词或句子层面方法。在所有ML分类器中，多CNN算法表现出最佳的分类性能（0.805），其次是CNN算法（0.799）。当使用基于NLP的脑MRI放射学自由文本报告的ML来预测未来临床结果时，DL算法表现优于其他ML算法。特别是，与基于循环神经网络的算法相比，多CNN和CNN在文档层面NLP DL中对不良预后的预测改善更为明显。基于NLP的DL算法可作为非结构化电子健康记录数据DL预测的重要数字标志物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9d3/7766032/7ce1df683ae8/jpm-10-00286-g001.jpg

相似文献

Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.使用基于自然语言处理的脑磁共振成像放射学报告机器学习预测卒中结局

J Pers Med. 2020 Dec 16;10(4):286. doi: 10.3390/jpm10040286.

Natural language processing and machine learning algorithm to identify brain MRI reports with acute ischemic stroke.自然语言处理和机器学习算法识别急性缺血性脑卒中的脑部 MRI 报告。

PLoS One. 2019 Feb 28;14(2):e0212778. doi: 10.1371/journal.pone.0212778. eCollection 2019.

Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach.基于神经网络序列标注方法的乳腺 X 线摄影筛查报告的全面词级分类。

J Digit Imaging. 2019 Oct;32(5):685-692. doi: 10.1007/s10278-018-0141-4.

Natural Language Processing for Automated Quantification of Brain Metastases Reported in Free-Text Radiology Reports.用于对自由文本放射学报告中报告的脑转移瘤进行自动定量的自然语言处理

JCO Clin Cancer Inform. 2019 Apr;3:1-9. doi: 10.1200/CCI.18.00138.

A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。

BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.

Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.基于机器学习的自然语言处理方法对临床笔记进行医学子域分类。

BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.

Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。

J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

Machine learning and natural language processing methods to identify ischemic stroke, acuity and location from radiology reports.基于机器学习和自然语言处理方法，从放射学报告中识别缺血性脑卒中、发病急缓和病变部位。

PLoS One. 2020 Jun 19;15(6):e0234908. doi: 10.1371/journal.pone.0234908. eCollection 2020.

Early Prediction of Functional Outcomes After Acute Ischemic Stroke Using Unstructured Clinical Text: Retrospective Cohort Study.使用非结构化临床文本对急性缺血性中风后功能结局的早期预测：回顾性队列研究

JMIR Med Inform. 2022 Feb 17;10(2):e29806. doi: 10.2196/29806.

Natural Language Processing for the Identification of Silent Brain Infarcts From Neuroimaging Reports.用于从神经影像报告中识别无症状脑梗死的自然语言处理

JMIR Med Inform. 2019 Apr 21;7(2):e12109. doi: 10.2196/12109.

引用本文的文献

Automatic prediction of stroke treatment outcomes: latest advances and perspectives.中风治疗结果的自动预测：最新进展与展望。

Biomed Eng Lett. 2025 Feb 17;15(3):467-488. doi: 10.1007/s13534-025-00462-y. eCollection 2025 May.

Automated extraction of post-stroke functional outcomes from unstructured electronic health records.从非结构化电子健康记录中自动提取中风后功能结局

Eur Stroke J. 2025 Jan 22:23969873251314340. doi: 10.1177/23969873251314340.

Extraction of Radiological Characteristics From Free-Text Imaging Reports Using Natural Language Processing Among Patients With Ischemic and Hemorrhagic Stroke: Algorithm Development and Validation.使用自然语言处理从缺血性和出血性中风患者的自由文本影像报告中提取放射学特征：算法开发与验证

JMIR AI. 2023 Jun 6;2:e42884. doi: 10.2196/42884.

A weakly supervised deep learning model integrating noncontrasted computed tomography images and clinical factors facilitates haemorrhagic transformation prediction after intravenous thrombolysis in acute ischaemic stroke patients.一种弱监督深度学习模型，整合非对比计算机断层扫描图像和临床因素，有助于预测急性缺血性脑卒中患者静脉溶栓后出血性转化。

Biomed Eng Online. 2023 Dec 19;22(1):129. doi: 10.1186/s12938-023-01193-w.

Emerging frontiers of artificial intelligence and machine learning in ischemic stroke: a comprehensive investigation of state-of-the-art methodologies, clinical applications, and unraveling challenges.人工智能和机器学习在缺血性中风领域的新兴前沿：对前沿方法、临床应用及未解挑战的全面调查

EPMA J. 2023 Nov 2;14(4):645-661. doi: 10.1007/s13167-023-00343-3. eCollection 2023 Dec.

Integrative Approaches in Acute Ischemic Stroke: From Symptom Recognition to Future Innovations.急性缺血性卒中的综合治疗方法：从症状识别到未来创新

Biomedicines. 2023 Sep 23;11(10):2617. doi: 10.3390/biomedicines11102617.

Applications of Natural Language Processing for the Management of Stroke Disorders: Scoping Review.自然语言处理在中风疾病管理中的应用：范围综述

JMIR Med Inform. 2023 Sep 6;11:e48693. doi: 10.2196/48693.

Natural Language Processing Methods to Identify Oncology Patients at High Risk for Acute Care with Clinical Notes.利用临床记录识别急性护理高风险肿瘤患者的自然语言处理方法

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16;2023:138-147. eCollection 2023.

Demystifying the Role of Natural Language Processing (NLP) in Smart City Applications: Background, Motivation, Recent Advances, and Future Research Directions.揭开自然语言处理（NLP）在智慧城市应用中的作用：背景、动机、最新进展及未来研究方向

Wirel Pers Commun. 2023;130(2):857-908. doi: 10.1007/s11277-023-10312-8. Epub 2023 Mar 16.

Big Data in Stroke: How to Use Big Data to Make the Next Management Decision.大数据与脑卒中：如何利用大数据做出下一步管理决策

Neurotherapeutics. 2023 Apr;20(3):744-757. doi: 10.1007/s13311-023-01358-4. Epub 2023 Mar 10.

本文引用的文献

Proximal hyper-intense vessel sign on initial FLAIR MRI in hyper-acute middle cerebral artery ischemic stroke: a retrospective observational study.初始 FLAIR MRI 上超急性大脑中动脉缺血性卒中的近端高信号血管征：一项回顾性观察研究。

Acta Radiol. 2021 Jul;62(7):922-931. doi: 10.1177/0284185120946718. Epub 2020 Aug 6.

PLoS One. 2020 Jun 19;15(6):e0234908. doi: 10.1371/journal.pone.0234908. eCollection 2020.

The Role of Hemorrhagic Transformation in Acute Ischemic Stroke Upon Clinical Complications and Outcomes.出血性转化在急性缺血性卒中临床并发症及预后中的作用。

J Stroke Cerebrovasc Dis. 2020 Aug;29(8):104898. doi: 10.1016/j.jstrokecerebrovasdis.2020.104898. Epub 2020 May 13.

Clinical Text Data in Machine Learning: Systematic Review.机器学习中的临床文本数据：系统综述

JMIR Med Inform. 2020 Mar 31;8(3):e17984. doi: 10.2196/17984.

Use of Deep Learning to Predict Final Ischemic Stroke Lesions From Initial Magnetic Resonance Imaging.利用深度学习从初始磁共振成像预测最终缺血性卒中病灶。

JAMA Netw Open. 2020 Mar 2;3(3):e200772. doi: 10.1001/jamanetworkopen.2020.0772.

Global, Regional and Country-Specific Burden of Ischaemic Stroke, Intracerebral Haemorrhage and Subarachnoid Haemorrhage: A Systematic Analysis of the Global Burden of Disease Study 2017.全球、区域和国家特定缺血性卒中、脑出血和蛛网膜下腔出血负担：2017 年全球疾病负担研究的系统分析。

Neuroepidemiology. 2020;54(2):171-179. doi: 10.1159/000506396. Epub 2020 Feb 20.

Evaluation of machine learning methods to stroke outcome prediction using a nationwide disease registry.利用全国性疾病登记系统评估机器学习方法对脑卒中结局的预测。

Comput Methods Programs Biomed. 2020 Jul;190:105381. doi: 10.1016/j.cmpb.2020.105381. Epub 2020 Feb 1.

Data-efficient deep learning of radiological image data for outcome prediction after endovascular treatment of patients with acute ischemic stroke.基于深度学习的医学影像数据的高效利用用于预测急性缺血性脑卒中患者血管内治疗后的结局。

Comput Biol Med. 2019 Dec;115:103516. doi: 10.1016/j.compbiomed.2019.103516. Epub 2019 Oct 22.

Admission Diffusion-Weighted Imaging Lesion Volume in Patients With Large Vessel Occlusion Stroke and Alberta Stroke Program Early CT Score of ≥6 Points: Serial Computed Tomography-Magnetic Resonance Imaging Collateral Measurements.大血管闭塞性卒中患者入院时的弥散加权成像病灶体积与 Alberta 卒中项目早期 CT 评分≥6 分：连续 CT-磁共振成像侧支测量。

Stroke. 2019 Nov;50(11):3115-3120. doi: 10.1161/STROKEAHA.119.026229. Epub 2019 Sep 26.

Automating Ischemic Stroke Subtype Classification Using Machine Learning and Natural Language Processing.使用机器学习和自然语言处理实现缺血性中风亚型分类的自动化

J Stroke Cerebrovasc Dis. 2019 Jul;28(7):2045-2051. doi: 10.1016/j.jstrokecerebrovasdis.2019.02.004. Epub 2019 May 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用基于自然语言处理的脑磁共振成像放射学报告机器学习预测卒中结局

Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献