Heo Tak Sung, Kim Yu Seop, Choi Jeong Myeong, Jeong Yeong Seok, Seo Soo Young, Lee Jun Ho, Jeon Jin Pyeong, Kim Chulho
Department of Convergence Software, Hallym University, Chuncheon 24252, Korea.
Department of Otorhinolaryngology and Head and Neck Surgery, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea.
J Pers Med. 2020 Dec 16;10(4):286. doi: 10.3390/jpm10040286.
Brain magnetic resonance imaging (MRI) is useful for predicting the outcome of patients with acute ischemic stroke (AIS). Although deep learning (DL) using brain MRI with certain image biomarkers has shown satisfactory results in predicting poor outcomes, no study has assessed the usefulness of natural language processing (NLP)-based machine learning (ML) algorithms using brain MRI free-text reports of AIS patients. Therefore, we aimed to assess whether NLP-based ML algorithms using brain MRI text reports could predict poor outcomes in AIS patients. This study included only English text reports of brain MRIs examined during admission of AIS patients. Poor outcome was defined as a modified Rankin Scale score of 3-6, and the data were captured by trained nurses and physicians. We only included MRI text report of the first MRI scan during the admission. The text dataset was randomly divided into a training and test dataset with a 7:3 ratio. Text was vectorized to word, sentence, and document levels. In the word level approach, which did not consider the sequence of words, and the "bag-of-words" model was used to reflect the number of repetitions of text token. The "sent2vec" method was used in the sensation-level approach considering the sequence of words, and the word embedding was used in the document level approach. In addition to conventional ML algorithms, DL algorithms such as the convolutional neural network (CNN), long short-term memory, and multilayer perceptron were used to predict poor outcomes using 5-fold cross-validation and grid search techniques. The performance of each ML classifier was compared with the area under the receiver operating characteristic (AUROC) curve. Among 1840 subjects with AIS, 645 patients (35.1%) had a poor outcome 3 months after the stroke onset. Random forest was the best classifier (0.782 of AUROC) using a word-level approach. Overall, the document-level approach exhibited better performance than did the word- or sentence-level approaches. Among all the ML classifiers, the multi-CNN algorithm demonstrated the best classification performance (0.805), followed by the CNN (0.799) algorithm. When predicting future clinical outcomes using NLP-based ML of radiology free-text reports of brain MRI, DL algorithms showed superior performance over the other ML algorithms. In particular, the prediction of poor outcomes in document-level NLP DL was improved more by multi-CNN and CNN than by recurrent neural network-based algorithms. NLP-based DL algorithms can be used as an important digital marker for unstructured electronic health record data DL prediction.
脑磁共振成像(MRI)有助于预测急性缺血性卒中(AIS)患者的预后。尽管利用带有特定图像生物标志物的脑MRI进行深度学习(DL)在预测不良预后方面已显示出令人满意的结果,但尚无研究评估基于自然语言处理(NLP)的机器学习(ML)算法对AIS患者脑MRI自由文本报告的实用性。因此,我们旨在评估基于NLP的使用脑MRI文本报告的ML算法能否预测AIS患者的不良预后。本研究仅纳入了AIS患者入院期间所做脑MRI的英文文本报告。不良预后定义为改良Rankin量表评分为3 - 6分,数据由经过培训的护士和医生收集。我们仅纳入了入院期间首次MRI扫描的文本报告。文本数据集以7:3的比例随机分为训练集和测试集。文本在词、句子和文档层面进行向量化。在不考虑词序的词层面方法中,使用“词袋”模型来反映文本标记的重复次数。在考虑词序的句子层面方法中使用“sent2vec”方法,在文档层面方法中使用词嵌入。除了传统的ML算法外,还使用了卷积神经网络(CNN)、长短期记忆网络和多层感知器等DL算法,通过5折交叉验证和网格搜索技术来预测不良预后。将每个ML分类器的性能与受试者工作特征(AUROC)曲线下面积进行比较。在1840例AIS患者中,645例(35.1%)在卒中发病3个月后出现不良预后。随机森林是在词层面方法中表现最佳的分类器(AUROC为0.782)。总体而言,文档层面方法的表现优于词或句子层面方法。在所有ML分类器中,多CNN算法表现出最佳的分类性能(0.805),其次是CNN算法(0.799)。当使用基于NLP的脑MRI放射学自由文本报告的ML来预测未来临床结果时,DL算法表现优于其他ML算法。特别是,与基于循环神经网络的算法相比,多CNN和CNN在文档层面NLP DL中对不良预后的预测改善更为明显。基于NLP的DL算法可作为非结构化电子健康记录数据DL预测的重要数字标志物。