Zhao Yunsong, Ren Bin, Yu Wenjin, Zhang Haijun, Zhao Di, Lv Junchao, Xie Zhen, Jiang Kun, Shang Lei, Yao Han, Xu Yongyong, Zhao Gang
Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
Department of Information, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
Neurol Ther. 2022 Sep;11(3):1117-1134. doi: 10.1007/s40120-022-00355-7. Epub 2022 May 11.
Early diagnosis and etiological treatment can effectively improve the prognosis of patients with autoimmune encephalitis (AE). However, anti-neuronal antibody tests which provide the definitive diagnosis require time and are not always abnormal. By using natural language processing (NLP) technology, our study proposes an assisted diagnostic method for early clinical diagnosis of AE and compares its sensitivity with that of previously established criteria.
Our model is based on the text classification model trained by the history of present illness (HPI) in electronic medical records (EMRs) that present a definite pathological diagnosis of AE or infectious encephalitis (IE). The definitive diagnosis of IE was based on the results of traditional etiological examinations. The definitive diagnosis of AE was based on the results of neuronal antibodies, and the diagnostic criteria of definite autoimmune limbic encephalitis proposed by Graus et al. used as the reference standard for antibody-negative AE. First, we automatically recognized and extracted symptoms for all HPI texts in EMRs by training a dataset of 552 cases. Second, four text classification models trained by a dataset of 199 cases were established for differential diagnosis of AE and IE based on a post-structuring text dataset of every HPI, which was completed using symptoms in English language after the process of normalization of synonyms. The optimal model was identified by evaluating and comparing the performance of the four models. Finally, combined with three typical symptoms and the results of standard paraclinical tests such as cerebrospinal fluid (CSF), magnetic resonance imaging (MRI), or electroencephalogram (EEG) proposed from Graus criteria, an assisted early diagnostic model for AE was established on the basis of the text classification model with the best performance.
The comparison results for the four models applied to the independent testing dataset showed the naïve Bayesian classifier with bag of words achieved the best performance, with an area under the receiver operating characteristic curve of 0.85, accuracy of 84.5% (95% confidence interval [CI] 74.0-92.0%), sensitivity of 86.7% (95% CI 69.3-96.2%), and specificity of 82.9% (95% CI 67.9-92.8%), respectively. Compared with the diagnostic criteria proposed previously, the early diagnostic sensitivity for possible AE using the assisted diagnostic model based on the independent testing dataset was improved from 73.3% (95% CI 54.1-87.7%) to 86.7% (95% CI 69.3-96.2%).
The assisted diagnostic model could effectively increase the early diagnostic sensitivity for AE compared to previous diagnostic criteria, assist physicians in establishing the diagnosis of AE automatically after inputting the HPI and the results of standard paraclinical tests according to their narrative habits for describing symptoms, avoiding misdiagnosis and allowing for prompt initiation of specific treatment.
早期诊断和病因治疗可有效改善自身免疫性脑炎(AE)患者的预后。然而,提供确诊诊断的抗神经元抗体检测需要时间,且并非总是异常。通过使用自然语言处理(NLP)技术,我们的研究提出了一种用于AE早期临床诊断的辅助诊断方法,并将其敏感性与先前确立的标准进行比较。
我们的模型基于由电子病历(EMR)中现病史(HPI)训练的文本分类模型,这些病历呈现出AE或感染性脑炎(IE)的明确病理诊断。IE的确诊基于传统病因学检查结果。AE的确诊基于神经元抗体结果,Graus等人提出的明确自身免疫性边缘性脑炎的诊断标准用作抗体阴性AE的参考标准。首先,我们通过训练一个包含552例病例的数据集,自动识别并提取EMR中所有HPI文本的症状。其次,基于每个HPI的结构化后文本数据集,使用同义词归一化后用英语表示的症状,建立了四个由199例病例数据集训练的文本分类模型,用于AE和IE的鉴别诊断。通过评估和比较这四个模型的性能来确定最佳模型。最后,结合Graus标准提出的三个典型症状以及脑脊液(CSF)、磁共振成像(MRI)或脑电图(EEG)等标准辅助检查结果,在性能最佳的文本分类模型基础上建立了AE的辅助早期诊断模型。
应用于独立测试数据集的四个模型的比较结果显示,词袋朴素贝叶斯分类器表现最佳,其受试者操作特征曲线下面积为0.85,准确率为84.5%(95%置信区间[CI]74.0 - 92.0%),敏感性为86.7%(95%CI 69.3 - 96.2%),特异性为82.9%(95%CI 67.9 - 92.8%)。与先前提出的诊断标准相比,使用基于独立测试数据集的辅助诊断模型对可能的AE的早期诊断敏感性从73.3%(95%CI 54.1 - 87.7%)提高到86.7%(95%CI 69.3 - 96.2%)。
与先前的诊断标准相比,辅助诊断模型可有效提高AE的早期诊断敏感性,帮助医生在输入HPI和标准辅助检查结果后,根据他们描述症状的叙述习惯自动建立AE诊断,避免误诊并允许及时开始特异性治疗。