[基于人工智能深度因子分解机（DeepFM）算法的脑卒中患者非侵入性院前筛查模型的构建与外部验证：一项研究]

[Construction and external validation of a non-invasive pre-hospital screening model for stroke patients: a study based on artificial intelligence DeepFM algorithm].

作者信息

Liu Chenyu, Zhang Ce, Chi Yuanhui, Ma Chunye, Zhang Lihong, Chen Shuliang

机构信息

Party and Government Office, the Second Affiliated Hospital of Dalian Medical University, Dalian 116027, Liaoning, China.

Department of Neurology, the Second Affiliated Hospital of Dalian Medical University, Dalian 116027, Liaoning, China.

出版信息

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Nov;36(11):1163-1168. doi: 10.3760/cma.j.cn121430-20240526-00461.

DOI:10.3760/cma.j.cn121430-20240526-00461

PMID:39697021

Abstract

OBJECTIVE

To construct a non-invasive pre-hospital screening model and early based on artificial intelligence algorithms to provide the severity of stroke in patients, provide screening, guidance and early warning for stroke patients and their families, and provide data support for clinical decision-making.

METHODS

A retrospective study was conducted. The clinical information of stroke patients (n = 53 793) were extracted from the Yidu cloud big data server system of the Second Affiliated Hospital of Dalian Medical University from January 1, 2001 to July 31, 2023. Combined with the results of single factor screening and the opinions of experts with senior professional titles in neurology, the input variable was determined, and the output variable was the National Institutes of Health Stroke Scale (NIHSS) representing the severity of the disease at admission. Python 3.7 was used to build DeepFM algorithm model, and five data mining models including Logistic regression, CART decision tree, C5.0 decision tree, Bayesian network and deep neural network (DNN) were built at the same time. The original data were randomly divided into 80% training set and 20% test set, which were used to train and test the models, adjust the parameters of each model, respectively calculate the accuracy, sensitivity and F-index of the six models, carry out the comprehensive comparison and evaluation of the model. The receiver operator characteristic curve (ROC curve) and calibration curve were drawn, compared the prediction performance of DeepFM model and the other five algorithms. In addition, the data of stroke patients (n = 1 028) were extracted from Dalian Central Hospital for external verification of the model.

RESULTS

A total of 14 015 stroke patients with complete information were selected, including 11 212 in the training set and 2 803 in the testing set. After univariate screening, 14 indicators were included to construct the model, including gender, age, recurrence, physical impairment, facial problems, speech disorders, head reactions, disturbance of consciousness, visual disorders, abnormal cough and swallowing, high risk factor, family history, smoking history and drinking history. DeepFM model adopted the two-order crossover feature. The number of hidden layers in DNN layer was 3. Dropout was used to discard the neurons in the neural network. Rule was used as the activation function. Each layer used Dense full connection. The objective function was random gradient descent. The number of iterations was 15. There were 133 922 training parameters in total. Comparing the predictive value of the six models showed that the accuracy of DeepFM model was 0.951, the sensitivity was 0.992, the specificity was 0.814, the F-index was 0.950, and the area under the curve (AUC) was 0.916. The accuracy of the other five data mining models was between 0.771-0.780, the sensitivity was between 0.978-0.987, the F-index was between 0.690-0.707, and the AUC was between 0.568-0.639. The calibration curve of the DeepFM model was more aligned with the ideal curve than the other five data mining models. Suggesting that the prediction performance of DeepFM model was the best. External validation was conducted on the DeepFM model, and its accuracy was 0.891, indicating good generalization performance of the model.

CONCLUSIONS

The pre-hospital non-invasive screening prediction model based on DeepFM can accurately predict the severity grading of stroke patients, and has potential application value in rapid screening and early clinical decision-making of stroke.

摘要

目的

构建基于人工智能算法的无创院前筛查模型，对脑卒中患者病情严重程度进行预判，为脑卒中患者及其家属提供筛查、指导与预警，并为临床决策提供数据支持。

方法

进行回顾性研究。从大连医科大学附属第二医院医渡云大数据服务器系统中提取2001年1月1日至2023年7月31日脑卒中患者（n = 53793）的临床信息。结合单因素筛查结果及神经内科高级职称专家意见确定输入变量，输出变量为代表入院时疾病严重程度的美国国立卫生研究院卒中量表（NIHSS）。使用Python 3.7构建深度因子分解机（DeepFM）算法模型，同时构建逻辑回归、分类与回归树（CART）决策树、C5.0决策树、贝叶斯网络和深度神经网络（DNN）5种数据挖掘模型。将原始数据随机分为80%训练集和20%测试集，分别用于训练和测试模型，调整各模型参数，分别计算6种模型的准确率、灵敏度和F指标，对模型进行综合比较与评价。绘制受试者工作特征曲线（ROC曲线）和校准曲线，比较DeepFM模型与其他5种算法的预测性能。此外，从大连市中心医院提取脑卒中患者（n = 1028）数据对模型进行外部验证。

结果

共纳入14015例信息完整的脑卒中患者，其中训练集11212例，测试集2803例。单因素筛查后纳入14项指标构建模型，包括性别、年龄、复发情况、身体残疾、面部问题、言语障碍、头部反应、意识障碍、视觉障碍、异常咳嗽与吞咽、高危因素、家族史、吸烟史和饮酒史。DeepFM模型采用二阶交叉特征。DNN层隐藏层数为3层。使用随机失活（Dropout）舍弃神经网络中的神经元。使用线性整流函数（ReLU）作为激活函数。每层使用全连接层（Dense）。目标函数为随机梯度下降。迭代次数为15次。共有133922个训练参数。比较6种模型预测值，结果显示DeepFM模型准确率为0.951，灵敏度为0.992，特异度为0.814，F指标为0.950，曲线下面积（AUC）为0.916。其他5种数据挖掘模型准确率在0.771 - 0.780之间，灵敏度在0.978 - 0.987之间，F指标在0.690 - 0.707之间，AUC在0.568 - 0.639之间。DeepFM模型校准曲线比其他5种数据挖掘模型更贴近理想曲线。提示DeepFM模型预测性能最佳。对DeepFM模型进行外部验证，其准确率为0.891，表明模型具有良好的泛化性能。