Department of Neurology, Brain Hospital Affiliated to Nanjing Medical University, Nanjing 210029, China.
Department of Computer Science and Technology,Nanjing University, Nanjing 210093, China.
Chin Med Sci J. 2020 Dec 31;35(4):297-305. doi: 10.24920/003703.
Objective Asymptomatic carotid stenosis (ACS) is closely associated to the incidence of severe cerebrovascular diseases. Early identifying the individuals with ACS and its associated risk factors could be beneficial for primary prevention of stroke. This study aimed to investigate a machine-learning algorithm for the detection of ACS among high-risk population of stroke based on the associated risk factors.Methods A novel model of machine learning was utilized to screen the associated predictors of ACS based on 30 potential risk factors. The algorithm of this model adopted a random forest pattern based on the training data and then was verified using the testing data. All of the original data were retrieved from the China National Stroke Screening and Prevention Project (CNSSPP), including demographic, clinical and laboratory characteristics. The individuals with high risk of stroke were enrolled and randomly divided into a training group and a testing group at a ratio of 4:1. The identification of carotid stenosis by carotid artery duplex scans was set as the golden standard. The receiver operating characteristic (ROC) curve and the area under the curve (AUC) was used to evaluate the efficacy of the model in detecting ACS.Results Of 2841 high risk individual of stroke enrolled, 326 (11.6%) were diagnosed as ACS by ultrasonography. The top five risk factors contributing to ACS in this model were identified as family history of dyslipidemia, high level of low-density lipoprotein cholesterol (LDL-c), low level of high-density lipoprotein cholesterol (HDL-c), aging, and low body mass index (BMI). Their weights were 11.8%, 7.6%, 7.1%, 6.1%, and 6.1%, respectively. The total weight of the top 15 risk factors was 85.5%. The AUC values of the model for detecting ACS with training dataset and testing dataset were 0.927 and 0.888, respectively.Conclusions This study demonstrated that the machine-learning algorithm could be used to identify the risk factors for ACS among high risk population of stroke. Family history of dyslipidemia may be the most important risk factor for ACS. This model could be a suitable tool to optimize the clinical approach for the primary prevention of stroke.
无症状性颈动脉狭窄(ACS)与严重脑血管疾病的发生密切相关。早期识别 ACS 及其相关危险因素,有利于脑卒中的一级预防。本研究旨在探讨基于相关危险因素的机器学习算法,用于筛查脑卒中高危人群中的 ACS。
利用一种新的机器学习模型,根据 30 个潜在危险因素筛选 ACS 的相关预测因素。该模型的算法采用基于训练数据的随机森林模式,然后使用测试数据进行验证。所有原始数据均来自中国国家脑卒中筛查与防治工程(CNSSPP),包括人口统计学、临床和实验室特征。入选高危脑卒中患者,按 4:1 的比例随机分为训练组和测试组。以颈动脉双功能超声检查确定的颈动脉狭窄为金标准。采用受试者工作特征(ROC)曲线和曲线下面积(AUC)评估模型对 ACS 的检测效能。
共纳入 2841 例高危脑卒中患者,326 例(11.6%)经超声检查诊断为 ACS。该模型中对 ACS 贡献最大的前 5 个危险因素分别为血脂异常家族史、低密度脂蛋白胆固醇(LDL-c)水平升高、高密度脂蛋白胆固醇(HDL-c)水平降低、年龄增长和低体重指数(BMI),权重分别为 11.8%、7.6%、7.1%、6.1%和 6.1%,前 15 个危险因素的总权重为 85.5%。该模型用于检测训练数据集和测试数据集 ACS 的 AUC 值分别为 0.927 和 0.888。
本研究表明,机器学习算法可用于识别脑卒中高危人群中 ACS 的危险因素。血脂异常家族史可能是 ACS 的最重要危险因素。该模型可能是优化脑卒中一级预防临床策略的合适工具。