Ma Xuefeng, Yang Chao, Liang Kun, Sun Baokai, Jin Wenwen, Chen Lizhen, Dong Mengzhen, Liu Shousheng, Xin Yongning, Zhuang Likun
Department of Infectious Disease, Qingdao Municipal Hospital, Qingdao University Qingdao 266000, Shandong, China.
Department of Infectious Disease, The Affiliated Hospital of Qingdao University Qingdao 266000, Shandong, China.
Am J Transl Res. 2021 Nov 15;13(11):12704-12713. eCollection 2021.
Diagnostic markers for non-alcoholic fatty liver disease (NAFLD) are still needed for screening individuals at risk. In recent years, the machine learning method was used to search for the diagnostic markers of multiple diseases. In this study, we developed and validated a machine learning model to diagnose NAFLD using laboratory indicators. NAFLD patients and non-NAFLD controls were recruited in the training and validation cohorts. The laboratory indicators of the participants in the training cohort were collected, and six indicators including alanine aminotransferase/aspartate aminotransferase (ALT/AST), white blood cells (WBC), alpha-L-fucosidase (AFU), hemoglobin (Hb), triglycerides (TG) and gamma-glutamyl transpeptidase (GGT) were screened out with higher weights by an integrate machine learning method. The areas under the receiver operating characteristic curves (AUROCs) for the selected indicators using logistic regression (LR), random forest (RF) and support vector machine (SVM) were 0.814, 0.837 and 0.810, respectively. Then the binary logistic regression was used to construct the predictive model. What's more, the AUROC of the predicted model was 0.732 in the validation cohort of patients with NAFLD. And the combined AUROC of the six parameters was 0.716 in the mouse model fed with high-fat diet (HFD). In summary, we created a predictive model with six laboratory indicators for the diagnosis of NAFLD based on the machine learning method, which has the potential value for the diagnosis of the NAFLD.
非酒精性脂肪性肝病(NAFLD)的诊断标志物对于筛查高危个体仍然是必要的。近年来,机器学习方法被用于寻找多种疾病的诊断标志物。在本研究中,我们开发并验证了一种使用实验室指标诊断NAFLD的机器学习模型。在训练和验证队列中招募了NAFLD患者和非NAFLD对照。收集了训练队列中参与者的实验室指标,并通过集成机器学习方法筛选出六个权重较高的指标,包括丙氨酸氨基转移酶/天冬氨酸氨基转移酶(ALT/AST)、白细胞(WBC)、α-L-岩藻糖苷酶(AFU)、血红蛋白(Hb)、甘油三酯(TG)和γ-谷氨酰转肽酶(GGT)。使用逻辑回归(LR)、随机森林(RF)和支持向量机(SVM)对所选指标的受试者工作特征曲线下面积(AUROC)分别为0.814、0.837和0.810。然后使用二元逻辑回归构建预测模型。此外,在NAFLD患者的验证队列中,预测模型的AUROC为0.732。在高脂饮食(HFD)喂养的小鼠模型中,六个参数的联合AUROC为0.716。总之,我们基于机器学习方法创建了一个具有六个实验室指标的NAFLD诊断预测模型,该模型对NAFLD的诊断具有潜在价值。