Occupational Disease Department, Hangzhou Occupational Disease Prevention and Control Hospital, Hangzhou, China.
Metabolic Disease Center, Affiliated Hospital of Hangzhou Normal University, Hangzhou, China.
Front Public Health. 2021 Oct 12;9:743731. doi: 10.3389/fpubh.2021.743731. eCollection 2021.
Metabolic syndrome (MS) screening is essential for the early detection of the occupational population. This study aimed to screen out biomarkers related to MS and establish a risk assessment and prediction model for the routine physical examination of an occupational population. The least absolute shrinkage and selection operator (Lasso) regression algorithm of machine learning was used to screen biomarkers related to MS. Then, the accuracy of the logistic regression model was further verified based on the Lasso regression algorithm. The areas under the receiving operating characteristic curves were used to evaluate the selection accuracy of biomarkers in identifying MS subjects with risk. The screened biomarkers were used to establish a logistic regression model and calculate the odds ratio (OR) of the corresponding biomarkers. A nomogram risk prediction model was established based on the selected biomarkers, and the consistency index (C-index) and calibration curve were derived. A total of 2,844 occupational workers were included, and 10 biomarkers related to MS were screened. The number of non-MS cases was 2,189 and that of MS was 655. The area under the curve (AUC) value for non-Lasso and Lasso logistic regression was 0.652 and 0.907, respectively. The established risk assessment model revealed that the main risk biomarkers were absolute basophil count (OR: 3.38, CI:1.05-6.85), platelet packed volume (OR: 2.63, CI:2.31-3.79), leukocyte count (OR: 2.01, CI:1.79-2.19), red blood cell count (OR: 1.99, CI:1.80-2.71), and alanine aminotransferase level (OR: 1.53, CI:1.12-1.98). Furthermore, favorable results with C-indexes (0.840) and calibration curves closer to ideal curves indicated the accurate predictive ability of this nomogram. The risk assessment model based on the Lasso logistic regression algorithm helped identify MS with high accuracy in physically examining an occupational population.
代谢综合征(MS)筛查对于职业人群的早期发现至关重要。本研究旨在筛选出与 MS 相关的生物标志物,并建立职业人群常规体检的风险评估和预测模型。采用机器学习的最小绝对收缩和选择算子(Lasso)回归算法筛选与 MS 相关的生物标志物。然后,基于 Lasso 回归算法进一步验证逻辑回归模型的准确性。使用受试者工作特征曲线下的面积来评估生物标志物识别有 MS 风险的个体的选择准确性。筛选出的生物标志物用于建立逻辑回归模型,并计算相应生物标志物的比值比(OR)。基于选定的生物标志物建立列线图风险预测模型,并推导一致性指数(C-index)和校准曲线。共纳入 2844 名职业工人,筛选出与 MS 相关的 10 个生物标志物。非 MS 病例数为 2189 例,MS 病例数为 655 例。非 Lasso 和 Lasso 逻辑回归的曲线下面积(AUC)值分别为 0.652 和 0.907。建立的风险评估模型显示,主要的风险生物标志物为绝对嗜碱性粒细胞计数(OR:3.38,CI:1.05-6.85)、血小板压积(OR:2.63,CI:2.31-3.79)、白细胞计数(OR:2.01,CI:1.79-2.19)、红细胞计数(OR:1.99,CI:1.80-2.71)和丙氨酸氨基转移酶水平(OR:1.53,CI:1.12-1.98)。此外,C 指数(0.840)较高,校准曲线更接近理想曲线,表明该列线图具有良好的预测能力。基于 Lasso 逻辑回归算法的风险评估模型有助于在职业人群的体检中准确识别 MS。