Zhou An, Chen Kui, Wei Yonghui, Ye Qu, Xiao Yuanming, Shi Rong, Wang Jiangang, Li Wei-Dong
Department of Genetics, College of Basic Medical Sciences, Tianjin Medical University, Tianjin, China.
Health Management Medical Center, Third Xiangya Hospital, Central South University, Changsha, China.
Front Med (Lausanne). 2025 Jun 12;12:1593662. doi: 10.3389/fmed.2025.1593662. eCollection 2025.
Early detection of subclinical atherosclerosis progression is crucial for preventing atherosclerotic cardiovascular disease (ASCVD). Carotid intima-media thickness (CIMT) is a recognized surrogate marker for atherosclerosis, but accurate prediction of its progression remains challenging. This study aimed to develop and validate machine learning models for predicting CIMT progression via routine clinical biomarkers.
In this three-year prospective cohort study, we analyzed data from 904 participants from the Third Xiangya Hospital of Central South University Health Examination Cohort who underwent three consecutive annual CIMT measurements. The participants were categorized into CIMT thickening and nonthickening groups on the basis of a final CIMT ≥1.0 mm or an increase ≥0.1 mm across consecutive measurements. We evaluated seven machine learning algorithms: logistic regression, random forest, XGBoost, support vector machine (SVM), elastic net, decision tree, and neural network. Model performance was assessed through discrimination (AUC, sensitivity, specificity) and calibration metrics, with Platt scaling applied to optimize probability estimates. Clinical utility was evaluated through decision curve analysis.
Compared with the more complex algorithms, the elastic net model demonstrated superior performance (AUC 0.754). Baseline CIMT, absolute monocyte count, sex, age, and LDL-C were identified as the most influential predictors. After Platt scaling, the calibration improved significantly across all the models. Decision curve analysis revealed a positive net benefit across a wide threshold range (0.01-0.5). On the basis of calibrated probabilities, we developed a three-tier risk stratification framework that identified distinct groups with progressively higher event rates: medium-risk (13.9%), high-risk (50.0%), and very-high-risk (60.0%). Subgroup analysis revealed better predictive performance in younger participants (<50 years), those with lower baseline CIMT (<0.8 mm), and females.
Machine learning approaches, particularly the elastic net model, can effectively identify individuals at high risk for CIMT progression via routine clinical biomarkers. The superior performance of simpler models suggests predominantly linear relationships between predictors and CIMT progression. Following appropriate calibration, the model demonstrated strong clinical utility across diverse decision thresholds, supporting a stratified approach to atherosclerosis prevention.
亚临床动脉粥样硬化进展的早期检测对于预防动脉粥样硬化性心血管疾病(ASCVD)至关重要。颈动脉内膜中层厚度(CIMT)是公认的动脉粥样硬化替代标志物,但其进展的准确预测仍然具有挑战性。本研究旨在开发并验证通过常规临床生物标志物预测CIMT进展的机器学习模型。
在这项为期三年的前瞻性队列研究中,我们分析了来自中南大学湘雅三医院健康体检队列的904名参与者的数据,这些参与者连续三年每年进行一次CIMT测量。根据最终CIMT≥1.0毫米或连续测量增加≥0.1毫米,将参与者分为CIMT增厚组和非增厚组。我们评估了七种机器学习算法:逻辑回归、随机森林、XGBoost、支持向量机(SVM)、弹性网络、决策树和神经网络。通过判别(AUC、敏感性、特异性)和校准指标评估模型性能,并应用Platt缩放优化概率估计。通过决策曲线分析评估临床实用性。
与更复杂的算法相比,弹性网络模型表现出更好的性能(AUC 0.754)。基线CIMT、绝对单核细胞计数、性别、年龄和低密度脂蛋白胆固醇被确定为最有影响力的预测因素。经过Platt缩放后,所有模型的校准均有显著改善。决策曲线分析显示在较宽的阈值范围(0.01 - 0.5)内净效益为正。基于校准概率,我们开发了一个三层风险分层框架,该框架识别出事件发生率逐渐升高的不同组:中风险(13.9%)、高风险(50.0%)和极高风险(60.0%)。亚组分析显示,在年轻参与者(<50岁)、基线CIMT较低(<0.8毫米)的参与者和女性中,预测性能更好。
机器学习方法,特别是弹性网络模型,能够通过常规临床生物标志物有效识别CIMT进展的高危个体。较简单模型的优越性能表明预测因素与CIMT进展之间主要存在线性关系。经过适当校准后,该模型在不同决策阈值下均显示出强大的临床实用性,支持采用分层方法预防动脉粥样硬化。