Al Bataineh Ali, Vamsi Bandi, El-Abd Mohammed, Doppala Bhanu Prakash
Artificial Intelligence Center, Norwich University, Northfield, VT, 05663, USA.
Department of Artificial Intelligence, Madanapalle Institute of Technology & Science, Madanapalle, 517325, Andhra Pradesh, India.
Sci Rep. 2025 Sep 1;15(1):32108. doi: 10.1038/s41598-025-14869-1.
Carotid Intima-Media Thickness (CIMT) is defined as a non-invasive and well-validated sign of asymptomatic atherosclerosis and an early predictor of cardiovascular disease (CVD). We assembled a carefully curated dataset of 100 adult patients, encompassing 13 clinical, biochemical and demographic variables routinely collected in outpatient practice. After a five-stage pre-processing pipeline median/mode imputation, categorical encoding, Min-Max scaling, inter-quartile-range outlier removal and SMOTE-NC balancing we trained a Kolmogorov-Arnold Network (KAN) to assign each patient to one of four CIMT-defined risk tiers mentioned as "No", "Low", "Medium", "High". Feature-selection tests (Spearman, Pearson, ANOVA and χ²) removed redundant predictors and improved interpretability. The KAN, implemented with ELU-activated hidden layers and a Softmax output was benchmarked against six conventional algorithms like Support Vector Machine, Decision Tree, Logistic Regression, Stochastic Gradient Descent, Deep Neural Network, Random Forest and Multi-Layer Perceptron. On stratification of five-fold cross-validation the proposed model achieved 93% accuracy, 93% precision, 93% recall, 91% F1-score and a ROC-AUC of 0.97, outperforming all baseline models by 8-19%. These results demonstrate that KAN's capacity in capturing arbitrary connections and handling multi-class tasks demonstrating its potential as a low-cost and promising tool for early cardiovascular risk hierarchy.
颈动脉内膜中层厚度(CIMT)被定义为无症状动脉粥样硬化的一种非侵入性且经过充分验证的标志,以及心血管疾病(CVD)的早期预测指标。我们精心整理了一个包含100名成年患者的数据集,涵盖了门诊实践中常规收集的13个临床、生化和人口统计学变量。经过一个五阶段的预处理流程,即中位数/众数插补、分类编码、最小-最大缩放、四分位距异常值去除和SMOTE-NC平衡,我们训练了一个柯尔莫哥洛夫-阿诺德网络(KAN),将每位患者分配到四个由CIMT定义的风险等级之一,即“无”、“低”、“中”、“高”。特征选择测试(斯皮尔曼、皮尔逊、方差分析和卡方检验)去除了冗余预测变量并提高了可解释性。使用ELU激活的隐藏层和Softmax输出实现的KAN与六种传统算法进行了基准测试,如支持向量机、决策树、逻辑回归、随机梯度下降、深度神经网络、随机森林和多层感知器。在五折交叉验证的分层中,所提出的模型实现了93%的准确率、93%的精确率、93%的召回率、91%的F1分数和0.97的ROC-AUC,比所有基线模型性能高出8-19%。这些结果表明,KAN在捕捉任意连接和处理多类任务方面的能力,证明了其作为一种低成本且有前景的早期心血管风险分层工具的潜力。