Sun Xue, Zhang Liping, Luo Qingfeng, Zhou Yan, Du Jun, Fu Dongmei, Wang Ziyu, Lei Yi, Wang Qing, Zhao Li
Department of General Practice, Beijing Hospital, National Center of Gerontology, Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, Beijing 100730, China.
Pharmacovigilance Research Center for Information Technology and Data Science, Cross-Strait Tsinghua Research Institute, Xiamen 361015, China.
Bioengineering (Basel). 2024 Sep 27;11(10):973. doi: 10.3390/bioengineering11100973.
The early detection accuracy of early gastric cancer (EGC) determines the choice of the optimal treatment strategy and the related medical expenses. We aimed to develop a simple, affordable, and time-saving diagnostic model using six machine learning (ML) algorithms for the diagnosis of EGC. It is based on the endoscopy-based Kyoto classification score obtained after the completion of endoscopy and other clinical features obtained after medical consultation. We retrospectively evaluated 1999 patients who underwent gastrointestinal endoscopy at the China Beijing Hospital. Of these, 203 subjects were diagnosed with EGC. The data were randomly divided into training and test sets (ratio 4:1). We constructed six ML models, and the developed models were evaluated on the testing set. This procedure was repeated five times. The Kolmogorov-Arnold Networks (KANs) model achieved the best performance (mean AUC value: 0.76; mean balanced accuracy: 70.96%; mean precision: 58.91%; mean recall: 70.96%; mean false positive rate: 26.11%; mean false negative rate: 31.96%; and mean F1 score value: 58.46). The endoscopy-based Kyoto classification score was the most important feature with the highest feature importance score. The results suggest that the KAN model, the optimal ML model in this study, has the potential to identify EGC patients, which may result in a reduction in both the time cost and medical expenses in clinical practice.
早期胃癌(EGC)的早期检测准确性决定了最佳治疗策略的选择以及相关医疗费用。我们旨在使用六种机器学习(ML)算法开发一种简单、经济且省时的诊断模型,用于EGC的诊断。该模型基于内镜检查完成后获得的基于内镜的京都分类评分以及会诊后获得的其他临床特征。我们回顾性评估了在中国北京医院接受胃肠内镜检查的1999例患者。其中,203例被诊断为EGC。数据被随机分为训练集和测试集(比例为4:1)。我们构建了六个ML模型,并在测试集上对开发的模型进行评估。此过程重复五次。Kolmogorov-Arnold网络(KANs)模型表现最佳(平均AUC值:0.76;平均平衡准确率:70.96%;平均精确率:58.91%;平均召回率:70.96%;平均假阳性率:26.11%;平均假阴性率:31.96%;平均F1评分值:58.46)。基于内镜的京都分类评分是特征重要性得分最高的最重要特征。结果表明,本研究中的最佳ML模型KAN模型有潜力识别EGC患者,这可能会降低临床实践中的时间成本和医疗费用。