Jiang Siqing, Gao Haojun, He Jiajin, Shi Jiaqi, Tong Yuling, Wu Jian
Department of Public Health, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, China.
Real-Doctor Artificial Intelligence Research Center, Zhejiang University, Hangzhou, China.
Front Artif Intell. 2022 Aug 16;5:956385. doi: 10.3389/frai.2022.956385. eCollection 2022.
Gastric cancer remains an enormous threat to human health. It is extremely significant to make a clear diagnosis and timely treatment of gastrointestinal tumors. The traditional diagnosis method (endoscope, surgery, and pathological tissue extraction) of gastric cancer is usually invasive, expensive, and time-consuming. The machine learning method is fast and low-cost, which breaks through the limitations of the traditional methods as we can apply the machine learning method to diagnose gastric cancer. This work aims to construct a cheap, non-invasive, rapid, and high-precision gastric cancer diagnostic model using personal behavioral lifestyles and non-invasive characteristics. A retrospective study was implemented on 3,630 participants. The developed models (extreme gradient boosting, decision tree, random forest, and logistic regression) were evaluated by cross-validation and the generalization ability in our test set. We found that the model developed using fingerprints based on the extreme gradient boosting (XGBoost) algorithm produced better results compared with the other models. The overall accuracy of which test set was 85.7%, AUC was 89.6%, sensitivity 78.7%, specificity 76.9%, and positive predictive values 73.8%, verifying that the proposed model has significant medical value and good application prospects.
胃癌仍然是对人类健康的巨大威胁。明确诊断并及时治疗胃肠道肿瘤极为重要。胃癌的传统诊断方法(内窥镜检查、手术和病理组织提取)通常具有侵入性、成本高且耗时。机器学习方法快速且低成本,突破了传统方法的局限性,因为我们可以应用机器学习方法来诊断胃癌。这项工作旨在利用个人行为生活方式和非侵入性特征构建一个廉价、非侵入性、快速且高精度的胃癌诊断模型。对3630名参与者进行了一项回顾性研究。通过交叉验证和我们测试集中的泛化能力对所开发的模型(极端梯度提升、决策树、随机森林和逻辑回归)进行了评估。我们发现,基于极端梯度提升(XGBoost)算法使用指纹开发的模型与其他模型相比产生了更好的结果。其测试集的总体准确率为85.7%,AUC为89.6%,灵敏度为78.7%,特异性为76.9%,阳性预测值为73.8%,证实了所提出的模型具有显著的医学价值和良好的应用前景。