构建基于机器学习的模型用于筛查胃癌前病变高危患者。

Construction of machine learning-based models for screening the high-risk patients with gastric precancerous lesions.

作者信息

Yu Shuxian, Jiang Haiyang, Xia Jing, Gu Jie, Chen Mengting, Wang Yan, Zhao Xiaohong, Liao Zehua, Zeng Puhua, Xie Tian, Sui Xinbing

机构信息

School of Pharmacy, Hangzhou Normal University, Hangzhou, China.

The First Affiliated Hospital of Zhejiang Chinese Medicine University, Hangzhou, China.

出版信息

Chin Med. 2025 Jan 7;20(1):7. doi: 10.1186/s13020-025-01059-4.

DOI:10.1186/s13020-025-01059-4

PMID:39773492

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11705657/

Abstract

BACKGROUND

The individualized prediction and discrimination of precancerous lesions of gastric cancer (PLGC) is critical for the early prevention of gastric cancer (GC). However, accurate non-invasive methods for distinguishing between PLGC and GC are currently lacking. This study therefore aimed to develop a risk prediction model by machine learning and deep learning techniques to aid the early diagnosis of GC.

METHODS

In this study, a total of 2229 subjects were recruited from nine tertiary hospitals between October 2022 and November 2023. We designed a comprehensive questionnaire, identified statistically significant factors, and created a web-based column chart. Then, a risk prediction model was subsequently developed by machine learning techniques. In addition, a tongue image-based risk prediction model was established by deep learning algorithms.

RESULTS

Based on logistic regression analysis, a dynamic web-based nomogram was developed and it is freely accessible at: https://yz6677.shinyapps.io/GC67/ . Then, the prediction model was established using ten different machine learning algorithms and the Random Forest (RF) model achieved the highest accuracy at 85.65%. According with the predictive results, the top 10 key risk factors were age, traditional Chinese medicine (TCM) constitution type, tongue coating color, tongue color, irregular meals, pickled food, greasy fur, over-hot eating habit, anxiety and sleep onset latency. These factors are all significant risk indicators for the progression of PLGC patients to GC patients. Subsequently, the Swin Transformer architecture was used to develop a tongue image-based model for predicting the risk for progression of PLGC. The verification set showed an accuracy of 73.33% and an area under curve (AUC) greater than 0.8 across all models.

CONCLUSIONS

Our study developed machine learning and deep learning-based models for predicting the risk for progression of PLGC to GC, which will offer the assistance to determine the high-risk patients from PLGC and improve the early diagnosis of GC.

摘要

背景

胃癌癌前病变（PLGC）的个体化预测和鉴别对于胃癌（GC）的早期预防至关重要。然而，目前缺乏准确区分PLGC和GC的非侵入性方法。因此，本研究旨在通过机器学习和深度学习技术开发一种风险预测模型，以辅助GC的早期诊断。

方法

本研究于2022年10月至2023年11月从9家三级医院招募了2229名受试者。我们设计了一份综合问卷，确定了具有统计学意义的因素，并创建了一个基于网络的柱状图。然后，通过机器学习技术开发了一种风险预测模型。此外，通过深度学习算法建立了基于舌象的风险预测模型。

结果

基于逻辑回归分析，开发了一个基于网络的动态列线图，可通过以下链接免费访问：https://yz6677.shinyapps.io/GC67/ 。然后，使用十种不同的机器学习算法建立了预测模型，随机森林（RF）模型的准确率最高，为85.65%。根据预测结果，前10个关键风险因素是年龄、中医体质类型、舌苔颜色、舌质颜色、饮食不规律、腌制食品、腻苔、过烫饮食习惯、焦虑和入睡潜伏期。这些因素都是PLGC患者进展为GC患者的重要风险指标。随后，使用Swin Transformer架构开发了一种基于舌象的模型，用于预测PLGC进展的风险。验证集显示，所有模型的准确率为73.33%，曲线下面积（AUC）大于0.8。

结论

我们的研究开发了基于机器学习和深度学习的模型，用于预测PLGC进展为GC的风险，这将有助于从PLGC中确定高危患者，并改善GC的早期诊断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9f2/11705657/34cd055c4f21/13020_2025_1059_Fig1_HTML.jpg

相似文献

Construction of machine learning-based models for screening the high-risk patients with gastric precancerous lesions.

Chin Med. 2025 Jan 7;20(1):7. doi: 10.1186/s13020-025-01059-4.

Construction of Tongue Image-Based Machine Learning Model for Screening Patients with Gastric Precancerous Lesions.

J Pers Med. 2023 Jan 31;13(2):271. doi: 10.3390/jpm13020271.

Development of a tongue image-based machine learning tool for the diagnosis of gastric cancer: a prospective multicentre clinical cohort study.

EClinicalMedicine. 2023 Feb 6;57:101834. doi: 10.1016/j.eclinm.2023.101834. eCollection 2023 Mar.

Development of an artificial intelligent model for pre-endoscopic screening of precancerous lesions in gastric cancer.

Chin Med. 2024 Jun 29;19(1):90. doi: 10.1186/s13020-024-00963-5.

Manpixiao Decoction Halted the Malignant Transformation of Precancerous Lesions of Gastric Cancer: From Network Prediction to Verification.

Front Pharmacol. 2022 Aug 5;13:927731. doi: 10.3389/fphar.2022.927731. eCollection 2022.

Erianin, the main active ingredient of Dendrobium chrysotoxum Lindl, inhibits precancerous lesions of gastric cancer (PLGC) through suppression of the HRAS-PI3K-AKT signaling pathway as revealed by network pharmacology and in vitro experimental verification.

J Ethnopharmacol. 2021 Oct 28;279:114399. doi: 10.1016/j.jep.2021.114399. Epub 2021 Jul 8.

Traditional Chinese medicine for precancerous lesions of gastric cancer: A review.

Biomed Pharmacother. 2022 Feb;146:112542. doi: 10.1016/j.biopha.2021.112542. Epub 2021 Dec 20.

Identification and validation of ferroptosis-related biomarkers and the related pathogenesis in precancerous lesions of gastric cancer.

Sci Rep. 2023 Sep 26;13(1):16074. doi: 10.1038/s41598-023-43198-4.

Machine learning based models for predicting presentation delay risk among gastric cancer patients.

Front Oncol. 2025 Jan 13;14:1503047. doi: 10.3389/fonc.2024.1503047. eCollection 2024.

Traditional Chinese Medicine in the treatment of chronic atrophic gastritis, precancerous lesions and gastric cancer.

J Ethnopharmacol. 2025 Jan 30;337(Pt 1):118812. doi: 10.1016/j.jep.2024.118812. Epub 2024 Sep 10.

引用本文的文献

Precancerous pathways to gastric cancer: a review of experimental animal models recapitulating the correa cascade.

Front Cell Dev Biol. 2025 Jul 2;13:1620756. doi: 10.3389/fcell.2025.1620756. eCollection 2025.

Revolutionizing gastroenterology and hepatology with artificial intelligence: From precision diagnosis to equitable healthcare through interdisciplinary practice.

World J Gastroenterol. 2025 Jun 28;31(24):108021. doi: 10.3748/wjg.v31.i24.108021.

Prediction of herbal compatibility for colorectal adenoma treatment based on graph neural networks.

Chin Med. 2025 Mar 5;20(1):31. doi: 10.1186/s13020-025-01082-5.

本文引用的文献

Cancer incidence and mortality in China, 2022.

J Natl Cancer Cent. 2024 Feb 2;4(1):47-53. doi: 10.1016/j.jncc.2024.01.006. eCollection 2024 Mar.

Dynamic nomogram for predicting acute kidney injury in patients with community-acquired pneumonia.

BMJ Open Respir Res. 2023 Sep;10(1). doi: 10.1136/bmjresp-2022-001495.

A survey of artificial intelligence in tongue image for disease diagnosis and syndrome differentiation.

Digit Health. 2023 Aug 6;9:20552076231191044. doi: 10.1177/20552076231191044. eCollection 2023 Jan-Dec.

Front Endocrinol (Lausanne). 2023 Jan 31;14:1079628. doi: 10.3389/fendo.2023.1079628. eCollection 2023.

A Multi-Purpose Shallow Convolutional Neural Network for Chart Images.

Sensors (Basel). 2022 Oct 11;22(20):7695. doi: 10.3390/s22207695.

An Improved Machine-Learning Approach for COVID-19 Prediction Using Harris Hawks Optimization and Feature Analysis Using SHAP.

Diagnostics (Basel). 2022 Apr 19;12(5):1023. doi: 10.3390/diagnostics12051023.

Gastric Intestinal Metaplasia in Mucosa Adjacent to Gastric Cancers Is Rarely Associated With the Aneuploidy That Is Characteristic of Gastric Dysplasia or Cancer.

Am J Surg Pathol. 2021 Oct 1;45(10):1374-1381. doi: 10.1097/PAS.0000000000001764.

Dynamic Nomogram for Predicting Lateral Cervical Lymph Node Metastasis in Papillary Thyroid Carcinoma.

Otolaryngol Head Neck Surg. 2022 Mar;166(3):444-453. doi: 10.1177/01945998211009858. Epub 2021 Jun 1.

Application of Artificial Intelligence in the Establishment of an Association Model between Metabolic Syndrome, TCM Constitution, and the Guidance of Medicated Diet Care.

Evid Based Complement Alternat Med. 2021 Apr 30;2021:5530717. doi: 10.1155/2021/5530717. eCollection 2021.

Prognostic nomograms for predicting overall survival and cause-specific survival of signet ring cell carcinoma in colorectal cancer patients.

World J Clin Cases. 2021 Apr 16;9(11):2503-2518. doi: 10.12998/wjcc.v9.i11.2503.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

构建基于机器学习的模型用于筛查胃癌前病变高危患者。

Construction of machine learning-based models for screening the high-risk patients with gastric precancerous lesions.

作者信息

Yu Shuxian, Jiang Haiyang, Xia Jing, Gu Jie, Chen Mengting, Wang Yan, Zhao Xiaohong, Liao Zehua, Zeng Puhua, Xie Tian, Sui Xinbing

机构信息

School of Pharmacy, Hangzhou Normal University, Hangzhou, China.

The First Affiliated Hospital of Zhejiang Chinese Medicine University, Hangzhou, China.