一种使用机器学习技术和先进参数优化方法预测结直肠癌患者生存率的新方法。

A Novel Approach for Predicting the Survival of Colorectal Cancer Patients Using Machine Learning Techniques and Advanced Parameter Optimization Methods.

作者信息

Woźniacki Andrzej, Książek Wojciech, Mrowczyk Patrycja

机构信息

Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Warszawska 24, 31-155 Cracow, Poland.

Oncology Clinical Department, The University Hospital in Cracow, Kopernika 50, 31-501 Cracow, Poland.

出版信息

Cancers (Basel). 2024 Sep 20;16(18):3205. doi: 10.3390/cancers16183205.

DOI:10.3390/cancers16183205

PMID:39335174

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11430446/

Abstract

BACKGROUND

Colorectal cancer is one of the most prevalent forms of cancer and is associated with a high mortality rate. Additionally, an increasing number of adults under 50 are being diagnosed with the disease. This underscores the importance of leveraging modern technologies, such as artificial intelligence, for early diagnosis and treatment support.

METHODS

Eight classifiers were utilized in this research: Random Forest, XGBoost, CatBoost, LightGBM, Gradient Boosting, Extra Trees, the k-nearest neighbor algorithm (KNN), and decision trees. These algorithms were optimized using the frameworks Optuna, RayTune, and HyperOpt. This study was conducted on a public dataset from Brazil, containing information on tens of thousands of patients.

RESULTS

The models developed in this study demonstrated high classification accuracy in predicting one-, three-, and five-year survival, as well as overall mortality and cancer-specific mortality. The CatBoost, LightGBM, Gradient Boosting, and Random Forest classifiers delivered the best performance, achieving an accuracy of approximately 80% across all the evaluated tasks.

CONCLUSIONS

This research enabled the development of effective classification models that can be applied in clinical practice.

摘要

背景

结直肠癌是最常见的癌症形式之一，死亡率很高。此外，越来越多50岁以下的成年人被诊断出患有这种疾病。这凸显了利用人工智能等现代技术进行早期诊断和治疗支持的重要性。

方法

本研究使用了八种分类器：随机森林、XGBoost、CatBoost、LightGBM、梯度提升、极端随机树、k近邻算法（KNN）和决策树。这些算法使用Optuna、RayTune和HyperOpt框架进行了优化。本研究基于巴西的一个公共数据集进行，该数据集包含数万名患者的信息。

结果

本研究开发的模型在预测一年、三年和五年生存率以及总死亡率和癌症特异性死亡率方面表现出较高的分类准确率。CatBoost、LightGBM、梯度提升和随机森林分类器表现最佳，在所有评估任务中准确率约为80%。

结论

本研究促成了可应用于临床实践的有效分类模型的开发。

相似文献

A Novel Approach for Predicting the Survival of Colorectal Cancer Patients Using Machine Learning Techniques and Advanced Parameter Optimization Methods.一种使用机器学习技术和先进参数优化方法预测结直肠癌患者生存率的新方法。

Cancers (Basel). 2024 Sep 20;16(18):3205. doi: 10.3390/cancers16183205.

Development and internal validation of machine learning models for personalized survival predictions in spinal cord glioma patients.机器学习模型在脊髓神经胶质瘤患者个体化生存预测中的开发和内部验证。

Spine J. 2024 Jun;24(6):1065-1076. doi: 10.1016/j.spinee.2024.02.002. Epub 2024 Feb 15.

Tree-Based Machine Learning Models with Optuna in Predicting Impedance Values for Circuit Analysis.基于树的机器学习模型与Optuna用于预测电路分析的阻抗值。

Micromachines (Basel). 2023 Jan 20;14(2):265. doi: 10.3390/mi14020265.

Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。

BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

Comparison of radiomics-based machine-learning classifiers for the pretreatment prediction of pathologic complete response to neoadjuvant therapy in breast cancer.基于放射组学的机器学习分类器在乳腺癌新辅助治疗前后病理完全缓解预测中的比较。

PeerJ. 2024 Jul 15;12:e17683. doi: 10.7717/peerj.17683. eCollection 2024.

Application of machine learning approaches to predict the 5-year survival status of patients with esophageal cancer.应用机器学习方法预测食管癌患者的5年生存状况。

J Thorac Dis. 2021 Nov;13(11):6240-6251. doi: 10.21037/jtd-21-1107.

Development of Cost-Effective Fatty Liver Disease Prediction Models in a Chinese Population: Statistical and Machine Learning Approaches.中国人群中具有成本效益的脂肪肝疾病预测模型的开发：统计和机器学习方法

JMIR Form Res. 2024 Feb 16;8:e53654. doi: 10.2196/53654.

A data-driven interpretable ensemble framework based on tree models for forecasting the occurrence of COVID-19 in the USA.基于树模型的数据驱动可解释集成框架，用于预测美国 COVID-19 的发生情况。

Environ Sci Pollut Res Int. 2023 Jan;30(5):13648-13659. doi: 10.1007/s11356-022-23132-3. Epub 2022 Sep 22.

Application of machine learning techniques for predicting survival in ovarian cancer.机器学习技术在卵巢癌生存预测中的应用。

BMC Med Inform Decis Mak. 2022 Dec 30;22(1):345. doi: 10.1186/s12911-022-02087-y.

A Bayesian optimization tunning integrated multi-stacking classifier framework for the prediction of radiodermatitis from 4D-CT of patients underwent breast cancer radiotherapy.一种用于从接受乳腺癌放疗患者的4D-CT预测放射性皮炎的贝叶斯优化调谐集成多堆叠分类器框架。

Front Oncol. 2023 Jun 13;13:1152020. doi: 10.3389/fonc.2023.1152020. eCollection 2023.

引用本文的文献

Short-term mortality prediction in children with gastrointestinal congenital anomalies using a random forest classifier.使用随机森林分类器预测胃肠道先天性异常儿童的短期死亡率

Pediatr Res. 2025 Sep 15. doi: 10.1038/s41390-025-04378-2.

Machine learning to evaluate the effects of non-clinical social determinant features in predicting colorectal Cancer mortality in a medically underserved Appalachian population.机器学习用于评估非临床社会决定因素特征在预测医疗服务不足的阿巴拉契亚人群结直肠癌死亡率中的作用。

Sci Rep. 2025 Jul 16;15(1):25781. doi: 10.1038/s41598-025-11074-y.

Optimizing prediction of metastasis among colorectal cancer patients using machine learning technology.使用机器学习技术优化结直肠癌患者转移的预测。

BMC Gastroenterol. 2025 Apr 18;25(1):272. doi: 10.1186/s12876-025-03841-y.

Towards precision oncology: a multi-level cancer classification system integrating liquid biopsy and machine learning.迈向精准肿瘤学：一种整合液体活检和机器学习的多层次癌症分类系统。

BioData Min. 2025 Apr 11;18(1):29. doi: 10.1186/s13040-025-00439-8.

Gamma-Glutamyl Transferase Plus Carcinoembryonic Antigen Ratio Index: A Promising Biomarker Associated with Treatment Response to Neoadjuvant Chemotherapy for Patients with Colorectal Cancer Liver Metastases.γ-谷氨酰转移酶加癌胚抗原比值指数：一种与结直肠癌肝转移患者新辅助化疗治疗反应相关的有前景的生物标志物。

Curr Oncol. 2025 Feb 18;32(2):117. doi: 10.3390/curroncol32020117.

Machine learning-based identification of proteomic markers in colorectal cancer using UK Biobank data.利用英国生物银行数据基于机器学习识别结直肠癌中的蛋白质组学标志物

Front Oncol. 2025 Jan 7;14:1505675. doi: 10.3389/fonc.2024.1505675. eCollection 2024.

Explainable Thyroid Cancer Diagnosis Through Two-Level Machine Learning Optimization with an Improved Naked Mole-Rat Algorithm.通过使用改进的裸鼹鼠算法进行两级机器学习优化实现可解释的甲状腺癌诊断

Cancers (Basel). 2024 Dec 10;16(24):4128. doi: 10.3390/cancers16244128.

本文引用的文献

Novel Artificial Intelligence Combining Convolutional Neural Network and Support Vector Machine to Predict Colorectal Cancer Prognosis and Mutational Signatures From Hematoxylin and Eosin Images.新型人工智能结合卷积神经网络和支持向量机，从苏木精和伊红图像预测结直肠癌预后和突变特征。

Mod Pathol. 2024 Oct;37(10):100562. doi: 10.1016/j.modpat.2024.100562. Epub 2024 Jul 15.

Colorectal cancer.结直肠癌。

Lancet. 2024 Jul 20;404(10449):294-310. doi: 10.1016/S0140-6736(24)00360-X. Epub 2024 Jun 20.

Colorectal Cancer: Epidemiology, Risk Factors, and Prevention.结直肠癌：流行病学、风险因素与预防

Cancers (Basel). 2024 Apr 17;16(8):1530. doi: 10.3390/cancers16081530.

Development of machine learning-based predictors for early diagnosis of hepatocellular carcinoma.基于机器学习的肝细胞癌早期诊断预测因子的研究进展。

Sci Rep. 2024 Mar 4;14(1):5274. doi: 10.1038/s41598-024-51265-7.

Colorectal cancer: a comprehensive review of carcinogenesis, diagnosis, and novel strategies for classified treatments.结直肠癌：癌变、诊断的全面综述及分类治疗的新策略。

Cancer Metastasis Rev. 2024 Jun;43(2):729-753. doi: 10.1007/s10555-023-10158-3. Epub 2023 Dec 19.

Breast Cancer Detection and Prevention Using Machine Learning.利用机器学习进行乳腺癌检测与预防

Diagnostics (Basel). 2023 Oct 2;13(19):3113. doi: 10.3390/diagnostics13193113.

Machine learning for predicting survival of colorectal cancer patients.机器学习预测结直肠癌患者的生存情况。

Sci Rep. 2023 Jun 1;13(1):8874. doi: 10.1038/s41598-023-35649-9.

Artificial intelligence in lung cancer diagnosis and prognosis: Current application and future perspective.人工智能在肺癌诊断和预后中的应用：现状与未来展望。

Semin Cancer Biol. 2023 Feb;89:30-37. doi: 10.1016/j.semcancer.2023.01.006. Epub 2023 Jan 20.

Colorectal Cancer in Younger Adults.年轻人中的结直肠癌。

Hematol Oncol Clin North Am. 2022 Jun;36(3):449-470. doi: 10.1016/j.hoc.2022.02.005. Epub 2022 May 13.

Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction.用于疾病预测的K近邻（KNN）算法及其不同变体的性能比较分析。

Sci Rep. 2022 Apr 15;12(1):6256. doi: 10.1038/s41598-022-10358-x.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种使用机器学习技术和先进参数优化方法预测结直肠癌患者生存率的新方法。

A Novel Approach for Predicting the Survival of Colorectal Cancer Patients Using Machine Learning Techniques and Advanced Parameter Optimization Methods.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献