• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于早期结直肠癌检测的增强型非侵入性机器学习方法:约旦队列中的预测建模与验证

Enhanced non-invasive machine learning approach for early colorectal cancer detection: Predictive modeling and validation in a Jordanian cohort.

作者信息

Kanaan Soha, Altamimi Ahmad, Qattous Hazem, Rbeihat Haitham

机构信息

Princess Sumaya University for Technology,(PSUT), Amman, Jordan.

Department of Software Engineering, Princess Sumaya University for Technology (PSUT), Amman, Jordan.

出版信息

Comput Biol Med. 2025 Jun;191:110184. doi: 10.1016/j.compbiomed.2025.110184. Epub 2025 Apr 17.

DOI:10.1016/j.compbiomed.2025.110184
PMID:40249989
Abstract

BACKGROUND

Colorectal cancer (CRC) ranks as the third most prevalent cancer worldwide, posing significant public health challenges. Late-stage detection often results in poor treatment outcomes, elevating mortality rates. The economic and psychological burdens of CRC treatment underscore the need for early detection.

OBJECTIVE

This study aims to enhance the early detection of colorectal cancer by employing machine learning (ML) algorithms on non-invasive features. The focus is on constructing a comprehensive dataset, analyzing non-invasive features, and developing predictive models to minimize the necessity for invasive procedures such as colonoscopy. By focusing on non-invasive, easily accessible data, the study aims to develop a model that can be widely applied without the associated risks of invasive procedures.

METHODS

A retrospective dataset of 400 patients was sourced from the colorectal cancer unit of Royal Medical Services (2021-2022). The dataset included demographic data, imaging reports, laboratory results, and clinical evaluations. The study involved three experiments, training ML models (K-Nearest Neighbors (KNN), Super Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Naïve Bayes (NB)) on the collected dataset and a public dataset to validate generalizability. The first experiment used 35 features across the ML algorithms. The second experiment focused on the most informative features. The third experiment validated the models using a public dataset, with Phase I including all data and Phase II excluding missing values.

RESULTS

The Random Forest (RF) algorithm consistently outperformed other models, achieving an accuracy of 95.8 % in the first experiment, increasing to 96.5 % in the second experiment. For the public dataset, RF accuracy was 66.0 % in Phase I and 68.9 % in Phase II. Conversely, the KNN algorithm exhibited the lowest accuracy across all experiments.

CONCLUSION

This study highlights the effectiveness of ML in early CRC detection using non-invasive techniques. The RF model demonstrated superior accuracy, suggesting its potential application in clinical settings. The research contributes valuable insights into CRC detection within the local context and emphasizes the broader applicability of ML in improving cancer diagnosis and personalized treatment.

摘要

背景

结直肠癌(CRC)是全球第三大常见癌症,对公共卫生构成重大挑战。晚期检测往往导致治疗效果不佳,死亡率上升。CRC治疗的经济和心理负担凸显了早期检测的必要性。

目的

本研究旨在通过对非侵入性特征应用机器学习(ML)算法来加强结直肠癌的早期检测。重点在于构建一个综合数据集,分析非侵入性特征,并开发预测模型,以尽量减少诸如结肠镜检查等侵入性程序的必要性。通过关注非侵入性、易于获取的数据,该研究旨在开发一种可广泛应用且无侵入性程序相关风险的模型。

方法

从皇家医疗服务机构的结直肠癌科室获取了一个包含400名患者的回顾性数据集(2021 - 2022年)。该数据集包括人口统计学数据、影像报告、实验室结果和临床评估。该研究涉及三个实验,在收集的数据集和一个公共数据集上训练ML模型(K近邻算法(KNN)、支持向量机(SVM)、随机森林(RF)、决策树(DT)和朴素贝叶斯(NB))以验证其通用性。第一个实验在ML算法中使用了35个特征。第二个实验聚焦于信息量最大的特征。第三个实验使用公共数据集验证模型,第一阶段包括所有数据,第二阶段排除缺失值。

结果

随机森林(RF)算法始终优于其他模型,在第一个实验中准确率达到95.8%,在第二个实验中提高到96.5%。对于公共数据集,RF在第一阶段的准确率为66.0%,在第二阶段为68.9%。相反,KNN算法在所有实验中准确率最低。

结论

本研究突出了ML在使用非侵入性技术进行早期CRC检测中的有效性。RF模型展示了卓越的准确率,表明其在临床环境中的潜在应用价值。该研究为本地背景下的CRC检测提供了有价值的见解,并强调了ML在改善癌症诊断和个性化治疗方面的更广泛适用性。

相似文献

1
Enhanced non-invasive machine learning approach for early colorectal cancer detection: Predictive modeling and validation in a Jordanian cohort.用于早期结直肠癌检测的增强型非侵入性机器学习方法:约旦队列中的预测建模与验证
Comput Biol Med. 2025 Jun;191:110184. doi: 10.1016/j.compbiomed.2025.110184. Epub 2025 Apr 17.
2
[Construction and preliminary validation of machine learning predictive models for cervical cancer screening based on human DNA methylation].基于人类DNA甲基化的宫颈癌筛查机器学习预测模型的构建与初步验证
Zhonghua Zhong Liu Za Zhi. 2025 Feb 23;47(2):193-200. doi: 10.3760/cma.j.cn112152-20230925-00156.
3
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
4
Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer.基于机器学习的放射组学模型预测局部进展期胃癌网膜转移能力的比较评估。
Sci Rep. 2024 Jul 13;14(1):16208. doi: 10.1038/s41598-024-66979-x.
5
Human lung cancer classification and comprehensive analysis using different machine learning techniques.使用不同机器学习技术的人类肺癌分类与综合分析
Microsc Res Tech. 2025 Jan;88(1):234-250. doi: 10.1002/jemt.24682. Epub 2024 Sep 18.
6
Predicting maternal risk level using machine learning models.使用机器学习模型预测孕产妇风险水平。
BMC Pregnancy Childbirth. 2024 Dec 18;24(1):820. doi: 10.1186/s12884-024-07030-9.
7
Classification and Diagnostic Prediction of Colorectal Cancer Mortality Based on Machine Learning Algorithms: A Multicenter National Study.基于机器学习算法的结直肠癌死亡率的分类和诊断预测:一项多中心全国性研究。
Asian Pac J Cancer Prev. 2024 Jan 1;25(1):333-342. doi: 10.31557/APJCP.2024.25.1.333.
8
Blood Biomarkers Panels for Screening of Colorectal Cancer and Adenoma on a Machine Learning-Assisted Detection Platform.基于机器学习辅助检测平台的用于结直肠癌和腺瘤筛查的血液生物标志物检测面板。
Cancer Control. 2023 Jan-Dec;30:10732748231222109. doi: 10.1177/10732748231222109.
9
Colorectal Cancer Detected by Machine Learning Models Using Conventional Laboratory Test Data.基于常规实验室检验数据的机器学习模型检测结直肠癌
Technol Cancer Res Treat. 2021 Jan-Dec;20:15330338211058352. doi: 10.1177/15330338211058352.
10
Machine learning applications to classify and monitor medication adherence in patients with type 2 diabetes in Ethiopia.机器学习在埃塞俄比亚2型糖尿病患者用药依从性分类和监测中的应用。
Front Endocrinol (Lausanne). 2025 Mar 20;16:1486350. doi: 10.3389/fendo.2025.1486350. eCollection 2025.