• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用全血细胞计数的结直肠癌检测堆叠随机森林模型

Stacked random forest model for colorectal cancer detection using complete blood counts.

作者信息

Luo Junfeng, Tan Weiwei, Chen Shaobo, Chen Yijing, Fu Ya, Jing Xiaojuan, Kang Lingling, Li Qingyun, Ma Zhenjian, Sun Tingji, Xiao Peng, Xue Shigui, Wang Xiaozhi, Zhang Houde

机构信息

Department of Gastroenterology, Nanshan Hospital, Guangdong Medical University, Shenzhen, China.

Department of Neurology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.

出版信息

Digit Health. 2025 Jul 29;11:20552076251362072. doi: 10.1177/20552076251362072. eCollection 2025 Jan-Dec.

DOI:10.1177/20552076251362072
PMID:40755955
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12317218/
Abstract

BACKGROUND

In China, adherence to screening colonoscopy among eligible individuals remains suboptimal, primarily due to cost concerns and potential adverse effects. A machine learning model utilizing complete blood count (CBC) data could help prioritize colonoscopy referrals and improve screening participation.

METHOD

This multicenter study included participants who underwent CBC testing within three months before colonoscopy. CBC data were classified into three types (A, B, and C) based on hematology analyzer capabilities, with Type C excluded from analysis. Using Types A and B, we developed a stacking machine learning model incorporating 24 CBC features and 5 combined CBC components to predict colorectal cancer (CRC). Model performance was evaluated using the area under the curve (AUC), specificity, and sensitivity.

RESULTS

The study included 1795 CRC cases and 26,380 cancer-free individuals with CBC data. On external validation, the model achieved 80.3% specificity and 65.2% sensitivity. Notably, it demonstrated 41% sensitivity for Stage I CRC and 57.6% sensitivity for Stages I-III combined.

CONCLUSIONS

CBC testing, combined with electronic medical record data, is a low-cost and widely accessible tool. Our robust CRC risk prediction model can serve as a preliminary screening method, aiding in colonoscopy referral decisions and improving CRC screening efficiency.

摘要

背景

在中国,符合条件的个体对结肠镜筛查的依从性仍不理想,主要原因是费用问题和潜在的不良反应。利用全血细胞计数(CBC)数据的机器学习模型有助于确定结肠镜检查转诊的优先级并提高筛查参与率。

方法

这项多中心研究纳入了在结肠镜检查前三个月内接受CBC检测的参与者。根据血液分析仪的功能,CBC数据被分为三种类型(A、B和C),C型被排除在分析之外。利用A型和B型数据,我们开发了一个堆叠机器学习模型,该模型纳入了24个CBC特征和5个CBC组合成分,以预测结直肠癌(CRC)。使用曲线下面积(AUC)、特异性和敏感性来评估模型性能。

结果

该研究纳入了1795例CRC病例和26380名无癌个体的CBC数据。在外部验证中,该模型的特异性为80.3%,敏感性为65.2%。值得注意的是,它对I期CRC的敏感性为41%,对I-III期联合的敏感性为57.6%。

结论

CBC检测与电子病历数据相结合是一种低成本且广泛可用的工具。我们强大的CRC风险预测模型可作为一种初步筛查方法,有助于结肠镜检查转诊决策并提高CRC筛查效率。

相似文献

1
Stacked random forest model for colorectal cancer detection using complete blood counts.使用全血细胞计数的结直肠癌检测堆叠随机森林模型
Digit Health. 2025 Jul 29;11:20552076251362072. doi: 10.1177/20552076251362072. eCollection 2025 Jan-Dec.
2
Competing risk and random survival forest models for predicting survival in post-resection elderly stage I-III colorectal cancer patients.用于预测I-III期老年结直肠癌患者术后生存情况的竞争风险和随机生存森林模型
Sci Rep. 2025 Jul 7;15(1):24269. doi: 10.1038/s41598-025-05824-1.
3
Guaiac-based faecal occult blood tests versus faecal immunochemical tests for colorectal cancer screening in average-risk individuals.基于愈创木脂的粪便潜血试验与粪便免疫化学试验用于一般风险人群结直肠癌筛查。
Cochrane Database Syst Rev. 2022 Jun 6;6(6):CD009276. doi: 10.1002/14651858.CD009276.pub2.
4
Faecal immunochemical tests to triage patients with lower abdominal symptoms for suspected colorectal cancer referrals in primary care: a systematic review and cost-effectiveness analysis.粪便免疫化学检测用于在初级保健中对有下腹部症状的患者进行分流,以确定是否需要转诊疑似结直肠癌患者:一项系统评价和成本效益分析。
Health Technol Assess. 2017 May;21(33):1-234. doi: 10.3310/hta21330.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
7
Predicting Early-Onset Colorectal Cancer in Individuals Below Screening Age Using Machine Learning and Real-World Data: Case Control Study.利用机器学习和真实世界数据预测筛查年龄以下个体的早发性结直肠癌:病例对照研究
JMIR Cancer. 2025 Jun 19;11:e64506. doi: 10.2196/64506.
8
Strategies for detecting colon cancer in patients with inflammatory bowel disease.炎症性肠病患者结肠癌的检测策略。
Cochrane Database Syst Rev. 2017 Sep 18;9(9):CD000279. doi: 10.1002/14651858.CD000279.pub4.
9
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.使用LightGBM预测非糖尿病人群的胰岛素抵抗及其临床价值的队列验证:横断面和回顾性队列研究
JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238.
10
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.

本文引用的文献

1
The prognostic value of the neutrophil-to-lymphocyte ratio (NLR) and platelet-to-lymphocyte ratio (PLR) in colorectal cancer and colorectal anastomotic leakage patients: a retrospective study.中性粒细胞与淋巴细胞比值(NLR)和血小板与淋巴细胞比值(PLR)在结直肠癌及结直肠吻合口漏患者中的预后价值:一项回顾性研究。
BMC Surg. 2025 Feb 5;25(1):57. doi: 10.1186/s12893-024-02708-5.
2
COLOFIT: Development and Internal-External Validation of Models Using Age, Sex, Faecal Immunochemical and Blood Tests to Optimise Diagnosis of Colorectal Cancer in Symptomatic Patients.COLOFIT:利用年龄、性别、粪便免疫化学检测和血液检测优化有症状患者结直肠癌诊断的模型开发及内部-外部验证
Aliment Pharmacol Ther. 2025 Mar;61(5):852-864. doi: 10.1111/apt.18459. Epub 2025 Jan 7.
3
Associations between non-anaemic iron deficiency and outcomes following elective surgery for colorectal cancer: a prospective cohort study.非贫血性缺铁与结直肠癌择期手术后结局的关联:一项前瞻性队列研究。
Anaesthesia. 2025 Jan;80(1):48-58. doi: 10.1111/anae.16444. Epub 2024 Oct 15.
4
Cancer incidence and mortality in China, 2022.2022年中国癌症发病率与死亡率
J Natl Cancer Cent. 2024 Feb 2;4(1):47-53. doi: 10.1016/j.jncc.2024.01.006. eCollection 2024 Mar.
5
Next-Generation Multitarget Stool DNA Test for Colorectal Cancer Screening.用于结直肠癌筛查的下一代多靶点粪便 DNA 检测。
N Engl J Med. 2024 Mar 14;390(11):984-993. doi: 10.1056/NEJMoa2310336.
6
A Cell-free DNA Blood-Based Test for Colorectal Cancer Screening.基于无细胞游离 DNA 的血液检测用于结直肠癌筛查。
N Engl J Med. 2024 Mar 14;390(11):973-983. doi: 10.1056/NEJMoa2304714.
7
Effectiveness of Colorectal Cancer (CRC) Screening on All-Cause and CRC-Specific Mortality Reduction: A Systematic Review and Meta-Analysis.结直肠癌(CRC)筛查对降低全因死亡率和CRC特异性死亡率的有效性:一项系统评价和荟萃分析
Cancers (Basel). 2023 Mar 24;15(7):1948. doi: 10.3390/cancers15071948.
8
PCA outperforms popular hidden variable inference methods for molecular QTL mapping.主成分分析在分子数量性状定位的隐变量推断方法中表现出色。
Genome Biol. 2022 Oct 11;23(1):210. doi: 10.1186/s13059-022-02761-4.
9
A global view of adherence to colonoscopy follow-up in cascade screening of colorectal cancer.结直肠癌级联筛查中结肠镜检查随访依从性的全球观察。
Eur J Cancer Care (Engl). 2022 Sep;31(5):e13577. doi: 10.1111/ecc.13577. Epub 2022 Mar 21.
10
A stacking ensemble deep learning approach to cancer type classification based on TCGA data.基于 TCGA 数据的癌症类型分类的堆叠集成深度学习方法。
Sci Rep. 2021 Aug 2;11(1):15626. doi: 10.1038/s41598-021-95128-x.