• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于常规实验室检验数据的机器学习模型检测结直肠癌

Colorectal Cancer Detected by Machine Learning Models Using Conventional Laboratory Test Data.

机构信息

373651Department of Clinical Laboratory, The Sixth Affiliated Hospital, 26469Sun Yat-sen University, Guangzhou, Guangdong, China.

373651Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases, Guangdong Institute of Gastroenterology, The Sixth Affiliated Hospital, 26469Sun Yat-sen University, Guangzhou, Guangdong, China.

出版信息

Technol Cancer Res Treat. 2021 Jan-Dec;20:15330338211058352. doi: 10.1177/15330338211058352.

DOI:10.1177/15330338211058352
PMID:34806496
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8606732/
Abstract

Current diagnostic methods for colorectal cancer (CRC) are colonoscopy and sigmoidoscopy, which are invasive and complex procedures with possible complications. This study aimed to determine models for CRC identification that involve minimally invasive, affordable, portable, and accurate screening variables. This was a retrospective study that used data from electronic medical records of patients with CRC and healthy individuals between July 2017 and June 2018. Laboratory data, including liver enzymes, lipid profiles, complete blood counts, and tumor biomarkers, were extracted from the electronic medical records. Five machine learning models (logistic regression, random forest, k-nearest neighbors, support vector machine [SVM], and naïve Bayes) were used to identify CRC. The performances were evaluated using the areas under the curve (AUCs), sensitivity, specificity, positive predictive values (PPV), and negative predictive values (NPV). A total of 1164 electronic medical records (CRC patients: 582; healthy controls: 582) were included. The logistic regression model achieved the highest performance in identifying CRC (AUC: 0.865, sensitivity: 89.5%, specificity: 83.5%, PPV: 84.4%, NPV: 88.9%). The first four weighted features in the model were carcinoembryonic antigen (CEA), hemoglobin (HGB), lipoprotein (a) (Lp(a)), and high-density lipoprotein (HDL). A diagnostic model for CRC was established based on the four indicators, with an AUC of 0.849 (0.840-0.860) for identifying all CRC patients, and it performed best in discriminating patients with late colon cancer from healthy individuals with an AUC of 0.905 (0.889-0.929). The logistic regression model based on CEA, HGB, Lp(a), and HDL might be a powerful, noninvasive, and cost-effective method to identify CRC.

摘要

目前用于结直肠癌(CRC)的诊断方法为结肠镜检查和乙状结肠镜检查,这两种方法均具有侵入性和复杂性,且可能引发并发症。本研究旨在确定涉及微创、经济实惠、便携且准确的筛查变量的 CRC 识别模型。这是一项回顾性研究,使用了 2017 年 7 月至 2018 年 6 月间 CRC 患者和健康个体的电子病历数据。从电子病历中提取了实验室数据,包括肝酶、血脂谱、全血细胞计数和肿瘤生物标志物。使用了五种机器学习模型(逻辑回归、随机森林、k-最近邻、支持向量机[ SVM ]和朴素贝叶斯)来识别 CRC。使用曲线下面积(AUC)、敏感性、特异性、阳性预测值(PPV)和阴性预测值(NPV)来评估性能。共纳入 1164 份电子病历(CRC 患者:582 例;健康对照者:582 例)。逻辑回归模型在识别 CRC 方面表现最佳(AUC:0.865,敏感性:89.5%,特异性:83.5%,PPV:84.4%,NPV:88.9%)。模型中的前四个加权特征是癌胚抗原(CEA)、血红蛋白(HGB)、脂蛋白(a)(Lp(a))和高密度脂蛋白(HDL)。基于这四个指标建立了 CRC 诊断模型,该模型对所有 CRC 患者的识别 AUC 为 0.849(0.840-0.860),在区分晚期结肠癌患者与健康个体方面表现最佳,AUC 为 0.905(0.889-0.929)。基于 CEA、HGB、Lp(a)和 HDL 的逻辑回归模型可能是一种强大、非侵入性且具有成本效益的 CRC 识别方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/1f7fa99eb048/10.1177_15330338211058352-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/2f67d8c528c8/10.1177_15330338211058352-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/8e3a8a34284e/10.1177_15330338211058352-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/8b61641cecad/10.1177_15330338211058352-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/1f7fa99eb048/10.1177_15330338211058352-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/2f67d8c528c8/10.1177_15330338211058352-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/8e3a8a34284e/10.1177_15330338211058352-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/8b61641cecad/10.1177_15330338211058352-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47cd/8606732/1f7fa99eb048/10.1177_15330338211058352-fig4.jpg

相似文献

1
Colorectal Cancer Detected by Machine Learning Models Using Conventional Laboratory Test Data.基于常规实验室检验数据的机器学习模型检测结直肠癌
Technol Cancer Res Treat. 2021 Jan-Dec;20:15330338211058352. doi: 10.1177/15330338211058352.
2
Blood Biomarkers Panels for Screening of Colorectal Cancer and Adenoma on a Machine Learning-Assisted Detection Platform.基于机器学习辅助检测平台的用于结直肠癌和腺瘤筛查的血液生物标志物检测面板。
Cancer Control. 2023 Jan-Dec;30:10732748231222109. doi: 10.1177/10732748231222109.
3
Altered Serum Annexin A2 Might Be a New Potential Diagnostic Biomarker in Human Colorectal Cancer.血清膜联蛋白 A2 的改变可能是人类结直肠癌的一种新的潜在诊断生物标志物。
Ann Clin Lab Sci. 2020 Nov;50(6):726-733.
4
Classification and Diagnostic Prediction of Colorectal Cancer Mortality Based on Machine Learning Algorithms: A Multicenter National Study.基于机器学习算法的结直肠癌死亡率的分类和诊断预测:一项多中心全国性研究。
Asian Pac J Cancer Prev. 2024 Jan 1;25(1):333-342. doi: 10.31557/APJCP.2024.25.1.333.
5
A reliable method for colorectal cancer prediction based on feature selection and support vector machine.基于特征选择和支持向量机的结直肠癌预测可靠方法。
Med Biol Eng Comput. 2019 Apr;57(4):901-912. doi: 10.1007/s11517-018-1930-0. Epub 2018 Nov 26.
6
Utility of machine learning in developing a predictive model for early-age-onset colorectal neoplasia using electronic health records.机器学习在利用电子健康记录开发早发性结直肠肿瘤预测模型中的应用。
PLoS One. 2022 Mar 10;17(3):e0265209. doi: 10.1371/journal.pone.0265209. eCollection 2022.
7
Comparison of machine learning algorithms for the identification of acute exacerbations in chronic obstructive pulmonary disease.机器学习算法在慢性阻塞性肺疾病急性加重识别中的比较。
Comput Methods Programs Biomed. 2020 May;188:105267. doi: 10.1016/j.cmpb.2019.105267. Epub 2019 Dec 9.
8
Circulating fibrinogen to pre-albumin ratio is a promising biomarker for diagnosis of colorectal cancer.循环纤维蛋白原与前白蛋白比值是诊断结直肠癌的一种有前景的生物标志物。
J Clin Lab Anal. 2019 Jan;33(1):e22635. doi: 10.1002/jcla.22635. Epub 2018 Jul 25.
9
Diagnostic Value of Carcinoembryonic Antigen in Ascites for Colorectal Cancer with Peritoneal Carcinomatosis.癌胚抗原在结直肠癌伴腹膜转移癌腹水中的诊断价值
Korean J Gastroenterol. 2018 Jun 25;71(6):332-337. doi: 10.4166/kjg.2018.71.6.332.
10
Exploration of the application potential of serum multi-biomarker model in colorectal cancer screening.探讨血清多标志物模型在结直肠癌筛查中的应用潜力。
Sci Rep. 2024 May 2;14(1):10127. doi: 10.1038/s41598-024-60867-0.

引用本文的文献

1
Expression of VISTA on T cells in patients with cancer colon.结肠癌患者T细胞上VISTA的表达。
J Egypt Natl Canc Inst. 2025 May 23;37(1):41. doi: 10.1186/s43046-025-00287-x.
2
Explainable Machine Learning Models for Colorectal Cancer Prediction Using Clinical Laboratory Data.使用临床实验室数据进行结直肠癌预测的可解释机器学习模型
Cancer Control. 2025 Jan-Dec;32:10732748251336417. doi: 10.1177/10732748251336417. Epub 2025 May 7.
3
A practical approach for colorectal cancer diagnosis based on machine learning.一种基于机器学习的结直肠癌诊断实用方法。

本文引用的文献

1
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.《全球癌症统计数据 2020:全球 185 个国家和地区 36 种癌症的发病率和死亡率估计》。
CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.
2
Neutrophil-to-lymphocyte ratio is a prognostic factor for colon cancer: a propensity score analysis.中性粒细胞与淋巴细胞比值是结肠癌的预后因素:倾向评分分析。
BMC Cancer. 2020 Sep 25;20(1):922. doi: 10.1186/s12885-020-07429-5.
3
Colorectal cancer statistics, 2020.
PLoS One. 2025 Apr 29;20(4):e0321009. doi: 10.1371/journal.pone.0321009. eCollection 2025.
4
Non-Invasive Cancer Detection Using Blood Test and Predictive Modeling Approach.使用血液检测和预测建模方法进行非侵入性癌症检测。
Adv Appl Bioinform Chem. 2025 Jan 10;17:159-178. doi: 10.2147/AABC.S488604. eCollection 2024.
5
Prediction of Acute Kidney Injury for Critically Ill Cardiogenic Shock Patients with Machine Learning Algorithms.使用机器学习算法预测重症心源性休克患者的急性肾损伤
Int J Gen Med. 2025 Jan 6;18:33-42. doi: 10.2147/IJGM.S489362. eCollection 2025.
6
Are we ready to integrate advanced artificial intelligence models in clinical laboratory?我们准备好将先进的人工智能模型整合到临床实验室中了吗?
Biochem Med (Zagreb). 2025 Feb 15;35(1):010501. doi: 10.11613/BM.2025.010501. Epub 2024 Dec 15.
7
Construction of diagnostic models with machine-learning algorithms for colorectal cancer based on clinical laboratory parameters.基于临床实验室参数,使用机器学习算法构建结直肠癌诊断模型。
J Gastrointest Oncol. 2024 Oct 31;15(5):2145-2156. doi: 10.21037/jgo-24-516. Epub 2024 Sep 12.
8
A prognostic nomogram integrating carcinoembryonic antigen levels for predicting overall survival in elderly patients with stage II-III colorectal cancer.一种整合癌胚抗原水平的预后列线图,用于预测老年II-III期结直肠癌患者的总生存期。
J Gastrointest Oncol. 2024 Feb 29;15(1):164-178. doi: 10.21037/jgo-23-863. Epub 2024 Feb 27.
9
Machine learning-based identification of colorectal advanced adenoma using clinical and laboratory data: a phase I exploratory study in accordance with updated World Endoscopy Organization guidelines for noninvasive colorectal cancer screening tests.利用临床和实验室数据基于机器学习识别结直肠高级别腺瘤:一项按照世界内镜组织非侵入性结直肠癌筛查试验更新指南开展的I期探索性研究
Front Oncol. 2024 Feb 23;14:1325514. doi: 10.3389/fonc.2024.1325514. eCollection 2024.
10
Blood Biomarkers Panels for Screening of Colorectal Cancer and Adenoma on a Machine Learning-Assisted Detection Platform.基于机器学习辅助检测平台的用于结直肠癌和腺瘤筛查的血液生物标志物检测面板。
Cancer Control. 2023 Jan-Dec;30:10732748231222109. doi: 10.1177/10732748231222109.
2020 年结直肠癌统计数据。
CA Cancer J Clin. 2020 May;70(3):145-164. doi: 10.3322/caac.21601. Epub 2020 Mar 5.
4
Platelet Indices in Colorectal Cancer Patients with Synchronous Liver Metastases.伴有同时性肝转移的结直肠癌患者的血小板指数
Gastroenterol Res Pract. 2019 Nov 4;2019:6397513. doi: 10.1155/2019/6397513. eCollection 2019.
5
Machine learning enables detection of early-stage colorectal cancer by whole-genome sequencing of plasma cell-free DNA.机器学习通过对血浆游离细胞 DNA 进行全基因组测序来检测早期结直肠癌。
BMC Cancer. 2019 Aug 23;19(1):832. doi: 10.1186/s12885-019-6003-8.
6
Terminology, Molecular Features, Epidemiology, and Management of Serrated Colorectal Neoplasia.锯齿状结直肠肿瘤的术语、分子特征、流行病学和处理。
Gastroenterology. 2019 Oct;157(4):949-966.e4. doi: 10.1053/j.gastro.2019.06.041. Epub 2019 Jul 16.
7
Inflammation and Cancer: Triggers, Mechanisms, and Consequences.炎症与癌症:触发因素、机制与后果。
Immunity. 2019 Jul 16;51(1):27-41. doi: 10.1016/j.immuni.2019.06.025.
8
Chinese Society of Clinical Oncology (CSCO) diagnosis and treatment guidelines for colorectal cancer 2018 (English version).中国临床肿瘤学会(CSCO)结直肠癌诊疗指南2018(英文版)
Chin J Cancer Res. 2019 Feb;31(1):117-134. doi: 10.21147/j.issn.1000-9604.2019.01.07.
9
Changes in lipids composition and metabolism in colorectal cancer: a review.结直肠癌中脂质组成和代谢的变化:综述。
Lipids Health Dis. 2019 Jan 26;18(1):29. doi: 10.1186/s12944-019-0977-8.
10
High-Throughput Omics and Statistical Learning Integration for the Discovery and Validation of Novel Diagnostic Signatures in Colorectal Cancer.高通量组学与统计学习在结直肠癌新型诊断标志物发现与验证中的整合。
Int J Mol Sci. 2019 Jan 12;20(2):296. doi: 10.3390/ijms20020296.