• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用机器学习算法改进多肿瘤生物标志物健康检查测试

Improving Multi-Tumor Biomarker Health Check-up Tests with Machine Learning Algorithms.

作者信息

Wang Hsin-Yao, Chen Chun-Hsien, Shi Steve, Chung Chia-Ru, Wen Ying-Hao, Wu Min-Hsien, Lebowitz Michael S, Zhou Jiming, Lu Jang-Jih

机构信息

Department of Laboratory Medicine, Chang Gung Memorial Hospital at Linkou, Taoyuan City 33305, Taiwan.

20/20 GeneSystems, Inc, Rockville, MD 20850, USA.

出版信息

Cancers (Basel). 2020 Jun 1;12(6):1442. doi: 10.3390/cancers12061442.

DOI:10.3390/cancers12061442
PMID:32492934
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7352838/
Abstract

BACKGROUND

Tumor markers are used to screen tens of millions of individuals worldwide at annual health check-ups, especially in East Asia. Machine learning (ML)-based algorithms that improve the diagnostic accuracy and clinical utility of these tests can have substantial impact leading to the early diagnosis of cancer.

METHODS

ML-based algorithms, including a cancer screening algorithm and a secondary organ of origin algorithm, were developed and validated using a large real world dataset (RWD) from asymptomatic individuals undergoing routine cancer screening at a Taiwanese medical center between May 2001 and April 2015. External validation was performed using data from the same period from a separate medical center. The data set included tumor marker values, age, and gender from 27,938 individuals, including 342 subsequently confirmed cancer cases.

RESULTS

Separate gender-specific cancer screening algorithms were developed. For men, a logistic regression-based algorithm outperformed single-marker and other ML-based algorithms, with a mean area under the receiver operating characteristic curve (AUROC) of 0.7654 in internal and 0.8736 in external cross validation. For women, a random forest-based algorithm attained a mean AUROC of 0.6665 in internal and 0.6938 in external cross validation. The median time to cancer diagnosis (TTD) in men was 451.5, 204.5, and 28 days for the mild, moderate, and high-risk groups, respectively; for women, the median TTD was 229, 132, and 125 days for the mild, moderate, and high-risk groups. A second algorithm was developed to predict the most likely affected organ systems for at-risk individuals. The algorithm yielded 0.8120 sensitivity and 0.6490 specificity for men, and 0.8170 sensitivity and 0.6750 specificity for women.

CONCLUSIONS

ML-derived algorithms, trained and validated by using a RWD, can significantly improve tumor marker-based screening for multiple types of early stage cancers, suggest the tissue of origin, and provide guidance for patient follow-up.

摘要

背景

肿瘤标志物用于全球数千万人的年度健康检查,尤其是在东亚地区。基于机器学习(ML)的算法可提高这些检测的诊断准确性和临床实用性,对癌症的早期诊断具有重大影响。

方法

使用2001年5月至2015年4月在台湾一家医疗中心接受常规癌症筛查的无症状个体的大型真实世界数据集(RWD),开发并验证了基于ML的算法,包括癌症筛查算法和继发器官起源算法。使用同期另一家医疗中心的数据进行外部验证。数据集包括27938名个体的肿瘤标志物值、年龄和性别,其中包括342例随后确诊的癌症病例。

结果

开发了针对不同性别的癌症筛查算法。对于男性,基于逻辑回归的算法优于单标志物算法和其他基于ML的算法,在内部交叉验证中,受试者操作特征曲线下面积(AUROC)的平均值为0.7654,在外部交叉验证中为0.8736。对于女性,基于随机森林的算法在内部交叉验证中的平均AUROC为0.6665,在外部交叉验证中为0.6938。男性轻度、中度和高危组的癌症诊断中位时间(TTD)分别为451.5天、204.5天和28天;女性轻度、中度和高危组的中位TTD分别为229天、132天和125天。开发了第二种算法来预测高危个体最可能受影响的器官系统。该算法对男性的敏感性为0.8120,特异性为0.6490;对女性的敏感性为0.8170,特异性为0.6750。

结论

通过使用RWD进行训练和验证的基于ML的算法,可显著改善基于肿瘤标志物的多种早期癌症筛查,提示起源组织,并为患者随访提供指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/93101c6adc53/cancers-12-01442-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/b71974018048/cancers-12-01442-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/3372ae3161e0/cancers-12-01442-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/dc3af0eee30c/cancers-12-01442-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/8e7b09ae1ad6/cancers-12-01442-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/93101c6adc53/cancers-12-01442-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/b71974018048/cancers-12-01442-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/3372ae3161e0/cancers-12-01442-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/dc3af0eee30c/cancers-12-01442-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/8e7b09ae1ad6/cancers-12-01442-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0499/7352838/93101c6adc53/cancers-12-01442-g005.jpg

相似文献

1
Improving Multi-Tumor Biomarker Health Check-up Tests with Machine Learning Algorithms.运用机器学习算法改进多肿瘤生物标志物健康检查测试
Cancers (Basel). 2020 Jun 1;12(6):1442. doi: 10.3390/cancers12061442.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Development and validation of explainable machine-learning models for carotid atherosclerosis early screening.开发和验证可解释的机器学习模型,用于颈动脉粥样硬化的早期筛查。
J Transl Med. 2023 May 29;21(1):353. doi: 10.1186/s12967-023-04093-8.
4
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
5
Machine learning algorithms for early sepsis detection in the emergency department: A retrospective study.机器学习算法在急诊科早期脓毒症检测中的应用:一项回顾性研究。
Int J Med Inform. 2022 Apr;160:104689. doi: 10.1016/j.ijmedinf.2022.104689. Epub 2022 Jan 20.
6
Claims-Based Algorithms for Identifying Patients With Pulmonary Hypertension: A Comparison of Decision Rules and Machine-Learning Approaches.基于索赔的肺动脉高压患者识别算法:决策规则与机器学习方法的比较。
J Am Heart Assoc. 2020 Oct 20;9(19):e016648. doi: 10.1161/JAHA.120.016648. Epub 2020 Sep 29.
7
Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study.基于电子病历中的屈光数据预测中国学龄儿童近视进展:一项回顾性、多中心机器学习研究。
PLoS Med. 2018 Nov 6;15(11):e1002674. doi: 10.1371/journal.pmed.1002674. eCollection 2018 Nov.
8
Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?机器学习算法能否预测丹麦膝关节置换登记处的早期翻修 TKA?
Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.
9
Machine Learning Approaches to Predict 6-Month Mortality Among Patients With Cancer.机器学习方法预测癌症患者 6 个月死亡率。
JAMA Netw Open. 2019 Oct 2;2(10):e1915997. doi: 10.1001/jamanetworkopen.2019.15997.
10
Comparison Between Statistical Model and Machine Learning Methods for Predicting the Risk of Renal Function Decline Using Routine Clinical Data in Health Screening.利用健康筛查中的常规临床数据预测肾功能下降风险的统计模型与机器学习方法的比较
Risk Manag Healthc Policy. 2022 Apr 26;15:817-826. doi: 10.2147/RMHP.S346856. eCollection 2022.

引用本文的文献

1
Artificial intelligence utilization in cancer screening program across ASEAN: a scoping review.东盟地区癌症筛查项目中人工智能的应用:一项范围综述
BMC Cancer. 2025 Apr 15;25(1):703. doi: 10.1186/s12885-025-14026-x.
2
Multicancer Detection (MCD) Testing in Gastrointestinal Cancers: An Evolving Tool for Early Diagnosis.胃肠道癌症的多癌检测(MCD)测试:一种不断发展的早期诊断工具。
Curr Gastroenterol Rep. 2025 Mar 6;27(1):19. doi: 10.1007/s11894-025-00970-y.
3
Identifying heterogeneous subgroups of systemic autoimmune diseases by applying a joint dimension reduction and clustering approach to immunomarkers.

本文引用的文献

1
Rapid classification of group B Streptococcus serotypes based on matrix-assisted laser desorption ionization-time of flight mass spectrometry and machine learning techniques.基于基质辅助激光解吸电离飞行时间质谱和机器学习技术的 B 群链球菌血清型快速分类。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 19):703. doi: 10.1186/s12859-019-3282-7.
2
Incorporating Statistical Test and Machine Intelligence Into Strain Typing of Based on Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry.将统计检验和机器智能纳入基于基质辅助激光解吸电离飞行时间质谱的菌株分型
Front Microbiol. 2019 Sep 13;10:2120. doi: 10.3389/fmicb.2019.02120. eCollection 2019.
3
通过对免疫标志物应用联合降维和聚类方法来识别系统性自身免疫性疾病的异质子组。
BioData Min. 2024 Sep 16;17(1):36. doi: 10.1186/s13040-024-00389-7.
4
Integrating Artificial Intelligence for Advancing Multiple-Cancer Early Detection via Serum Biomarkers: A Narrative Review.整合人工智能通过血清生物标志物推进多种癌症早期检测:一篇叙述性综述。
Cancers (Basel). 2024 Feb 21;16(5):862. doi: 10.3390/cancers16050862.
5
The Diagnostic Power of Circulating miR-1246 in Screening Cancer: An Updated Meta-analysis.循环 miR-1246 在癌症筛查中的诊断效能:一项更新的荟萃分析。
Oxid Med Cell Longev. 2023 Apr 20;2023:8379231. doi: 10.1155/2023/8379231. eCollection 2023.
6
Addressing the Clinical Feasibility of Adopting Circulating miRNA for Breast Cancer Detection, Monitoring and Management with Artificial Intelligence and Machine Learning Platforms.探讨采用循环 miRNA 与人工智能和机器学习平台进行乳腺癌检测、监测和管理的临床可行性。
Int J Mol Sci. 2022 Dec 6;23(23):15382. doi: 10.3390/ijms232315382.
7
Study design considerations for trials to evaluate multicancer early detection assays for clinical utility.用于评估多癌种早期检测检测试剂盒临床实用性的临床试验设计考虑因素。
J Natl Cancer Inst. 2023 Mar 9;115(3):250-257. doi: 10.1093/jnci/djac218.
8
Uncovering nasopharyngeal carcinoma from chronic rhinosinusitis and healthy subjects using routine medical tests via machine learning.利用机器学习从慢性鼻-鼻窦炎和健康受试者中挖掘鼻咽癌,采用常规医疗检测手段。
PLoS One. 2022 Sep 9;17(9):e0274263. doi: 10.1371/journal.pone.0274263. eCollection 2022.
9
Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study.临床实验室数据集推断算法的能效:绿色人工智能研究。
J Med Internet Res. 2022 Jan 25;24(1):e28036. doi: 10.2196/28036.
10
Machine Learning in Prediction of Bladder Cancer on Clinical Laboratory Data.基于临床实验室数据的机器学习在膀胱癌预测中的应用
Diagnostics (Basel). 2022 Jan 14;12(1):203. doi: 10.3390/diagnostics12010203.
Increase Trichomonas vaginalis detection based on urine routine analysis through a machine learning approach.
基于机器学习方法提高尿常规分析中阴道毛滴虫的检出率。
Sci Rep. 2019 Aug 19;9(1):11074. doi: 10.1038/s41598-019-47361-8.
4
Making Machine Learning Models Clinically Useful.让机器学习模型在临床上发挥作用。
JAMA. 2019 Oct 8;322(14):1351-1352. doi: 10.1001/jama.2019.10306.
5
National and State Estimates of Lost Earnings From Cancer Deaths in the United States.美国癌症死亡导致的收入损失的全国及各州估计数。
JAMA Oncol. 2019 Sep 1;5(9):e191460. doi: 10.1001/jamaoncol.2019.1460. Epub 2019 Sep 12.
6
Predicting breast cancer metastasis by using serum biomarkers and clinicopathological data with machine learning technologies.利用机器学习技术,通过血清生物标志物和临床病理数据预测乳腺癌转移。
Int J Med Inform. 2019 Aug;128:79-86. doi: 10.1016/j.ijmedinf.2019.05.003. Epub 2019 May 7.
7
Cervical and colorectal cancer screening prevalence before and after Affordable Care Act Medicaid expansion.平价医疗法案扩大医疗补助计划前后的宫颈癌和结直肠癌筛查普及率。
Prev Med. 2019 Jul;124:91-97. doi: 10.1016/j.ypmed.2019.05.003. Epub 2019 May 8.
8
Rapid Detection of Heterogeneous Vancomycin-Intermediate Based on Matrix-Assisted Laser Desorption Ionization Time-of-Flight: Using a Machine Learning Approach and Unbiased Validation.基于基质辅助激光解吸电离飞行时间质谱的异质性万古霉素中介菌快速检测:采用机器学习方法及无偏倚验证
Front Microbiol. 2018 Oct 11;9:2393. doi: 10.3389/fmicb.2018.02393. eCollection 2018.
9
Geographic Availability of Low-Dose Computed Tomography for Lung Cancer Screening in the United States, 2017.2017年美国低剂量计算机断层扫描用于肺癌筛查的地理可及性
Prev Chronic Dis. 2018 Oct 4;15:E119. doi: 10.5888/pcd15.180241.
10
Faecal Occult Blood Point-of-Care Tests.粪便潜血即时检验
J Gastrointest Cancer. 2018 Dec;49(4):402-405. doi: 10.1007/s12029-018-0169-1.