• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

堆叠机器学习算法用于基于生物标志物的盆腔肿块术前诊断

Stacking Machine Learning Algorithms for Biomarker-Based Preoperative Diagnosis of a Pelvic Mass.

作者信息

Shaw Reid, Lokshin Anna E, Miller Michael C, Messerlian-Lambert Geralyn, Moore Richard G

机构信息

Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Wilmot Cancer Institute, University of Rochester, Rochester, NY 14642, USA.

Hillman Cancer Center, University of Pittsburgh, Pittsburg, PA 15219, USA.

出版信息

Cancers (Basel). 2022 Mar 2;14(5):1291. doi: 10.3390/cancers14051291.

DOI:10.3390/cancers14051291
PMID:35267599
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8909341/
Abstract

OBJECTIVE

To identify the most predictive parameters of ovarian malignancy and develop a machine learning (ML) based algorithm to preoperatively distinguish between a benign and malignant pelvic mass.

METHODS

Retrospective study of 70 predictive parameters collected from 140 women with a pelvic mass. The women were split into a 3:1 "training" to "testing" dataset. Feature selection was performed using Gini impurity through an embedded random forest model and principal component analysis. Nine unique ML classifiers were assessed across a variety of model-specific hyperparameters using 25 bootstrap resamples of the training data. Model predictions were then combined into an ensemble stack by LASSO regression. The final ensemble stack and individual classifiers were then applied to the testing dataset to assess model performance.

RESULTS

Feature selection identified HE4, CA125, and transferrin as three predictive parameters of malignancy. Assessment of the ensemble stack on the testing dataset outperformed all individual ML classifiers in predicting malignancy. The ensemble stack demonstrated an accuracy of 97.1%, a receiver operating characteristic (ROC) area under the curve (AUC) of 0.951, and a sensitivity of 93.3% with a specificity of 100%.

CONCLUSIONS

Combining the measurement of three distinct biomarkers with the stacking of multiple ML classifiers into an ensemble can provide valuable preoperative diagnostic predictions for patients with a pelvic mass.

摘要

目的

确定卵巢恶性肿瘤最具预测性的参数,并开发一种基于机器学习(ML)的算法,以在术前区分盆腔肿块的良恶性。

方法

对140例盆腔肿块女性患者收集的70个预测参数进行回顾性研究。将这些女性患者分为3:1的“训练”数据集和“测试”数据集。通过嵌入式随机森林模型和主成分分析,使用基尼不纯度进行特征选择。使用训练数据的25次自助重采样,在各种特定于模型的超参数上评估9种独特的ML分类器。然后通过LASSO回归将模型预测合并为一个集成堆叠。最后将集成堆叠和各个分类器应用于测试数据集,以评估模型性能。

结果

特征选择确定人附睾蛋白4(HE4)、癌抗原125(CA125)和转铁蛋白为恶性肿瘤的三个预测参数。在测试数据集上对集成堆叠的评估在预测恶性肿瘤方面优于所有单个ML分类器。集成堆叠的准确率为97.1%,曲线下面积(AUC)为0.951,灵敏度为93.3%,特异性为100%。

结论

将三种不同生物标志物的测量与多个ML分类器堆叠成一个集成相结合,可以为盆腔肿块患者提供有价值的术前诊断预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/276e/8909341/1a8a15de3806/cancers-14-01291-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/276e/8909341/97e16c78132a/cancers-14-01291-g001a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/276e/8909341/3cbd909b69e2/cancers-14-01291-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/276e/8909341/1a8a15de3806/cancers-14-01291-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/276e/8909341/97e16c78132a/cancers-14-01291-g001a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/276e/8909341/3cbd909b69e2/cancers-14-01291-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/276e/8909341/1a8a15de3806/cancers-14-01291-g003.jpg

相似文献

1
Stacking Machine Learning Algorithms for Biomarker-Based Preoperative Diagnosis of a Pelvic Mass.堆叠机器学习算法用于基于生物标志物的盆腔肿块术前诊断
Cancers (Basel). 2022 Mar 2;14(5):1291. doi: 10.3390/cancers14051291.
2
HE4, CA125, the Risk of Malignancy Algorithm and the Risk of Malignancy Index and complex pelvic masses - a prospective comparison in the pre-operative evaluation of pelvic masses in an Australian population.人附睾蛋白4、癌抗原125、恶性风险算法和恶性风险指数与复杂盆腔肿块——澳大利亚人群盆腔肿块术前评估中的前瞻性比较
Aust N Z J Obstet Gynaecol. 2015 Oct;55(5):493-7. doi: 10.1111/ajo.12363. Epub 2015 Jul 14.
3
The ROMA (Risk of Ovarian Malignancy Algorithm) for estimating the risk of epithelial ovarian cancer in women presenting with pelvic mass: is it really useful?ROMA(卵巢恶性肿瘤风险算法)用于评估有盆腔肿块表现的女性患上皮性卵巢癌的风险:它真的有用吗?
Clin Chem Lab Med. 2011 Mar;49(3):521-5. doi: 10.1515/CCLM.2011.075. Epub 2011 Feb 3.
4
The utility of human epididymal protein 4, cancer antigen 125, and risk for malignancy algorithm in ovarian cancer and endometriosis.人附睾蛋白 4、癌抗原 125 及恶性肿瘤风险算法在卵巢癌和子宫内膜异位症中的应用。
Int J Gynecol Cancer. 2012 Feb;22(2):238-44. doi: 10.1097/IGC.0b013e318234f852.
5
Does risk for ovarian malignancy algorithm excel human epididymis protein 4 and CA125 in predicting epithelial ovarian cancer: a meta-analysis.卵巢恶性肿瘤风险算法是否优于人附睾蛋白 4 和 CA125 预测上皮性卵巢癌:一项荟萃分析。
BMC Cancer. 2012 Jun 19;12:258. doi: 10.1186/1471-2407-12-258.
6
A predictive model combining human epididymal protein 4 and radiologic features for the diagnosis of ovarian cancer.一种结合人附睾蛋白 4 和影像学特征的预测模型,用于卵巢癌的诊断。
Gynecol Oncol. 2014 Mar;132(3):573-7. doi: 10.1016/j.ygyno.2014.01.005. Epub 2014 Jan 10.
7
Development of a multimarker assay for differential diagnosis of benign and malignant pelvic masses.开发一种多标志物检测方法用于鉴别诊断良恶性盆腔包块。
Clin Chim Acta. 2015 Feb 2;440:57-63. doi: 10.1016/j.cca.2014.11.013. Epub 2014 Nov 15.
8
Evaluation of HE4, CA125, risk of ovarian malignancy algorithm (ROMA) and risk of malignancy index (RMI) as diagnostic tools of epithelial ovarian cancer in patients with a pelvic mass.评估 HE4、CA125、卵巢恶性肿瘤风险算法(ROMA)和恶性肿瘤指数(RMI)在盆腔肿块患者中作为上皮性卵巢癌的诊断工具。
Gynecol Oncol. 2012 Nov;127(2):379-83. doi: 10.1016/j.ygyno.2012.07.106. Epub 2012 Jul 24.
9
[Comparison of serum cancer antigen 125, human epididymis protein 4, ROMA, and CPH-I for diagnosis of ovarian cancer in Chinese patients with ovarian mass].血清癌抗原125、人附睾蛋白4、ROMA及CPH-I在中国卵巢肿物患者中诊断卵巢癌的比较
Nan Fang Yi Ke Da Xue Xue Bao. 2019 Dec 30;39(12):1393-1401. doi: 10.12122/j.issn.1673-4254.2019.12.02.
10
Multiple biomarker algorithms to predict epithelial ovarian cancer in women with a pelvic mass: Can additional makers improve performance?多种生物标志物算法预测盆腔肿块女性的上皮性卵巢癌:是否有额外的标志物可以改善性能?
Gynecol Oncol. 2019 Jul;154(1):150-155. doi: 10.1016/j.ygyno.2019.04.006. Epub 2019 Apr 13.

引用本文的文献

1
Predictive machine learning and multimodal data to develop highly sensitive, composite biomarkers of disease progression in Friedreich ataxia.利用预测性机器学习和多模态数据开发高度敏感的弗里德赖希共济失调疾病进展复合生物标志物。
Sci Rep. 2025 May 21;15(1):17629. doi: 10.1038/s41598-025-01047-6.
2
Predicting a failure of postoperative thromboprophylaxis in non-small cell lung cancer: A stacking machine learning approach.预测非小细胞肺癌术后血栓预防失败:一种堆叠式机器学习方法。
PLoS One. 2025 Apr 1;20(4):e0320674. doi: 10.1371/journal.pone.0320674. eCollection 2025.
3
Emerging applications of gene editing technologies for the development of climate-resilient crops.

本文引用的文献

1
Biomarkers and Strategies for Early Detection of Ovarian Cancer.卵巢癌的早期检测的生物标志物和策略。
Cancer Epidemiol Biomarkers Prev. 2020 Dec;29(12):2504-2512. doi: 10.1158/1055-9965.EPI-20-1057. Epub 2020 Oct 13.
2
Cancer statistics, 2020.癌症统计数据,2020 年。
CA Cancer J Clin. 2020 Jan;70(1):7-30. doi: 10.3322/caac.21590. Epub 2020 Jan 8.
3
Application of Artificial Intelligence for Preoperative Diagnostic and Prognostic Prediction in Epithelial Ovarian Cancer Based on Blood Biomarkers.基于血液生物标志物的人工智能在卵巢上皮性癌术前诊断和预后预测中的应用。
基因编辑技术在培育气候适应型作物方面的新兴应用。
Front Genome Ed. 2025 Mar 10;7:1524767. doi: 10.3389/fgeed.2025.1524767. eCollection 2025.
4
Cell-free plasma telomere length correlated with the risk of cardiovascular events using machine learning classifiers.使用机器学习分类器,游离血浆端粒长度与心血管事件风险相关。
Sci Rep. 2024 Dec 5;14(1):30390. doi: 10.1038/s41598-024-76686-2.
5
Multicenter analysis of immunosuppressive medications on the risk of malignancy following adult solid organ transplantation.免疫抑制药物对成人实体器官移植后恶性肿瘤风险影响的多中心分析
Front Oncol. 2023 Jun 16;13:1146002. doi: 10.3389/fonc.2023.1146002. eCollection 2023.
6
A real-world comparison of the clinical and economic utility of OVA1 and CA125 in assessing ovarian tumor malignancy risk.在评估卵巢肿瘤恶性风险方面,OVA1 和 CA125 的临床和经济实用性的真实世界比较。
J Comp Eff Res. 2023 Jun;12(6):e230025. doi: 10.57264/cer-2023-0025. Epub 2023 May 22.
Clin Cancer Res. 2019 May 15;25(10):3006-3015. doi: 10.1158/1078-0432.CCR-18-3378. Epub 2019 Apr 11.
4
Validation of a second-generation multivariate index assay for malignancy risk of adnexal masses.附件包块良恶性风险第二代多变量指数检测方法的验证。
Am J Obstet Gynecol. 2016 Jul;215(1):82.e1-82.e11. doi: 10.1016/j.ajog.2016.03.003. Epub 2016 Mar 10.
5
Radiomic Machine-Learning Classifiers for Prognostic Biomarkers of Head and Neck Cancer.用于头颈癌预后生物标志物的放射组学机器学习分类器
Front Oncol. 2015 Dec 3;5:272. doi: 10.3389/fonc.2015.00272. eCollection 2015.
6
ENCAPP: elastic-net-based prognosis prediction and biomarker discovery for human cancers.ENCAPP:基于弹性网络的人类癌症预后预测和生物标志物发现
BMC Genomics. 2015 Apr 3;16(1):263. doi: 10.1186/s12864-015-1465-9.
7
Inclusion of minorities and women in cancer clinical trials, a decade later: Have we improved?癌症临床试验中少数族裔和女性的纳入:十年后,我们有进步吗?
Cancer. 2013 Aug 15;119(16):2956-63. doi: 10.1002/cncr.28168. Epub 2013 May 14.
8
Biomarker testing for ovarian cancer: clinical utility of multiplex assays.卵巢癌的生物标志物检测:多重分析的临床效用。
Mol Diagn Ther. 2013 Jun;17(3):139-46. doi: 10.1007/s40291-013-0027-6.
9
Ovarian malignancy risk stratification of the adnexal mass using a multivariate index assay.采用多变量指数检测对附件包块的卵巢恶性肿瘤风险进行分层。
Gynecol Oncol. 2013 Feb;128(2):252-9. doi: 10.1016/j.ygyno.2012.11.022. Epub 2012 Nov 21.
10
Human epididymis protein 4 is up-regulated in gastric and pancreatic adenocarcinomas.人附睾蛋白 4 在胃腺癌和胰腺腺癌中上调。
Hum Pathol. 2013 May;44(5):734-42. doi: 10.1016/j.humpath.2012.07.017. Epub 2012 Oct 16.