Survival After Radical Cystectomy for Bladder Cancer: Development of a Fair Machine Learning Model.

Authors

Carbunaru Samuel, Neshatvar Yassamin, Do Hyungrok, Murray Katie, Ranganath Rajesh, Nayan Madhur

Affiliations

Department of Urology, New York University School of Medicine, New York, NY, United States.

Department of Population Health, New York University School of Medicine, New York, NY, United States.

Publication information

JMIR Med Inform. 2024 Dec 13;12:e63289. doi: 10.2196/63289.

DOI:10.2196/63289

PMID:39671594

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11694706/

Abstract

BACKGROUND

Prediction models based on machine learning (ML) methods are being increasingly developed and adopted in health care. However, these models may be prone to bias and considered unfair if they demonstrate variable performance in population subgroups. An unfair model is of particular concern in bladder cancer, where disparities have been identified in sex and racial subgroups.

OBJECTIVE

This study aims (1) to develop an ML model to predict survival after radical cystectomy for bladder cancer and evaluate it for potential bias in sex and racial subgroups; and (2) to compare algorithm unfairness mitigation techniques to improve model fairness.

METHODS

We trained and compared various ML classification algorithms to predict 5-year survival after radical cystectomy using the National Cancer Database. The primary model performance metric was the F-score. The primary metric for model fairness was the equalized odds ratio (eOR). We compared 3 algorithm unfairness mitigation techniques to improve eOR.
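As a rough illustration of the two metrics named above, the sketch below computes an F-score and an equalized odds ratio in plain Python. It assumes the common definition of the equalized odds ratio (the smaller of the min/max ratios of the true-positive rate and the false-positive rate across groups); the abstract does not spell out the exact formula or tooling, and the labels, predictions, and group names here are hypothetical toy data, not values from the study.

```python
from collections import defaultdict


def f_score(y_true, y_pred):
    """F1 score for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def group_rates(y_true, y_pred, groups):
    """Return {group: (true_positive_rate, false_positive_rate)}."""
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for t, p, g in zip(y_true, y_pred, groups):
        if t == 1:
            counts[g]["tp" if p == 1 else "fn"] += 1
        else:
            counts[g]["fp" if p == 1 else "tn"] += 1
    rates = {}
    for g, c in counts.items():
        pos, neg = c["tp"] + c["fn"], c["fp"] + c["tn"]
        if pos == 0 or neg == 0:
            raise ValueError(f"group {g!r} needs both outcome classes")
        rates[g] = (c["tp"] / pos, c["fp"] / neg)
    return rates


def _min_max_ratio(values):
    """Ratio of the smallest to the largest value (1.0 if all are zero)."""
    highest = max(values)
    return min(values) / highest if highest > 0 else 1.0


def equalized_odds_ratio(y_true, y_pred, groups):
    """Smaller of the across-group TPR ratio and FPR ratio (assumed definition)."""
    rates = group_rates(y_true, y_pred, groups)
    tpr_ratio = _min_max_ratio([tpr for tpr, _ in rates.values()])
    fpr_ratio = _min_max_ratio([fpr for _, fpr in rates.values()])
    return min(tpr_ratio, fpr_ratio)


# Hypothetical toy data: two subgroups "A" and "B" (not from the study).
Y_TRUE = [1, 1, 1, 1, 0, 0] + [1, 1, 1, 1, 0, 0, 0, 0]
Y_PRED = [1, 1, 1, 0, 1, 0] + [1, 1, 0, 0, 1, 0, 0, 0]
GROUPS = ["A"] * 6 + ["B"] * 8

print(round(f_score(Y_TRUE, Y_PRED), 3))                       # 0.667
print(round(equalized_odds_ratio(Y_TRUE, Y_PRED, GROUPS), 3))  # 0.5
```

Under this definition a value of 1 means every subgroup has the same true-positive and false-positive rates, and values closer to 0 indicate a larger gap between the best-served and worst-served subgroups.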

RESULTS

We identified 16,481 patients; 23.1% (n=3800) were female, and 91.5% (n=15,080) were "White," 5% (n=832) were "Black," 2.3% (n=373) were "Hispanic," and 1.2% (n=196) were "Asian." The 5-year mortality rate was 75% (n=12,290). The best naive model was extreme gradient boosting (XGBoost), which had an F-score of 0.860 and eOR of 0.619. All unfairness mitigation techniques increased the eOR, with correlation remover showing the highest increase and resulting in a final eOR of 0.750. This mitigated model had F-scores of 0.86, 0.904, and 0.824 in the full, Black male, and Asian female test sets, respectively.
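The idea behind a correlation remover can be sketched in one dimension: subtract from a feature the part that is linearly explained by the (centered) sensitive attribute, so the adjusted feature is uncorrelated with it. This is a simplified illustration under stated assumptions, not the authors' implementation; the function names, the `alpha` blending parameter, and the toy values are all hypothetical, and a real pipeline would handle many features and sensitive columns at once.

```python
import math


def mean(values):
    return sum(values) / len(values)


def remove_linear_correlation(feature, sensitive, alpha=1.0):
    """Residualize `feature` on the centered `sensitive` attribute.

    alpha=1.0 removes the linear association fully; alpha=0.0 leaves the
    feature unchanged (illustrative blending parameter).
    """
    s_bar = mean(sensitive)
    s_centered = [s - s_bar for s in sensitive]
    var_s = sum(c * c for c in s_centered)
    if var_s == 0:
        return list(feature)  # constant sensitive attribute: nothing to remove
    # Least-squares slope of the feature on the centered sensitive attribute.
    beta = sum(c * x for c, x in zip(s_centered, feature)) / var_s
    return [x - alpha * beta * c for x, c in zip(feature, s_centered)]


def pearson(xs, ys):
    """Pearson correlation coefficient (assumes neither input is constant)."""
    x_bar, y_bar = mean(xs), mean(ys)
    cov = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    var_x = sum((x - x_bar) ** 2 for x in xs)
    var_y = sum((y - y_bar) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy data: a binary sensitive attribute and one numeric feature.
SENSITIVE = [0, 0, 0, 1, 1, 1]
FEATURE = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
CLEANED = remove_linear_correlation(FEATURE, SENSITIVE)

print(round(pearson(FEATURE, SENSITIVE), 3))  # 0.878
print(CLEANED)                                # [2.5, 3.5, 4.5, 2.5, 3.5, 4.5]
```

After the adjustment both toy groups have the same feature mean, so a model trained on the cleaned feature can no longer recover the sensitive attribute through a linear relationship with it. Nonlinear associations are not addressed by this kind of preprocessing.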

CONCLUSIONS

The ML model predicting survival after radical cystectomy exhibited bias across sex and racial subgroups. By using algorithm unfairness mitigation techniques, we improved algorithmic fairness as measured by the eOR. Our study highlights the role of not only evaluating for model bias but also actively mitigating such disparities to ensure equitable health care delivery. We also deployed the first web-based fair ML model for predicting survival after radical cystectomy.
