Suppr超能文献

膀胱癌根治性膀胱切除术后的生存情况:一种合理的机器学习模型的开发

Survival After Radical Cystectomy for Bladder Cancer: Development of a Fair Machine Learning Model.

作者信息

Carbunaru Samuel, Neshatvar Yassamin, Do Hyungrok, Murray Katie, Ranganath Rajesh, Nayan Madhur

机构信息

Department of Urology, New York University School of Medicine, New York, NY, United States.

Department of Population Health, New York University School of Medicine, New York, NY, United States.

出版信息

JMIR Med Inform. 2024 Dec 13;12:e63289. doi: 10.2196/63289.

Abstract

BACKGROUND

Prediction models based on machine learning (ML) methods are being increasingly developed and adopted in health care. However, these models may be prone to bias and considered unfair if they demonstrate variable performance in population subgroups. An unfair model is of particular concern in bladder cancer, where disparities have been identified in sex and racial subgroups.

OBJECTIVE

This study aims (1) to develop a ML model to predict survival after radical cystectomy for bladder cancer and evaluate for potential model bias in sex and racial subgroups; and (2) to compare algorithm unfairness mitigation techniques to improve model fairness.

METHODS

We trained and compared various ML classification algorithms to predict 5-year survival after radical cystectomy using the National Cancer Database. The primary model performance metric was the F-score. The primary metric for model fairness was the equalized odds ratio (eOR). We compared 3 algorithm unfairness mitigation techniques to improve eOR.

RESULTS

We identified 16,481 patients; 23.1% (n=3800) were female, and 91.5% (n=15,080) were "White," 5% (n=832) were "Black," 2.3% (n=373) were "Hispanic," and 1.2% (n=196) were "Asian." The 5-year mortality rate was 75% (n=12,290). The best naive model was extreme gradient boosting (XGBoost), which had an F-score of 0.860 and eOR of 0.619. All unfairness mitigation techniques increased the eOR, with correlation remover showing the highest increase and resulting in a final eOR of 0.750. This mitigated model had F-scores of 0.86, 0.904, and 0.824 in the full, Black male, and Asian female test sets, respectively.

CONCLUSIONS

The ML model predicting survival after radical cystectomy exhibited bias across sex and racial subgroups. By using algorithm unfairness mitigation techniques, we improved algorithmic fairness as measured by the eOR. Our study highlights the role of not only evaluating for model bias but also actively mitigating such disparities to ensure equitable health care delivery. We also deployed the first web-based fair ML model for predicting survival after radical cystectomy.

摘要

背景

基于机器学习(ML)方法的预测模型在医疗保健领域正得到越来越多的开发和应用。然而,如果这些模型在人群亚组中表现出不同的性能,可能会存在偏差并被认为不公平。在膀胱癌中,不公平模型尤其令人担忧,因为在性别和种族亚组中已发现存在差异。

目的

本研究旨在(1)开发一个ML模型来预测膀胱癌根治性膀胱切除术后的生存率,并评估性别和种族亚组中潜在的模型偏差;(2)比较算法不公平性缓解技术以提高模型公平性。

方法

我们使用国家癌症数据库训练并比较了各种ML分类算法,以预测根治性膀胱切除术后的5年生存率。主要的模型性能指标是F分数。模型公平性的主要指标是均衡优势比(eOR)。我们比较了3种算法不公平性缓解技术以提高eOR。

结果

我们识别出16481例患者;23.1%(n = 3800)为女性,91.5%(n = 15080)为“白人”,5%(n = 832)为“黑人”,2.3%(n = 373)为“西班牙裔”,1.2%(n = 196)为“亚洲人”。5年死亡率为75%(n = 1229)。最佳的朴素模型是极端梯度提升(XGBoost),其F分数为0.860,eOR为0.619。所有不公平性缓解技术均提高了eOR,其中相关性消除器的提升幅度最大,最终eOR为0.750。这个经过缓解的模型在全量、黑人男性和亚洲女性测试集中的F分数分别为0.86、0.904和0.824。

结论

预测根治性膀胱切除术后生存率的ML模型在性别和种族亚组中表现出偏差。通过使用算法不公平性缓解技术,我们以eOR衡量提高了算法公平性。我们的研究不仅强调了评估模型偏差的作用,还强调了积极缓解此类差异以确保公平医疗服务的作用。我们还部署了首个基于网络的公平ML模型来预测根治性膀胱切除术后的生存率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/509e/11694706/e2cf77f74eba/medinform_v12i1e63289_fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验