Suppr超能文献

开发两种机器学习模型以预测原发性HER2-0乳腺癌向HER2低表达转移灶的转化:一项概念验证研究。

Development of two machine learning models to predict conversion from primary HER2-0 breast cancer to HER2-low metastases: a proof-of-concept study.

作者信息

Miglietta F, Collesei A, Vernieri C, Giarratano T, Giorgi C A, Girardi F, Griguolo G, Cacciatore M, Botticelli A, Vingiani A, Fotia G, Piacentini F, Massa D, Zanghì F, Marino M, Pruneri G, Fassan M, Dei Tos A P, Dieci M V, Guarneri V

机构信息

Oncology Unit 2, Istituto Oncologico Veneto (IOV) - IRCCS, Padova, Italy; Department of Surgery, Oncology and Gastroenterology, University of Padova, Padova, Italy.

Bioinformatics - Clinical Research Unit, Istituto Oncologico Veneto, IOV - IRCCS, Padova, Italy.

出版信息

ESMO Open. 2025 Jan;10(1):104087. doi: 10.1016/j.esmoop.2024.104087. Epub 2024 Dec 19.

Abstract

BACKGROUND

HER2-low expression has gained clinical relevance in breast cancer (BC) due to the availability of anti-HER2 antibody-drug conjugates for patients with HER2-low metastatic BC. The well-reported instability of HER2-low status during disease evolution highlights the need to identify patients with HER2-0 primary BC who may develop a HER2-low phenotype at relapse. In response to the urgency of maximizing treatment access, we utilized artificial intelligence to predict this occurrence.

PATIENTS AND METHODS

We included a large multicentric retrospective cohort of patients with BC who underwent tissue resampling at relapse. The dataset was preprocessed to address relevant issues such as missing data, feature abundance, and target class imbalance. We then trained two models: one focused on explainability [Extreme Gradient Boosting (XGBoost)] and another aimed at performance (an ensemble of XGBoost and support vector machine).

RESULTS

A total of 1200 patients were included in this study. Among 386 patients with HER2-0 primary BC and matched HER2 status at relapse, 42.5% (n = 157) converted to a HER2-low phenotype. The explainable model achieved a balanced accuracy of 58%, with a sensitivity of 53% and a specificity of 64%. The most important variables for this model were primary BC phenotype [mean Shapley value (SHAP) 0.540], primary BC histological type (SHAP 0.101), grade (SHAP 0.182), and sites of relapse (SHAP 0.008-0.213). The ensemble model had a balanced accuracy of 64%, with a sensitivity of 75% and a specificity of 53%.

CONCLUSIONS

This work represents one of the first proof-of-concept applications of machine learning models to predict a highly relevant phenomenon for drug access in modern BC oncology. Starting with an explainable model and subsequently integrating it with an ensemble approach enabled us to enhance performance while maintaining transparency, explainability, and intelligibility.

摘要

背景

由于抗HER2抗体药物偶联物可用于HER2低表达转移性乳腺癌患者,HER2低表达在乳腺癌(BC)中已具有临床相关性。疾病进展过程中HER2低表达状态的不稳定性已被充分报道,这凸显了识别HER2-0原发性乳腺癌患者的必要性,这些患者在复发时可能会出现HER2低表达表型。为了应对最大限度扩大治疗可及性的紧迫性,我们利用人工智能来预测这种情况的发生。

患者与方法

我们纳入了一个大型多中心回顾性队列,这些乳腺癌患者在复发时接受了组织重新取样。对数据集进行了预处理,以解决诸如数据缺失、特征丰富度和目标类不平衡等相关问题。然后我们训练了两个模型:一个侧重于可解释性[极端梯度提升(XGBoost)],另一个旨在提高性能(XGBoost和支持向量机的集成)。

结果

本研究共纳入1200例患者。在386例HER2-0原发性乳腺癌患者及复发时匹配的HER2状态中,42.5%(n = 157)转变为HER2低表达表型。可解释模型的平衡准确率为58%,敏感性为53%,特异性为64%。该模型最重要的变量是原发性乳腺癌表型[平均夏普利值(SHAP)0.540]、原发性乳腺癌组织学类型(SHAP 0.101)、分级(SHAP 0.182)和复发部位(SHAP 0.008 - 0.213)。集成模型的平衡准确率为64%,敏感性为75%,特异性为53%。

结论

这项工作代表了机器学习模型首次用于预测现代乳腺癌肿瘤学中与药物可及性高度相关现象的概念验证应用之一。从一个可解释模型开始,随后将其与集成方法相结合,使我们能够在保持透明度、可解释性和易懂性的同时提高性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4818/11730216/18302de92811/gr1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验