Suppr超能文献

开发基于超声的机器学习模型,用于准确区分硬化性腺病和浸润性导管癌。

Developing ultrasound-based machine learning models for accurate differentiation between sclerosing adenosis and invasive ductal carcinoma.

作者信息

Liu Guohao, Yang Na, Qu Yikun, Chen Guangxin, Wen Guiqiong, Li Gai, Deng Li, Mai Yuanqi

机构信息

Center of Scientific Research, Maoming People's Hospital, Maoming, China.

Department of Medical Imaging, Affiliated Hospital of Jilin Medical University, Jilin, China.

出版信息

Eur Radiol. 2025 Jun 28. doi: 10.1007/s00330-025-11777-w.

Abstract

OBJECTIVE

This study aimed to develop a machine learning model using breast ultrasound images to improve the non-invasive differential diagnosis between Sclerosing Adenosis (SA) and Invasive Ductal Carcinoma (IDC).

MATERIALS AND METHODS

2046 ultrasound images from 772 SA and IDC patients were collected, Regions of Interest (ROI) were delineated, and features were extracted. The dataset was split into training and test cohorts, and feature selection was performed by correlation coefficients and Recursive Feature Elimination. 10 classifiers with Grid Search and 5-fold cross-validation were applied during model training. Receiver Operating Characteristic (ROC) curve and Youden index were used to model evaluation. SHapley Additive exPlanations (SHAP) was employed for model interpretation. Another 224 ROIs of 84 patients from other hospitals were used for external validation.

RESULTS

For the ROI-level model, XGBoost with 18 features achieved an area under the curve (AUC) of 0.9758 (0.9654-0.9847) in the test cohort and 0.9906 (0.9805-0.9973) in the validation cohort. For the patient-level model, logistic regression with 9 features achieved an AUC of 0.9653 (0.9402-0.9859) in the test cohort and 0.9846 (0.9615-0.9978) in the validation cohort. The feature "Original shape Major Axis Length" was identified as the most important, with its value positively correlated with a higher likelihood of the sample being IDC. Feature contributions for specific ROIs were visualized as well.

CONCLUSION

We developed explainable, ultrasound-based machine learning models with high performance for differentiating SA and IDC, offering a potential non-invasive tool for improved differential diagnosis.

KEY POINTS

Question Accurately distinguishing between sclerosing adenosis (SA) and invasive ductal carcinoma (IDC) in a non-invasive manner has been a diagnostic challenge. Findings Explainable, ultrasound-based machine learning models with high performance were developed for differentiating SA and IDC, and validated well in external validation cohort. Critical relevance These models provide non-invasive tools to reduce misdiagnoses of SA and improve early detection for IDC.

摘要

目的

本研究旨在开发一种利用乳腺超声图像的机器学习模型,以改善硬化性腺病(SA)和浸润性导管癌(IDC)之间的非侵入性鉴别诊断。

材料与方法

收集了772例SA和IDC患者的2046幅超声图像,划定感兴趣区域(ROI)并提取特征。将数据集分为训练组和测试组,并通过相关系数和递归特征消除法进行特征选择。在模型训练过程中应用了10种采用网格搜索和五折交叉验证的分类器。采用受试者操作特征(ROC)曲线和尤登指数进行模型评估。采用SHapley加性解释(SHAP)进行模型解释。另外,使用了来自其他医院的84例患者的224个ROI进行外部验证。

结果

对于ROI水平的模型,具有18个特征的XGBoost在测试组中的曲线下面积(AUC)为0.9758(0.9654 - 0.9847),在验证组中为0.9906(0.9805 - 0.9973)。对于患者水平的模型,具有9个特征的逻辑回归在测试组中的AUC为0.9653(0.9402 - 0.9859),在验证组中为0.9846(0.9615 - 0.9978)。特征“原始形状长轴长度”被确定为最重要的特征,其值与样本为IDC的可能性较高呈正相关。还可视化了特定ROI的特征贡献。

结论

我们开发了基于超声的具有高性能的可解释机器学习模型,用于区分SA和IDC,为改善鉴别诊断提供了一种潜在的非侵入性工具。

关键点

问题以非侵入性方式准确区分硬化性腺病(SA)和浸润性导管癌(IDC)一直是一项诊断挑战。发现开发了基于超声的具有高性能的可解释机器学习模型,用于区分SA和IDC,并在外部验证队列中得到了良好验证。关键意义这些模型提供了非侵入性工具,以减少SA的误诊并改善IDC的早期检测。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验