Urology Research Center, Tehran University of Medical Sciences, Tehran, Iran.
Gleneagles Hospital Kuala Lumpur, Imaging Department, Jalan Ampang, Kampung Berembang, 50450 Kuala Lumpur, Malaysia.
Eur J Radiol. 2022 Dec;157:110591. doi: 10.1016/j.ejrad.2022.110591. Epub 2022 Nov 5.
To develop and validate a machine learning (ML) model for the classification of breast lesions on ultrasound images.
In the present study, three separate data cohorts containing 1288 breast lesions from three countries (Malaysia, Iran, and Turkey) were utilized for MLmodel development and external validation. The model was trained on ultrasound images of 725 breast lesions, and validation was done separately on the remaining data. An expert radiologist and a radiology resident classified the lesions based on the BI-RADS lexicon. Thirteen morphometric features were selected from a contour of the lesion and underwent a three-step feature selection process. Five features were chosen to be fed into the model separately and combined with the imaging signs mentioned in the BI-RADS reference guide. A support vector classifier was trained and optimized.
The diagnostic profile of the model with various input data was compared to the expert radiologist and radiology resident. The agreement of each approach with histopathologic specimens was also determined. Based on BI-RADS and morphometric features, the model achieved an area under the receiver operating characteristic (ROC) curve (AUC) of 0.885, which is higher than the expert radiologist and radiology resident performances with AUC of 0.814 and 0.632, respectively in all cohorts. DeLong's test also showed that the AUC of the ML protocol was significantly different from that of the expert radiologist (ΔAUCs = 0.071, 95%CI: (0.056, 0.086), P = 0.005).
These results support the possible role of morphometric features in enhancing the already well-excepted classification schemes.
开发和验证一种用于超声图像中乳腺病变分类的机器学习 (ML) 模型。
本研究利用来自三个国家(马来西亚、伊朗和土耳其)的三个独立数据队列共 1288 个乳腺病变,用于 ML 模型的开发和外部验证。模型在 725 个乳腺病变的超声图像上进行训练,并分别对其余数据进行验证。一名专家放射科医生和一名放射科住院医师根据 BI-RADS 词汇对病变进行分类。从病变轮廓中选择了 13 个形态特征,并经过了三步特征选择过程。选择了 5 个特征分别输入到模型中,并与 BI-RADS 参考指南中提到的影像学征象相结合。训练和优化了支持向量分类器。
比较了具有不同输入数据的模型的诊断特征与专家放射科医生和放射科住院医师的诊断特征。还确定了每种方法与组织病理学标本的一致性。基于 BI-RADS 和形态特征,模型在所有队列中的接收者操作特征(ROC)曲线下面积(AUC)为 0.885,高于专家放射科医生和放射科住院医师的 AUC,分别为 0.814 和 0.632。DeLong 检验还表明,ML 方案的 AUC 与专家放射科医生的 AUC 有显著差异(ΔAUCs=0.071,95%CI:(0.056,0.086),P=0.005)。
这些结果支持形态特征在增强已被广泛接受的分类方案方面的可能作用。