Department of Radiology, University of Pittsburgh, 4200 Fifth Ave, Pittsburgh, PA, 15260, USA.
Department of Radiology, Liaoning Cancer Hospital & Institute, 44 Xiaoheyan Rd, Dadong District, Shenyang City, Liaoning Province, 110042, China.
J Digit Imaging. 2018 Aug;31(4):387-392. doi: 10.1007/s10278-017-0022-2.
Mammographic breast density has been established as an independent risk marker for developing breast cancer. Breast density assessment is a routine clinical need in breast cancer screening and current standard is using the Breast Imaging and Reporting Data System (BI-RADS) criteria including four qualitative categories (i.e., fatty, scattered density, heterogeneously dense, or extremely dense). In each mammogram examination, a breast is typically imaged with two different views, i.e., the mediolateral oblique (MLO) view and cranial caudal (CC) view. The BI-RADS-based breast density assessment is a qualitative process made by visual observation of both the MLO and CC views by radiologists, where there is a notable inter- and intra-reader variability. In order to maintain consistency and accuracy in BI-RADS-based breast density assessment, gaining understanding on radiologists' reading behaviors will be educational. In this study, we proposed to leverage the newly emerged deep learning approach to investigate how the MLO and CC view images of a mammogram examination may have been clinically used by radiologists in coming up with a BI-RADS density category. We implemented a convolutional neural network (CNN)-based deep learning model, aimed at distinguishing the breast density categories using a large (15,415 images) set of real-world clinical mammogram images. Our results showed that the classification of density categories (in terms of area under the receiver operating characteristic curve) using MLO view images is significantly higher than that using the CC view. This indicates that most likely it is the MLO view that the radiologists have predominately used to determine the breast density BI-RADS categories. Our study holds a potential to further interpret radiologists' reading characteristics, enhance personalized clinical training to radiologists, and ultimately reduce reader variations in breast density assessment.
乳腺密度已被确定为乳腺癌发生的独立危险因素。乳腺密度评估是乳腺癌筛查的常规临床需求,目前的标准是使用乳腺影像报告和数据系统(BI-RADS)标准,包括四个定性类别(即脂肪、散在密度、不均匀密度或极度密度)。在每次乳房 X 光检查中,乳房通常会通过两种不同的视图进行成像,即内外斜位(MLO)视图和头尾位(CC)视图。BI-RADS 基于的乳腺密度评估是放射科医生通过观察 MLO 和 CC 视图进行的定性过程,存在明显的读者间和读者内变异性。为了在 BI-RADS 基于的乳腺密度评估中保持一致性和准确性,了解放射科医生的阅读行为将具有教育意义。在这项研究中,我们提出利用新兴的深度学习方法来研究放射科医生在确定 BI-RADS 密度类别时,如何在乳房 X 光检查的 MLO 和 CC 视图图像中进行临床应用。我们实施了基于卷积神经网络(CNN)的深度学习模型,旨在使用大量(15415 张图像)真实临床乳房 X 光图像来区分乳腺密度类别。我们的结果表明,使用 MLO 视图图像进行密度类别分类(以接收者操作特征曲线下的面积为指标)显著高于使用 CC 视图图像。这表明,放射科医生很可能主要使用 MLO 视图来确定乳腺密度 BI-RADS 类别。我们的研究有潜力进一步解释放射科医生的阅读特征,增强对放射科医生的个性化临床培训,最终减少乳腺密度评估中的读者变异。