Institute of Computer Technology, Universitat Politècnica de València, Camino de Vera, s/n, València, 46022 Spain.
National Center for Epidemiology, Carlos III Institute of Health, Monforte de lemos, 5, Madrid, 28029 Spain; Consortium for Biomedical Research in Epidemiology and Public Health (CIBER en Epidemiología y Salud Pública - CIBERESP), Carlos III Institute of Health, Monforte de Lemos, 5, Madrid, 28029 Spain.
Comput Methods Programs Biomed. 2019 Aug;177:123-132. doi: 10.1016/j.cmpb.2019.05.022. Epub 2019 May 22.
The breast dense tissue percentage on digital mammograms is one of the most commonly used markers for breast cancer risk estimation. Geometric features of dense tissue over the breast and the presence of texture structures contained in sliding windows that scan the mammograms may improve the predictive ability when combined with the breast dense tissue percentage.
A case/control study nested within a screening program covering 1563 women with craniocaudal and mediolateral-oblique mammograms (755 controls and the contralateral breast mammograms at the closest screening visit before cancer diagnostic for 808 cases) aging 45 to 70 from Comunitat Valenciana (Spain) was used to extract geometric and texture features. The dense tissue segmentation was performed using DMScan and validated by two experienced radiologists. A model based on Random Forests was trained several times varying the set of variables. A training dataset of 1172 patients was evaluated with a 10-stratified-fold cross-validation scheme. The area under the Receiver Operating Characteristic curve (AUC) was the metric for the predictive ability. The results were assessed by only considering the output after applying the model to the test set, which was composed of the remaining 391 patients.
The AUC score obtained by the dense tissue percentage (0.55) was compared to a machine learning-based classifier results. The classifier, apart from the percentage of dense tissue of both views, firstly included global geometric features such as the distance of dense tissue to the pectoral muscle, dense tissue eccentricity or the dense tissue perimeter, obtaining an accuracy of 0.56. By the inclusion of a global feature based on local histograms of oriented gradients, the accuracy of the classifier was significantly improved (0.61). The number of well-classified patients was improved up to 236 when it was 208.
Relative geometric features of dense tissue over the breast and histograms of standardized local texture features based on sliding windows scanning the whole breast improve risk prediction beyond the dense tissue percentage adjusted by geometrical variables. Other classifiers could improve the results obtained by the conventional Random Forests used in this study.
数字乳腺钼靶片中的乳腺致密组织百分比是乳腺癌风险估计中最常用的标志物之一。扫描乳腺钼靶片的致密组织的几何特征以及包含在滑动窗口中的纹理结构的存在,与乳腺致密组织百分比相结合,可能会提高预测能力。
本病例对照研究嵌套在一个筛查项目中,该项目覆盖了来自西班牙瓦伦西亚社区的 1563 名年龄在 45 至 70 岁的头尾位和内外斜位乳腺钼靶片(755 名对照和 808 例癌症诊断前最近筛查访问时对侧乳房的乳腺钼靶片),用于提取几何和纹理特征。致密组织分割使用 DMScan 进行,并由两名有经验的放射科医生进行验证。基于随机森林的模型经过多次训练,每次都改变变量集。使用 10 折分层交叉验证方案对 1172 例患者的训练数据集进行评估。接收者操作特征曲线下的面积(AUC)是预测能力的度量标准。通过仅考虑将模型应用于测试集后的输出来评估结果,该测试集由其余 391 例患者组成。
与基于机器学习的分类器的结果相比,致密组织百分比的 AUC 评分为 0.55。该分类器除了包含两个视图的致密组织百分比外,首先还包含全局几何特征,例如致密组织与胸肌的距离、致密组织的偏心度或致密组织的周长,其准确率为 0.56。通过包含基于局部方向梯度直方图的全局特征,分类器的准确率得到了显著提高(0.61)。当将准确率提高到 236 时,可正确分类的患者数量增加了 236 人。
与通过几何变量调整的致密组织百分比相比,乳腺上致密组织的相对几何特征和基于滑动窗口扫描整个乳房的标准化局部纹理特征的直方图可以提高风险预测。其他分类器可以提高本研究中使用的传统随机森林的结果。