Sutton Elizabeth J, Dashevsky Brittany Z, Oh Jung Hun, Veeraraghavan Harini, Apte Aditya P, Thakur Sunitha B, Morris Elizabeth A, Deasy Joseph O
Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, New York, USA.
Weill Cornell Medical College, Cornell University, New York, New York, USA.
J Magn Reson Imaging. 2016 Jul;44(1):122-9. doi: 10.1002/jmri.25119. Epub 2016 Jan 12.
To use features extracted from magnetic resonance (MR) images and a machine-learning method to assist in differentiating breast cancer molecular subtypes.
This retrospective Health Insurance Portability and Accountability Act (HIPAA)-compliant study received Institutional Review Board (IRB) approval. We identified 178 breast cancer patients between 2006-2011 with: 1) ERPR + (n = 95, 53.4%), ERPR-/HER2 + (n = 35, 19.6%), or triple negative (TN, n = 48, 27.0%) invasive ductal carcinoma (IDC), and 2) preoperative breast MRI at 1.5T or 3.0T. Shape, texture, and histogram-based features were extracted from each tumor contoured on pre- and three postcontrast MR images using in-house software. Clinical and pathologic features were also collected. Machine-learning-based (support vector machines) models were used to identify significant imaging features and to build models that predict IDC subtype. Leave-one-out cross-validation (LOOCV) was used to avoid model overfitting. Statistical significance was determined using the Kruskal-Wallis test.
Each support vector machine fit in the LOOCV process generated a model with varying features. Eleven out of the top 20 ranked features were significantly different between IDC subtypes with P < 0.05. When the top nine pathologic and imaging features were incorporated, the predictive model distinguished IDC subtypes with an overall accuracy on LOOCV of 83.4%. The combined pathologic and imaging model's accuracy for each subtype was 89.2% (ERPR+), 63.6% (ERPR-/HER2+), and 82.5% (TN). When only the top nine imaging features were incorporated, the predictive model distinguished IDC subtypes with an overall accuracy on LOOCV of 71.2%. The combined pathologic and imaging model's accuracy for each subtype was 69.9% (ERPR+), 62.9% (ERPR-/HER2+), and 81.0% (TN).
We developed a machine-learning-based predictive model using features extracted from MRI that can distinguish IDC subtypes with significant predictive power. J. Magn. Reson. Imaging 2016;44:122-129.
利用从磁共振(MR)图像中提取的特征和机器学习方法辅助鉴别乳腺癌分子亚型。
这项符合《健康保险流通与责任法案》(HIPAA)的回顾性研究获得了机构审查委员会(IRB)的批准。我们纳入了2006年至2011年间的178例乳腺癌患者,这些患者患有:1)雌激素受体/孕激素受体阳性(ERPR+,n = 95,53.4%)、雌激素受体/孕激素受体阴性/人表皮生长因子受体2阳性(ERPR-/HER2+,n = 35,19.6%)或三阴性(TN,n = 48,27.0%)浸润性导管癌(IDC),以及2)术前1.5T或3.0T乳腺MRI检查。使用内部软件从在对比剂注射前及注射后三张MR图像上勾勒出的每个肿瘤中提取形状、纹理和基于直方图的特征。还收集了临床和病理特征。基于机器学习(支持向量机)的模型用于识别重要的影像特征并构建预测IDC亚型的模型。采用留一法交叉验证(LOOCV)以避免模型过度拟合。使用Kruskal-Wallis检验确定统计学意义。
在LOOCV过程中,每个支持向量机拟合生成了具有不同特征的模型。在前20个排名特征中,有11个在IDC亚型之间存在显著差异,P < 0.05。当纳入前九个病理和影像特征时,预测模型在LOOCV上鉴别IDC亚型的总体准确率为83.4%。联合病理和影像模型对各亚型的准确率分别为89.2%(ERPR+)、63.6%(ERPR-/HER2+)和82.5%(TN)。当仅纳入前九个影像特征时,预测模型在LOOCV上鉴别IDC亚型的总体准确率为71.2%。联合病理和影像模型对各亚型的准确率分别为69.9%(ERPR+)、62.9%(ERPR-/HER2+)和81.0%(TN)。
我们利用从MRI中提取的特征开发了一种基于机器学习的预测模型,该模型能够以显著的预测能力鉴别IDC亚型。《磁共振成像杂志》2016年;44:122 - 129。