Department of Radiology, Kangwon National University College of Medicine, 192-1 Hyoja 2-dong, Chuncheon, Kangwon-do, 200-701, Republic of Korea.
J Digit Imaging. 2012 Oct;25(5):599-606. doi: 10.1007/s10278-012-9457-7.
To determine which Breast Imaging Reporting and Data System (BI-RADS) descriptors for ultrasound are predictors for breast cancer using logistic regression (LR) analysis in conjunction with interobserver variability between breast radiologists, and to compare the performance of artificial neural network (ANN) and LR models in differentiation of benign and malignant breast masses. Five breast radiologists retrospectively reviewed 140 breast masses and described each lesion using BI-RADS lexicon and categorized final assessments. Interobserver agreements between the observers were measured by kappa statistics. The radiologists' responses for BI-RADS were pooled. The data were divided randomly into train (n = 70) and test sets (n = 70). Using train set, optimal independent variables were determined by using LR analysis with forward stepwise selection. The LR and ANN models were constructed with the optimal independent variables and the biopsy results as dependent variable. Performances of the models and radiologists were evaluated on the test set using receiver-operating characteristic (ROC) analysis. Among BI-RADS descriptors, margin and boundary were determined as the predictors according to stepwise LR showing moderate interobserver agreement. Area under the ROC curves (AUC) for both of LR and ANN were 0.87 (95% CI, 0.77-0.94). AUCs for the five radiologists ranged 0.79-0.91. There was no significant difference in AUC values among the LR, ANN, and radiologists (p > 0.05). Margin and boundary were found as statistically significant predictors with good interobserver agreement. Use of the LR and ANN showed similar performance to that of the radiologists for differentiation of benign and malignant breast masses.
为了确定使用逻辑回归(LR)分析联合乳腺放射科医生之间的观察者间变异性,哪些乳腺成像报告和数据系统(BI-RADS)超声描述符可用于预测乳腺癌,并比较人工神经网络(ANN)和 LR 模型在区分良性和恶性乳腺肿块中的性能。五位乳腺放射科医生回顾性地检查了 140 个乳腺肿块,使用 BI-RADS 词汇描述每个病变,并对最终评估进行分类。观察者之间的观察者间协议通过 Kappa 统计进行测量。汇集了放射科医生对 BI-RADS 的反应。数据随机分为训练集(n=70)和测试集(n=70)。使用训练集,通过使用向前逐步选择的 LR 分析确定最佳独立变量。使用最佳独立变量和活检结果作为因变量构建 LR 和 ANN 模型。使用测试集通过接收者操作特征(ROC)分析评估模型和放射科医生的性能。在 BI-RADS 描述符中,根据逐步 LR 显示出中度观察者间一致性,确定边缘和边界为预测因子。LR 和 ANN 的 ROC 曲线下面积(AUC)均为 0.87(95%CI,0.77-0.94)。五位放射科医生的 AUC 范围为 0.79-0.91。LR、ANN 和放射科医生的 AUC 值之间没有统计学差异(p>0.05)。边缘和边界被发现是具有良好观察者间一致性的统计学上显著的预测因子。LR 和 ANN 的使用对于区分良性和恶性乳腺肿块的性能与放射科医生相似。