Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2310-2313. doi: 10.1109/EMBC46164.2021.9630556.
Early detection of breast cancer is a powerful tool towards decreasing its socioeconomic burden. Although, artificial intelligence (AI) methods have shown remarkable results towards this goal, their "black box" nature hinders their wide adoption in clinical practice. To address the need for AI guided breast cancer diagnosis, interpretability methods can be utilized. In this study, we used AI methods, i.e., Random Forests (RF), Neural Networks (NN) and Ensembles of Neural Networks (ENN), towards this goal and explained and optimized their performance through interpretability techniques, such as the Global Surrogate (GS) method, the Individual Conditional Expectation (ICE) plots and the Shapley values (SV). The Wisconsin Diagnostic Breast Cancer (WDBC) dataset of the open UCI repository was used for the training and evaluation of the AI algorithms. The best performance for breast cancer diagnosis was achieved by the proposed ENN (96.6% accuracy and 0.96 area under the ROC curve), and its predictions were explained by ICE plots, proving that its decisions were compliant with current medical knowledge and can be further utilized to gain new insights in the pathophysiological mechanisms of breast cancer. Feature selection based on features' importance according to the GS model improved the performance of the RF (leading the accuracy from 96.49% to 97.18% and the area under the ROC curve from 0.96 to 0.97) and feature selection based on features' importance according to SV improved the performance of the NN (leading the accuracy from 94.6% to 95.53% and the area under the ROC curve from 0.94 to 0.95). Compared to other approaches on the same dataset, our proposed models demonstrated state of the art performance while being interpretable.
早期发现乳腺癌是降低其社会经济负担的有力工具。尽管人工智能 (AI) 方法在这一目标上取得了显著的成果,但它们的“黑箱”性质阻碍了它们在临床实践中的广泛应用。为了解决 AI 引导的乳腺癌诊断的需求,可以利用可解释性方法。在这项研究中,我们使用了人工智能方法,即随机森林 (RF)、神经网络 (NN) 和神经网络集成 (ENN),并通过可解释性技术,如全局替代物 (GS) 方法、个体条件期望 (ICE) 图和 Shapley 值 (SV),对其性能进行解释和优化。我们使用了 UCI 开放数据集库中的威斯康星州诊断乳腺癌 (WDBC) 数据集来训练和评估 AI 算法。用于乳腺癌诊断的最佳性能是由所提出的 ENN 实现的(准确率为 96.6%,ROC 曲线下面积为 0.96),其预测结果通过 ICE 图进行了解释,证明其决策符合当前的医学知识,可以进一步用于深入了解乳腺癌的病理生理机制。根据 GS 模型对特征重要性进行特征选择提高了 RF 的性能(将准确率从 96.49%提高到 97.18%,ROC 曲线下面积从 0.96 提高到 0.97),根据 SV 对特征重要性进行特征选择提高了 NN 的性能(将准确率从 94.6%提高到 95.53%,ROC 曲线下面积从 0.94 提高到 0.95)。与同一数据集上的其他方法相比,我们提出的模型在具有可解释性的同时展示了最先进的性能。