School of Computer Science and Mathematics, Liverpool John Moores University, James Parsons Building, Byrom Street, Liverpool, L3 3AF, UK.
Institute of Visual Computing & Human-Centred Technology, TU Wien, Vienna, Austria.
Sci Rep. 2022 Aug 17;12(1):14004. doi: 10.1038/s41598-022-17894-6.
Breast cancer is the most commonly diagnosed female malignancy globally, with better survival rates if diagnosed early. Mammography is the gold standard in screening programmes for breast cancer, but despite technological advances, high error rates are still reported. Machine learning techniques, and in particular deep learning (DL), have been successfully used for breast cancer detection and classification. However, the added complexity that makes DL models so successful reduces their ability to explain which features are relevant to the model, or whether the model is biased. The main aim of this study is to propose a novel visualisation to help characterise breast cancer patients using Fisher Information Networks on features extracted from mammograms using a DL model. In the proposed visualisation, patients are mapped out according to their similarities and can be used to study new patients as a 'patient-like-me' approach. When applied to the CBIS-DDSM dataset, it was shown that it is a competitive methodology that can (i) facilitate the analysis and decision-making process in breast cancer diagnosis with the assistance of the FIN visualisations and 'patient-like-me' analysis, and (ii) help improve diagnostic accuracy and reduce overdiagnosis by identifying the most likely diagnosis based on clinical similarities with neighbouring patients.
乳腺癌是全球最常见的女性恶性肿瘤,如果早期诊断,生存率会更高。乳腺 X 线摄影术是乳腺癌筛查计划中的金标准,但尽管技术有所进步,仍有报道称其错误率较高。机器学习技术,特别是深度学习(DL),已成功用于乳腺癌的检测和分类。然而,使得 DL 模型如此成功的复杂性增加了其解释哪些特征与模型相关的能力,或者模型是否存在偏差。本研究的主要目的是提出一种新的可视化方法,使用从使用 DL 模型的乳腺 X 光片中提取的特征的 Fisher 信息网络来描述乳腺癌患者。在提出的可视化方法中,根据患者的相似性将患者映射出来,可以用作“类似我的患者”方法来研究新患者。当应用于 CBIS-DDSM 数据集时,结果表明,该方法是一种有竞争力的方法,可以 (i) 通过 FIN 可视化和“类似我的患者”分析来协助乳腺癌诊断中的分析和决策过程,以及 (ii) 帮助提高诊断准确性并减少过度诊断,方法是根据与邻近患者的临床相似性确定最可能的诊断。