Krasnov Daniel, Davis Dresya, Malott Keiran, Chen Yiting, Shi Xiaoping, Wong Augustine
Department of Computer Science, Mathematics, Physics and Statistics, University of British Columbia, Kelowna, BC V1V 1V7, Canada.
Faculty of Health and Social Development, School of Nursing, University of British Columbia, Kelowna, BC V1V 1V7, Canada.
Entropy (Basel). 2023 Jul 4;25(7):1021. doi: 10.3390/e25071021.
This paper reviews the potential use of fuzzy c-means clustering (FCM) and explores modifications to the distance function and centroid initialization methods to enhance image segmentation. The application of interest in the paper is the segmentation of breast tumours in mammograms. Breast cancer is the second leading cause of cancer deaths in Canadian women. Early detection reduces treatment costs and offers a favourable prognosis for patients. Classical methods, like mammograms, rely on radiologists to detect cancerous tumours, which introduces the potential for human error in cancer detection. Classical methods are labour-intensive, and, hence, expensive in terms of healthcare resources. Recent research supplements classical methods with automated mammogram analysis. The basic FCM method relies upon the Euclidean distance, which is not optimal for measuring non-spherical structures. To address these limitations, we review the implementation of a Mahalanobis-distance-based FCM (FCM-M). The three objectives of the paper are: (1) review FCM, FCM-M, and three centroid initialization algorithms in the literature, (2) illustrate the effectiveness of these algorithms in image segmentation, and (3) develop a Python package with the optimized algorithms to upload onto GitHub. Image analysis of the algorithms shows that using one of the three centroid initialization algorithms enhances the performance of FCM. FCM-M produced higher clustering accuracy and outlined the tumour structure better than basic FCM.
本文回顾了模糊c均值聚类(FCM)的潜在用途,并探讨了对距离函数和质心初始化方法的修改,以增强图像分割效果。本文所关注的应用是乳腺钼靶片中乳腺肿瘤的分割。乳腺癌是加拿大女性癌症死亡的第二大主要原因。早期检测可降低治疗成本,并为患者提供良好的预后。传统方法,如乳腺钼靶检查,依赖放射科医生来检测癌性肿瘤,这在癌症检测中存在人为误差的可能性。传统方法劳动强度大,因此在医疗资源方面成本高昂。最近的研究通过自动乳腺钼靶分析对传统方法进行了补充。基本的FCM方法依赖于欧几里得距离,这对于测量非球形结构并非最优。为解决这些局限性,我们回顾了基于马氏距离的FCM(FCM-M)的实现。本文的三个目标是:(1)回顾文献中的FCM、FCM-M和三种质心初始化算法,(2)说明这些算法在图像分割中的有效性,(3)开发一个包含优化算法的Python包并上传到GitHub。对这些算法的图像分析表明,使用三种质心初始化算法之一可提高FCM的性能。与基本的FCM相比,FCM-M产生了更高的聚类准确率,并且能更好地勾勒出肿瘤结构。