Yang Jian, Hussein Kadir Dler
General Office of China Science and Technology Development Center for Chinese Medicine, Chaoyang District, Beijing, 100020, China.
Department of Statistics and Informatics, College of Administration and Economics, Salahaddin University, Erbil, Iraq.
J Cancer Res Clin Oncol. 2023 Nov;149(14):12605-12620. doi: 10.1007/s00432-023-05090-6. Epub 2023 Jul 14.
Studies in the field of better diagnosis of breast cancer using machine learning and data mining techniques have always been promising. A new diagnostic method can detect the characteristics of breast cancer in the early stages and help in better treatment. The aim of this study is to provide a method for early detection of breast cancer by reducing human errors based on data mining techniques in medicine using accurate and rapid screening.
The proposed method includes data pre-processing and image quality improvement in the first step. The second step consists of separating cancer cells from healthy breast tissue and removing outliers using image segmentation. Finally, a classification model is configured by combining deep neural networks in the third phase. The proposed ensemble classification model uses several effective features extracted from images and is based on majority vote. This model can be used as a screening system to diagnose the grade of invasive ductal carcinoma of the breast.
Evaluations have been done using two histopathological microscopic datasets including patients with invasive ductal carcinoma of the breast. With extracting high-level features with average accuracies of 92.65% and 93.34% in these two datasets, the proposed method has succeeded in quickly diagnosing and classifying breast cancer with high performance.
By combining deep neural networks and extracting features affecting breast cancer, the ability to diagnose with the highest accuracy is provided, and this is a step toward helping specialists and increasing the chances of patients' survival.
利用机器学习和数据挖掘技术更好地诊断乳腺癌领域的研究一直很有前景。一种新的诊断方法可以在早期阶段检测出乳腺癌的特征,并有助于更好地治疗。本研究的目的是基于医学数据挖掘技术,通过准确快速的筛查减少人为误差,提供一种早期检测乳腺癌的方法。
所提出的方法第一步包括数据预处理和图像质量改进。第二步包括通过图像分割将癌细胞与健康乳腺组织分离并去除异常值。最后,在第三阶段通过结合深度神经网络配置一个分类模型。所提出的集成分类模型使用从图像中提取的几个有效特征,并基于多数投票。该模型可用作筛查系统来诊断乳腺浸润性导管癌的分级。
使用包括乳腺浸润性导管癌患者的两个组织病理学微观数据集进行了评估。通过在这两个数据集中提取平均准确率分别为92.65%和93.34%的高级特征,所提出的方法成功地以高性能快速诊断和分类乳腺癌。
通过结合深度神经网络并提取影响乳腺癌的特征,提供了最高准确率的诊断能力,这是朝着帮助专家和增加患者生存机会迈出的一步。