Yedjou Clement G, Tchounwou Solange S, Aló Richard A, Elhag Rashid, Mochona BereKet, Latinwo Lekan
Department of Biological Sciences, College of Science and Technology, Florida Agricultural and Mechanical University, 1610 S. Martin Luther King Blvd, Tallahassee, FL 32307, United States.
Department of Pathology and Laboratory Medicine. School of Medicine, Tulane University, 1430 Tulane Avenue, New Orleans, LA, 70112, United States.
Int J Sci Acad Res. 2021 Jan;2(1):3081-3086. Epub 2021 Oct 30.
Breast cancer continues to be the most frequent cancer in females, affecting about one in 8 women and causing the highest number of cancer-related deaths in females worldwide despite remarkable progress in early diagnosis, screening, and patient management. All breast lesions are not malignant, and all the benign lesions do not progress to cancer. However, the accuracy of diagnosis can be increased by a combination or preoperative tests such as physical examination, mammography, fine-needle aspiration cytology, and core needle biopsy. Despite some limitations, these procedures are more accurate, reliable, and acceptable, when compared with a single adopted diagnostic procedure. Recent studies have shown that breast cancer can be accurately predicted and diagnosed using machine learning (ML) technology. The objective of this study was to explore the application of ML approaches to classify breast cancer based on feature values generated from a digitized image of a fine-needle aspiration (FNA) of a breast mass. To achieve this objective, we used ML algorithms, collected a scientific dataset of 569 breast cancer patients from Kaggle (https://www.kaggle.com/uciml/breast-cancer-wisconsin-data), analyze and interpreted the data based on ten real-valued features of a breast mass FNA including the radius, texture, perimeter, area, smoothness, compactness, concavity, concave points, symmetry, and fractal dimension. Among the 569 patients tested, 63% were diagnosed with benign breast cancer and 37% were diagnosed with malignant breast cancer. Benign tumors grow slowly and do not spread while malignant tumors grow rapidly and spread to other parts of the body.
乳腺癌仍然是女性中最常见的癌症,尽管在早期诊断、筛查和患者管理方面取得了显著进展,但全球约八分之一的女性受其影响,且乳腺癌导致的女性癌症相关死亡人数最多。并非所有乳腺病变都是恶性的,也不是所有良性病变都会发展为癌症。然而,通过体格检查、乳腺X线摄影、细针穿刺细胞学检查和粗针活检等术前检查相结合,可以提高诊断的准确性。尽管存在一些局限性,但与单一采用的诊断程序相比,这些程序更准确、可靠且可接受。最近的研究表明,使用机器学习(ML)技术可以准确预测和诊断乳腺癌。本研究的目的是探索基于乳腺肿块细针穿刺(FNA)数字化图像生成的特征值,应用ML方法对乳腺癌进行分类。为实现这一目标,我们使用了ML算法,从Kaggle(https://www.kaggle.com/uciml/breast-cancer-wisconsin-data)收集了569例乳腺癌患者的科学数据集,并基于乳腺肿块FNA的十个实值特征(包括半径、纹理、周长、面积、光滑度、致密性、凹陷度、凹陷点数、对称性和分形维数)对数据进行分析和解释。在测试的569例患者中,63%被诊断为良性乳腺癌,37%被诊断为恶性乳腺癌。良性肿瘤生长缓慢且不扩散,而恶性肿瘤生长迅速并扩散到身体其他部位。