Department of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur, Bangladesh.
Comput Math Methods Med. 2022 Mar 7;2022:1633858. doi: 10.1155/2022/1633858. eCollection 2022.
Breast cancer is a global epidemic, responsible for one of the highest mortality rates among women. Ultrasound imaging is becoming a popular tool for breast cancer screening, and quantitative ultrasound (QUS) techniques are being increasingly applied by researchers in an attempt to characterize breast tissue. Several different quantitative descriptors for breast cancer have been explored by researchers. This study proposes a breast tumor classification system using the three major types of intratumoral QUS descriptors which can be extracted from ultrasound radiofrequency (RF) data: spectral features, envelope statistics features, and texture features. A total of 16 features were extracted from ultrasound RF data across two different datasets, of which one is balanced and the other is severely imbalanced. The balanced dataset contains RF data of 100 patients with breast tumors, of which 48 are benign and 52 are malignant. The imbalanced dataset contains RF data of 130 patients with breast tumors, of which 104 are benign and 26 are malignant. Holdout validation was used to split the balanced dataset into 60% training and 40% testing sets. Feature selection was applied on the training set to identify the most relevant subset for the classification of benign and malignant breast tumors, and the performance of the features was evaluated on the test set. A maximum classification accuracy of 95% and an area under the receiver operating characteristic curve (AUC) of 0.968 was obtained on the test set. The performance of the identified relevant features was further validated on the imbalanced dataset, where a hybrid resampling strategy was firstly utilized to create an optimal balance between benign and malignant samples. A maximum classification accuracy of 93.01%, sensitivity of 94.62%, specificity of 91.4%, and AUC of 0.966 were obtained. The results indicate that the identified features are able to distinguish between benign and malignant breast lesions very effectively, and the combination of the features identified in this research has the potential to be a significant tool in the noninvasive rapid and accurate diagnosis of breast cancer.
乳腺癌是一种全球性的流行病,是导致女性死亡率最高的疾病之一。超声成像是一种用于乳腺癌筛查的流行工具,研究人员越来越多地应用定量超声(QUS)技术来对乳腺组织进行特征描述。研究人员已经探索了几种不同的用于乳腺癌的定量描述符。本研究提出了一种使用从超声射频(RF)数据中提取的三种主要类型的肿瘤内 QUS 描述符的乳腺肿瘤分类系统:谱特征、包络统计特征和纹理特征。总共从两个不同的数据集的超声 RF 数据中提取了 16 个特征,其中一个数据集是平衡的,另一个数据集是严重不平衡的。平衡数据集包含 100 名乳腺肿瘤患者的 RF 数据,其中 48 例为良性,52 例为恶性。不平衡数据集包含 130 名乳腺肿瘤患者的 RF 数据,其中 104 例为良性,26 例为恶性。采用保留验证法将平衡数据集分为 60%的训练集和 40%的测试集。在训练集上应用特征选择以确定用于良性和恶性乳腺肿瘤分类的最相关子集,并在测试集上评估特征的性能。在测试集上获得了 95%的最大分类准确率和 0.968 的接收者操作特征曲线(AUC)下面积。在不平衡数据集上进一步验证了所识别的相关特征的性能,首先利用混合重采样策略在良性和恶性样本之间创建最佳平衡。获得了 93.01%的最大分类准确率、94.62%的灵敏度、91.4%的特异性和 0.966 的 AUC。结果表明,所识别的特征能够非常有效地区分良性和恶性乳腺病变,并且本研究中所识别的特征的组合有可能成为一种用于非侵入性快速准确诊断乳腺癌的重要工具。