Department of Computer Science and Engineering, Indian Institute of Technology Patna, India.
Department of Computer Science and Engineering, Indian Institute of Technology Patna, India.
Comput Methods Programs Biomed. 2022 May;218:106716. doi: 10.1016/j.cmpb.2022.106716. Epub 2022 Feb 28.
Medical image classification problems are frequently constrained by the availability of datasets. "Data augmentation" has come as a data enhancement and data enrichment solution to the challenge of limited data. Traditionally data augmentation techniques are based on linear and label preserving transformations; however, recent works have demonstrated that even non-linear, non-label preserving techniques can be unexpectedly effective. This paper proposes a non-linear data augmentation technique for the medical domain and explores its results.
This paper introduces "Crossover technique", a new data augmentation technique for Convolutional Neural Networks in Medical Image Classification problems. Our technique synthesizes a pair of samples by applying two-point crossover on the already available training dataset. By this technique, we create N new samples from N training samples. The proposed crossover based data augmentation technique, although non-label preserving, has performed significantly better in terms of increased accuracy and reduced loss for all the tested datasets over varied architectures.
The proposed method was tested on three publicly available medical datasets with various network architectures. For the mini-MIAS database of mammograms, our method improved the accuracy by 1.47%, achieving 80.15% using VGG-16 architecture. Our method works fine for both gray-scale as well as RGB images, as on the PH2 database for Skin Cancer, it improved the accuracy by 3.57%, achieving 85.71% using VGG-19 architecture. In addition, our technique improved accuracy on the brain tumor dataset by 0.40%, achieving 97.97% using VGG-16 architecture.
The proposed novel crossover technique for training the Convolutional Neural Network (CNN) is painless to implement by applying two-point crossover on two images to form new images. The method would go a long way in tackling the challenges of limited datasets and problems of class imbalances in medical image analysis. Our code is available at https://github.com/rishiraj-cs/Crossover-augmentation.
医学图像分类问题经常受到数据集可用性的限制。“数据增强”是解决数据有限的一种数据增强和数据丰富化的解决方案。传统的数据增强技术基于线性和标签保持变换;然而,最近的工作表明,即使是非线性、非标签保持的技术也可能具有出人意料的效果。本文提出了一种用于医学领域的非线性数据增强技术,并探讨了其结果。
本文介绍了“交叉技术”,这是一种用于卷积神经网络在医学图像分类问题中的新的数据增强技术。我们的技术通过在已有的训练数据集上应用两点交叉来合成一对样本。通过这种技术,我们可以从 N 个训练样本中创建 N 个新样本。虽然所提出的基于交叉的数据增强技术不是标签保持的,但在各种架构的所有测试数据集上,它在提高准确性和降低损失方面都表现得更好。
该方法在三个公开的医学数据集上进行了测试,使用了各种网络架构。对于乳腺 X 线摄影的 mini-MIAS 数据库,我们的方法将准确率提高了 1.47%,使用 VGG-16 架构达到了 80.15%。我们的方法对灰度和 RGB 图像都适用,在皮肤癌的 PH2 数据库中,使用 VGG-19 架构将准确率提高了 3.57%,达到了 85.71%。此外,我们的技术还提高了脑肿瘤数据集的准确率,使用 VGG-16 架构达到了 0.40%,达到了 97.97%。
本文提出的用于训练卷积神经网络(CNN)的新交叉技术通过在两张图像上应用两点交叉来形成新图像,实现起来非常简单。该方法将在解决医学图像分析中数据集有限和类不平衡问题方面发挥重要作用。我们的代码可在 https://github.com/rishiraj-cs/Crossover-augmentation 上获得。