Department of Mathematics and Computer Science, University of La Rioja, Ed. CCT. C/ Madre de Dios 53, Logroño, 26006, Spain.
BMC Bioinformatics. 2019 Jun 13;20(1):323. doi: 10.1186/s12859-019-2931-1.
Deep learning techniques have been successfully applied to bioimaging problems; however, these methods are highly data demanding. An approach to deal with the lack of data and avoid overfitting is the application of data augmentation, a technique that generates new training samples from the original dataset by applying different kinds of transformations. Several tools exist to apply data augmentation in the context of image classification, but it does not exist a similar tool for the problems of localization, detection, semantic segmentation or instance segmentation that works not only with 2 dimensional images but also with multi-dimensional images (such as stacks or videos).
In this paper, we present a generic strategy that can be applied to automatically augment a dataset of images, or multi-dimensional images, devoted to classification, localization, detection, semantic segmentation or instance segmentation. The augmentation method presented in this paper has been implemented in the open-source package CLoDSA. To prove the benefits of using CLoDSA, we have employed this library to improve the accuracy of models for Malaria parasite classification, stomata detection, and automatic segmentation of neural structures.
CLoDSA is the first, at least up to the best of our knowledge, image augmentation library for object classification, localization, detection, semantic segmentation, and instance segmentation that works not only with 2 dimensional images but also with multi-dimensional images.
深度学习技术已成功应用于生物成像问题;然而,这些方法非常需要数据。一种解决数据缺乏和避免过拟合的方法是应用数据增强,该技术通过应用不同类型的变换从原始数据集生成新的训练样本。有几个工具可用于在图像分类的上下文中应用数据增强,但不存在类似的工具可用于本地化、检测、语义分割或实例分割等问题,该工具不仅可用于二维图像,还可用于多维图像(如堆栈或视频)。
在本文中,我们提出了一种通用策略,可应用于自动增强专门用于分类、本地化、检测、语义分割或实例分割的图像或多维图像数据集。本文中提出的数据增强方法已在开源软件包 CLoDSA 中实现。为了证明使用 CLoDSA 的好处,我们已经使用该库来提高疟疾寄生虫分类、气孔检测和神经结构自动分割模型的准确性。
CLoDSA 是第一个,至少在我们所知范围内,不仅适用于二维图像,而且适用于多维图像的对象分类、本地化、检测、语义分割和实例分割的图像增强库。