Biodata Mining Group, Bielefeld University, P.O. Box 100131, 33501 Bielefeld, Germany.
Sensors (Basel). 2022 Jul 19;22(14):5383. doi: 10.3390/s22145383.
Data augmentation is an established technique in computer vision to foster the generalization of training and to deal with low data volume. Most data augmentation and computer vision research are focused on everyday images such as traffic data. The application of computer vision techniques in domains like marine sciences has shown to be not that straightforward in the past due to special characteristics, such as very low data volume and class imbalance, because of costly manual annotation by human domain experts, and general low species abundances. However, the data volume acquired today with moving platforms to collect large image collections from remote marine habitats, like the deep benthos, for marine biodiversity assessment and monitoring makes the use of computer vision automatic detection and classification inevitable. In this work, we investigate the effect of data augmentation in the context of taxonomic classification in underwater, i.e., benthic images. First, we show that established data augmentation methods (i.e., geometric and photometric transformations) perform differently in marine image collections compared to established image collections like the Cityscapes dataset, showing everyday traffic images. Some of the methods even decrease the learning performance when applied to marine image collections. Second, we propose new data augmentation combination policies motivated by our observations and compare their effect to those proposed by the AutoAugment algorithm and can show that the proposed augmentation policy outperforms the AutoAugment results for marine image collections. We conclude that in the case of small marine image datasets, background knowledge, and heuristics should sometimes be applied to design an effective data augmentation method.
数据增强是计算机视觉中的一项成熟技术,用于促进训练的泛化并处理数据量低的问题。大多数数据增强和计算机视觉研究都集中在日常图像上,例如交通数据。过去,由于特殊的特征,例如非常低的数据量和类别不平衡,由于人类领域专家的昂贵手动注释,以及一般的低物种丰度,计算机视觉技术在海洋科学等领域的应用并不那么直接。然而,如今使用移动平台从远程海洋栖息地(如深海海底)收集大型图像集进行海洋生物多样性评估和监测,获取的数据量使得使用计算机视觉自动检测和分类成为必然。在这项工作中,我们研究了数据增强在水下分类(即海底图像)方面的效果。首先,我们表明,与 Cityscapes 数据集等已建立的图像集相比,在海洋图像集中,已建立的数据增强方法(例如几何和光度变换)的表现不同,显示出日常交通图像。其中一些方法甚至在应用于海洋图像集时会降低学习性能。其次,我们提出了新的数据增强组合策略,这些策略是基于我们的观察得出的,并将其效果与 AutoAugment 算法提出的策略进行比较,可以表明,对于海洋图像集,所提出的增强策略优于 AutoAugment 的结果。我们得出的结论是,在小型海洋图像数据集的情况下,有时应该应用背景知识和启发式方法来设计有效的数据增强方法。