Myers Jaden, Najafian Keyhan, Maleki Farhad, Ovens Katie
Department of Computer Science, University of Calgary, 2500 University Drive NW, Calgary, AB T2N 1N4, Canada.
Department of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5A2, Canada.
J Imaging. 2024 Jun 21;10(7):152. doi: 10.3390/jimaging10070152.
Deep learning models have been used for a variety of image processing tasks. However, most of these models are developed through supervised learning approaches, which rely heavily on the availability of large-scale annotated datasets. Developing such datasets is tedious and expensive. In the absence of an annotated dataset, synthetic data can be used for model development; however, due to the substantial differences between simulated and real data, a phenomenon referred to as domain gap, the resulting models often underperform when applied to real data. In this research, we aim to address this challenge by first computationally simulating a large-scale annotated dataset and then using a generative adversarial network (GAN) to fill the gap between simulated and real images. This approach results in a synthetic dataset that can be effectively utilized to train a deep-learning model. Using this approach, we developed a realistic annotated synthetic dataset for wheat head segmentation. This dataset was then used to develop a deep-learning model for semantic segmentation. The resulting model achieved a Dice score of 83.4% on an internal dataset and Dice scores of 79.6% and 83.6% on two external datasets from the Global Wheat Head Detection datasets. While we proposed this approach in the context of wheat head segmentation, it can be generalized to other crop types or, more broadly, to images with dense, repeated patterns such as those found in cellular imagery.
深度学习模型已被用于各种图像处理任务。然而,这些模型大多是通过监督学习方法开发的,这严重依赖于大规模带注释数据集的可用性。开发这样的数据集既繁琐又昂贵。在没有带注释数据集的情况下,可以使用合成数据进行模型开发;然而,由于模拟数据和真实数据之间存在显著差异,即所谓的域差距现象,所得模型在应用于真实数据时往往表现不佳。在本研究中,我们旨在通过首先通过计算模拟一个大规模带注释数据集,然后使用生成对抗网络(GAN)来填补模拟图像和真实图像之间的差距来应对这一挑战。这种方法产生了一个可以有效用于训练深度学习模型的合成数据集。使用这种方法,我们为小麦穗分割开发了一个逼真的带注释合成数据集。然后使用这个数据集开发了一个用于语义分割的深度学习模型。所得模型在一个内部数据集上的Dice分数为83.4%,在来自全球小麦穗检测数据集的两个外部数据集上的Dice分数分别为79.6%和83.6%。虽然我们在小麦穗分割的背景下提出了这种方法,但它可以推广到其他作物类型,或者更广泛地说,推广到具有密集、重复模式的图像,如细胞图像中发现的那些图像。