C.L. Brown Department of Electrical and Computer Engineering, University of Virginia, Charlottesville, VA 22904, USA.
School of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China.
Bioinformatics. 2022 Sep 30;38(19):4598-4604. doi: 10.1093/bioinformatics/btac529.
Data-driven deep learning techniques usually require a large quantity of labeled training data to achieve reliable solutions in bioimage analysis. However, noisy image conditions and high cell density in bacterial biofilm images make 3D cell annotations difficult to obtain. Alternatively, data augmentation via synthetic data generation is attempted, but current methods fail to produce realistic images.
This article presents a bioimage synthesis and assessment workflow with application to augment bacterial biofilm images. 3D cyclic generative adversarial networks (GAN) with unbalanced cycle consistency loss functions are exploited in order to synthesize 3D biofilm images from binary cell labels. Then, a stochastic synthetic dataset quality assessment (SSQA) measure that compares statistical appearance similarity between random patches from random images in two datasets is proposed. Both SSQA scores and other existing image quality measures indicate that the proposed 3D Cyclic GAN, along with the unbalanced loss function, provides a reliably realistic (as measured by mean opinion score) 3D synthetic biofilm image. In 3D cell segmentation experiments, a GAN-augmented training model also presents more realistic signal-to-background intensity ratio and improved cell counting accuracy.
https://github.com/jwang-c/DeepBiofilm.
Supplementary data are available at Bioinformatics online.
数据驱动的深度学习技术通常需要大量的标记训练数据,才能在生物图像分析中得到可靠的解决方案。然而,嘈杂的图像条件和细菌生物膜图像中的高密度细胞使得 3D 细胞注释难以获得。或者,可以尝试通过合成数据生成进行数据扩充,但目前的方法无法生成逼真的图像。
本文提出了一种生物图像合成和评估工作流程,应用于扩充细菌生物膜图像。利用具有不平衡循环一致性损失函数的 3D 循环生成对抗网络(GAN),从二进制细胞标签中合成 3D 生物膜图像。然后,提出了一种随机合成数据集质量评估(SSQA)度量标准,用于比较两个数据集中随机图像的随机补丁之间的统计外观相似性。SSQA 分数和其他现有图像质量度量标准都表明,所提出的 3D 循环 GAN 与不平衡损失函数一起,提供了一种可靠逼真的(根据平均意见得分衡量)3D 合成生物膜图像。在 3D 细胞分割实验中,GAN 增强的训练模型还呈现出更逼真的信号与背景强度比,并提高了细胞计数的准确性。
https://github.com/jwang-c/DeepBiofilm。
补充数据可在生物信息学在线获得。