Kumar L Ravi, Tata Ravi Kumar, Mahesh T R, Ali Endris Mohammed
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, 522302, India.
Department of Computer Science and Engineering, Faculty of Engineering and Technology, JAIN (Deemed-to-Be University), Bengaluru, 562112, India.
Sci Rep. 2025 Jul 2;15(1):22549. doi: 10.1038/s41598-025-06758-4.
Deep learning primarily operates on images which contain hidden patterns that are quantified through pixel intensities. Deep learning is used to analyze the image patterns and to recognize the objects. The detection process includes the creation of labels with bounding boxes, and it will be evaluated by using accuracy scores. Sometimes, there is a need to improve the accuracy score by changing the fine-tuning parameters or generating the synthetic data, which leads to reducing the gap in organizing the patterns. To address this, our research introduces the synthetic data generation for "enhanced shrimp detection using integrated augmentation (ESDIA)" approach to detect shrimps. The methodology is used to combine the shrimp images with different backgrounds in different phases: segmentation, dataset construction, and creating classifiers like faster recurrent convolution neural network (FRCNN) and you only look once (YOLOv7). The Enhanced Shrimp Detection algorithm generates the unique features set and also various parameters fetched from foundational classifiers. The segmentation phase can be done through grayscale conversion, edge detection, thresholding, morphological operations, and image compilation from myriad angles. To bolster our dataset volume and variance, the proposed system will generate the synthetic data with different variants of the backgrounds using generative adversarial networks. The precision of object (shrimp) detection rate can be gauged by the proposed model is witnessed a vital flow, with the mean average precision ranging from 80.53 to 89.13 which indicates the efficacy of ESDIA in elevating shrimp detection capabilities through DL paradigms.
深度学习主要对包含通过像素强度量化的隐藏模式的图像进行操作。深度学习用于分析图像模式并识别对象。检测过程包括创建带有边界框的标签,并将通过使用准确率得分进行评估。有时,需要通过更改微调参数或生成合成数据来提高准确率得分,这有助于缩小在组织模式方面的差距。为了解决这个问题,我们的研究引入了用于“使用集成增强的虾类检测(ESDIA)”方法的合成数据生成,以检测虾类。该方法用于在不同阶段将虾类图像与不同背景相结合:分割、数据集构建以及创建诸如更快的循环卷积神经网络(FRCNN)和你只看一次(YOLOv7)这样的分类器。增强型虾类检测算法生成独特的特征集以及从基础分类器获取的各种参数。分割阶段可以通过灰度转换、边缘检测、阈值处理、形态学操作以及从无数角度进行图像编译来完成。为了增加我们数据集的数量和多样性,所提出的系统将使用生成对抗网络生成具有不同背景变体的合成数据。所提出的模型所见证的对象(虾类)检测率的精度呈现出一个重要趋势,平均精度范围从80.53到89.13,这表明ESDIA在通过深度学习范式提升虾类检测能力方面的有效性。