OFIDA：基于注意力驱动图卷积网络的目标聚焦图像数据增强。

OFIDA: Object-focused image data augmentation with attention-driven graph convolutional networks.

机构信息

School of Electronics and Information Engineering, Taiyuan University of Science and Technology, Taiyuan, Shanxi, China.

出版信息

PLoS One. 2024 May 2;19(5):e0302124. doi: 10.1371/journal.pone.0302124. eCollection 2024.

DOI:10.1371/journal.pone.0302124

PMID:38696446

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11065271/

Abstract

Image data augmentation plays a crucial role in data augmentation (DA) by increasing the quantity and diversity of labeled training data. However, existing methods have limitations. Notably, techniques like image manipulation, erasing, and mixing can distort images, compromising data quality. Accurate representation of objects without confusion is a challenge in methods like auto augment and feature augmentation. Preserving fine details and spatial relationships also proves difficult in certain techniques, as seen in deep generative models. To address these limitations, we propose OFIDA, an object-focused image data augmentation algorithm. OFIDA implements one-to-many enhancements that not only preserve essential target regions but also elevate the authenticity of simulating real-world settings and data distributions. Specifically, OFIDA utilizes a graph-based structure and object detection to streamline augmentation. Specifically, by leveraging graph properties like connectivity and hierarchy, it captures object essence and context for improved comprehension in real-world scenarios. Then, we introduce DynamicFocusNet, a novel object detection algorithm built on the graph framework. DynamicFocusNet merges dynamic graph convolutions and attention mechanisms to flexibly adjust receptive fields. Finally, the detected target images are extracted to facilitate one-to-many data augmentation. Experimental results validate the superiority of our OFIDA method over state-of-the-art methods across six benchmark datasets.

摘要

图像数据增强在数据增强（DA）中起着至关重要的作用，它可以增加标记训练数据的数量和多样性。然而，现有的方法存在局限性。特别是，像图像操纵、擦除和混合这样的技术会扭曲图像，从而影响数据质量。在像自动增强和特征增强这样的方法中，准确地表示没有混淆的物体是一个挑战。在某些技术中，如深度生成模型，精细的细节和空间关系的保留也被证明是困难的。为了解决这些限制，我们提出了 OFIDA，一种面向对象的图像数据增强算法。OFIDA 实现了一对多的增强，不仅可以保留目标区域的关键部分，还可以提高模拟真实世界场景和数据分布的真实性。具体来说，OFIDA 利用基于图的结构和对象检测来简化增强。具体来说，通过利用图的连通性和层次结构等属性，它可以捕获对象的本质和上下文，从而提高在真实场景中的理解能力。然后，我们引入了 DynamicFocusNet，这是一种基于图框架的新的对象检测算法。DynamicFocusNet 融合了动态图卷积和注意力机制，以灵活调整感受野。最后，提取检测到的目标图像，以方便一对多的数据增强。实验结果验证了我们的 OFIDA 方法在六个基准数据集上优于最先进方法的优越性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/017e/11065271/24faedf4ed3d/pone.0302124.g001.jpg

相似文献

OFIDA: Object-focused image data augmentation with attention-driven graph convolutional networks.OFIDA：基于注意力驱动图卷积网络的目标聚焦图像数据增强。

PLoS One. 2024 May 2;19(5):e0302124. doi: 10.1371/journal.pone.0302124. eCollection 2024.

Steganographer detection via a similarity accumulation graph convolutional network.基于相似性累积图卷积网络的隐写分析检测。

Neural Netw. 2021 Apr;136:97-111. doi: 10.1016/j.neunet.2020.12.026. Epub 2021 Jan 4.

Robust Data Augmentation Generative Adversarial Network for Object Detection.用于目标检测的鲁棒数据增强生成对抗网络。

Sensors (Basel). 2022 Dec 23;23(1):157. doi: 10.3390/s23010157.

Generative Adversarial Networks in Medical Image augmentation: A review.生成对抗网络在医学图像增强中的应用：综述。

Comput Biol Med. 2022 May;144:105382. doi: 10.1016/j.compbiomed.2022.105382. Epub 2022 Mar 5.

Attention-Driven Graph Neural Network for Deep Face Super-Resolution.注意力驱动的图神经网络在深度人脸超分辨率中的应用。

IEEE Trans Image Process. 2022;31:6455-6470. doi: 10.1109/TIP.2022.3212311. Epub 2022 Oct 21.

A Data Augmentation Methodology to Reduce the Class Imbalance in Histopathology Images.一种减少组织病理学图像中类别不平衡的的数据增强方法。

J Imaging Inform Med. 2024 Aug;37(4):1767-1782. doi: 10.1007/s10278-024-01018-9. Epub 2024 Mar 14.

Small object detection algorithm incorporating swin transformer for tea buds.用于茶芽的融合 Swin 变换小目标检测算法。

PLoS One. 2024 Mar 21;19(3):e0299902. doi: 10.1371/journal.pone.0299902. eCollection 2024.

Dual Encoder-Based Dynamic-Channel Graph Convolutional Network With Edge Enhancement for Retinal Vessel Segmentation.基于双编码器的动态通道图卷积网络与边缘增强的视网膜血管分割。

IEEE Trans Med Imaging. 2022 Aug;41(8):1975-1989. doi: 10.1109/TMI.2022.3151666. Epub 2022 Aug 1.

Self-supervised structural similarity-based convolutional neural network for cardiac diffusion tensor image denoising.基于自监督结构相似性的卷积神经网络用于心脏扩散张量图像去噪

Med Phys. 2023 Oct;50(10):6137-6150. doi: 10.1002/mp.16301. Epub 2023 Apr 17.

PolypMixNet: Enhancing semi-supervised polyp segmentation with polyp-aware augmentation.PolypMixNet：利用息肉感知增强进行半监督息肉分割。

Comput Biol Med. 2024 Mar;170:108006. doi: 10.1016/j.compbiomed.2024.108006. Epub 2024 Jan 15.

引用本文的文献

Comparative Analysis of Conventional and Focused Data Augmentation Methods in Rib Fracture Detection in CT Images.CT图像中肋骨骨折检测中传统与聚焦数据增强方法的对比分析

Diagnostics (Basel). 2025 Aug 1;15(15):1938. doi: 10.3390/diagnostics15151938.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

OFIDA：基于注意力驱动图卷积网络的目标聚焦图像数据增强。

OFIDA: Object-focused image data augmentation with attention-driven graph convolutional networks.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献