Deng Chenxun, Li Dafang, Ji Lin, Zhang Chengyang, Li Baican, Yan Hongying, Zheng Jiyuan, Wang Lifeng, Zhang Junguo
School of Technology, Beijing Forestry University, Beijing, 100083, PR China; Research Center for Biodiversity Intelligent Monitoring, Beijing Forestry University, Beijing, 100083, PR China; State Key Laboratory of Efficient Production of Forest Resources, Beijing Forestry University, Beijing, 100083, PR China.
Neural Netw. 2025 Jan;181:106794. doi: 10.1016/j.neunet.2024.106794. Epub 2024 Oct 15.
Long-tailed data distributions have been a major challenge for the practical application of deep learning. Information augmentation aims to expand long-tailed data into a uniform distribution, which provides a feasible way to mitigate the data starvation of underrepresented classes. However, most existing augmentation methods face two significant challenges: (1) limited diversity in generated samples, and (2) the adverse effect of generated negative samples on downstream classification performance. In this paper, we propose a novel information augmentation method, named ChatDiff, that provides diverse positive samples for underrepresented classes and eliminates generated negative samples. Specifically, we start with a prompt template to extract textual prior knowledge from the ChatGPT-3.5 model, enhancing the feature space for underrepresented classes. Using this prior knowledge, a conditional diffusion model then generates semantically rich image samples for the tail classes. Moreover, the proposed ChatDiff leverages a CLIP-based discriminator to screen out and remove generated negative samples. This process prevents the neural network from learning invalid or erroneous features and further improves long-tailed classification performance. Comprehensive experiments conducted on long-tailed benchmarks such as CIFAR10-LT, CIFAR100-LT, ImageNet-LT, and iNaturalist 2018 validate the effectiveness of our ChatDiff method.
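The CLIP-based screening step described in the abstract can be sketched as follows: embed each generated image and the tail-class text prompt, then keep only samples whose image-text cosine similarity exceeds a threshold. The sketch below is a minimal illustration with NumPy, using placeholder embeddings in place of real CLIP encoder outputs; the function name `screen_generated_samples` and the threshold value 0.25 are assumptions for illustration, not details from the paper.

```python
import numpy as np

def screen_generated_samples(image_embs, text_emb, threshold=0.25):
    """Keep generated samples whose cosine similarity to the
    class prompt embedding exceeds `threshold` (CLIP-style filter)."""
    # L2-normalize so the dot product equals cosine similarity.
    image_embs = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    text_emb = text_emb / np.linalg.norm(text_emb)
    sims = image_embs @ text_emb      # cosine similarity per generated sample
    keep = sims >= threshold          # True = retained positive sample
    return keep, sims

# Placeholder embeddings standing in for CLIP image/text encoder outputs.
rng = np.random.default_rng(0)
text_emb = rng.normal(size=512)                     # class prompt embedding
pos = text_emb + 0.1 * rng.normal(size=(3, 512))    # on-class samples, near prompt
neg = rng.normal(size=(2, 512))                     # off-class noise, far from prompt
keep, sims = screen_generated_samples(np.vstack([pos, neg]), text_emb)
print(keep)  # the three on-class samples pass; the two noise samples are filtered out
```

In practice the embeddings would come from a pretrained CLIP image and text encoder, and the threshold would be tuned per dataset.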