• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChatDiff:一种基于ChatGPT的用于长尾分类的扩散模型。

ChatDiff: A ChatGPT-based diffusion model for long-tailed classification.

作者信息

Deng Chenxun, Li Dafang, Ji Lin, Zhang Chengyang, Li Baican, Yan Hongying, Zheng Jiyuan, Wang Lifeng, Zhang Junguo

机构信息

School of Technology, Beijing Forestry University, Beijing, 100083, PR China; Research Center for Biodiversity Intelligent Monitoring, Beijing Forestry University, Beijing, 100083, PR China; State Key Laboratory of Efficient Production of Forest Resources, Beijing Forestry University, Beijing, 100083, PR China.

School of Technology, Beijing Forestry University, Beijing, 100083, PR China; Research Center for Biodiversity Intelligent Monitoring, Beijing Forestry University, Beijing, 100083, PR China; State Key Laboratory of Efficient Production of Forest Resources, Beijing Forestry University, Beijing, 100083, PR China.

出版信息

Neural Netw. 2025 Jan;181:106794. doi: 10.1016/j.neunet.2024.106794. Epub 2024 Oct 15.

DOI:10.1016/j.neunet.2024.106794
PMID:39426035
Abstract

Long-tailed data distributions have been a major challenge for the practical application of deep learning. Information augmentation intends to expand the long-tailed data into uniform distribution, which provides a feasible way to mitigate the data starvation of underrepresented classes. However, most existing augmentation methods face two significant challenges: (1) limited diversity in generated samples, and (2) the adverse effect of generated negative samples on downstream classification performance. In this paper, we propose a novel information augmentation method, named ChatDiff, to provide diverse positive samples for underrepresented classes, and eliminate generated negative samples. Specifically, we start with a prompt template to extract textual prior knowledge from the ChatGPT-3.5 model, enhancing the feature space for underrepresented classes. Then using this prior knowledge, a conditional diffusion model generates semantic-rich image samples for tail classes. Moreover, the proposed ChatDiff leverages a CLIP-based discriminator to screen and remove generated negative samples. This process avoids neural network learning the invalid or erroneous features, and further, improves long-tailed classification performance. Comprehensive experiments conducted on long-tailed benchmarks such as CIFAR10-LT, CIFAR100-LT, ImageNet-LT, and iNaturalist 2018, validate the effectiveness of our ChatDiff method.

摘要

长尾数据分布一直是深度学习实际应用中的一大挑战。信息增强旨在将长尾数据扩展为均匀分布,这为缓解代表性不足类别的数据匮乏问题提供了一种可行的方法。然而,大多数现有的增强方法面临两个重大挑战:(1)生成样本的多样性有限,以及(2)生成的负样本对下游分类性能的不利影响。在本文中,我们提出了一种新颖的信息增强方法,名为ChatDiff,为代表性不足的类别提供多样化的正样本,并消除生成的负样本。具体而言,我们从一个提示模板开始,从ChatGPT-3.5模型中提取文本先验知识,增强代表性不足类别的特征空间。然后利用这些先验知识,一个条件扩散模型为尾部类别生成语义丰富的图像样本。此外,所提出的ChatDiff利用基于CLIP的鉴别器来筛选和去除生成的负样本。这一过程避免了神经网络学习无效或错误的特征,进而提高了长尾分类性能。在CIFAR10-LT、CIFAR100-LT、ImageNet-LT和iNaturalist 2018等长尾基准上进行的综合实验,验证了我们的ChatDiff方法的有效性。

相似文献

1
ChatDiff: A ChatGPT-based diffusion model for long-tailed classification.ChatDiff:一种基于ChatGPT的用于长尾分类的扩散模型。
Neural Netw. 2025 Jan;181:106794. doi: 10.1016/j.neunet.2024.106794. Epub 2024 Oct 15.
2
SH: Long-tailed classification via spatial constraint sampling, scalable network, and hybrid task.SH:通过空间约束采样、可扩展网络和混合任务进行长尾分类
Neural Netw. 2025 May;185:107247. doi: 10.1016/j.neunet.2025.107247. Epub 2025 Feb 8.
3
A Long-Tailed Image Classification Method Based on Enhanced Contrastive Visual Language.基于增强对比视觉语言的长尾图像分类方法。
Sensors (Basel). 2023 Jul 26;23(15):6694. doi: 10.3390/s23156694.
4
Mixed Mutual Transfer for Long-Tailed Image Classification.用于长尾图像分类的混合相互转移
Entropy (Basel). 2024 Oct 2;26(10):839. doi: 10.3390/e26100839.
5
When an extra rejection class meets out-of-distribution detection in long-tailed image classification.当在长尾图像分类中额外的拒绝类别遇到分布外检测时。
Neural Netw. 2024 Oct;178:106485. doi: 10.1016/j.neunet.2024.106485. Epub 2024 Jun 21.
6
A dual-branch model with inter- and intra-branch contrastive loss for long-tailed recognition.用于长尾识别的具有分支间和分支内对比损失的双分支模型。
Neural Netw. 2023 Nov;168:214-222. doi: 10.1016/j.neunet.2023.09.022. Epub 2023 Sep 21.
7
Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation.基于对比CutMix增强的长尾识别优化
IEEE Trans Image Process. 2024;33:4215-4230. doi: 10.1109/TIP.2024.3425148. Epub 2024 Jul 22.
8
Robust long-tailed recognition with distribution-aware adversarial example generation.
Neural Netw. 2025 Apr;184:106932. doi: 10.1016/j.neunet.2024.106932. Epub 2024 Dec 10.
9
Long-Tailed Food Classification.长尾食品分类。
Nutrients. 2023 Jun 15;15(12):2751. doi: 10.3390/nu15122751.
10
Instance-Specific Semantic Augmentation for Long-Tailed Image Classification.用于长尾图像分类的实例特定语义增强
IEEE Trans Image Process. 2024;33:2544-2557. doi: 10.1109/TIP.2024.3379929. Epub 2024 Apr 1.