
The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition.

Authors

Tan Jingru, Li Bo, Lu Xin, Yao Yongqiang, Yu Fengwei, He Tong, Ouyang Wanli

Publication information

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13876-13892. doi: 10.1109/TPAMI.2023.3298433. Epub 2023 Oct 3.

DOI: 10.1109/TPAMI.2023.3298433
PMID: 37486845

Abstract

Long-tailed distributions are widespread in real-world applications. Because they have so few instances, tail categories often show inferior accuracy. In this paper, we find that this performance bottleneck is mainly caused by imbalanced gradients, which can be divided into two parts: (1) a positive part, deriving from samples of the same category, and (2) a negative part, contributed by other categories. Based on comprehensive experiments, we also observe that the ratio of accumulated positive to negative gradients is a good indicator of how balanced the training of a category is. Inspired by this, we propose a gradient-driven training mechanism to tackle the long-tail problem: dynamically re-balancing the positive and negative gradients according to the current accumulated gradients, with the unified goal of achieving balanced gradient ratios. Taking advantage of this simple and flexible gradient mechanism, we introduce a new family of gradient-driven loss functions, namely equalization losses. We conduct extensive experiments on a wide spectrum of visual tasks, including two-stage and single-stage long-tailed object detection (LVIS), long-tailed image classification (ImageNet-LT, Places-LT, iNaturalist), and long-tailed semantic segmentation (ADE20K). Our method consistently outperforms the baseline models, demonstrating the effectiveness and generalization ability of the proposed equalization losses.
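
The abstract describes the mechanism only in words, so the following is a minimal sketch of one way it could look, not the authors' implementation. It assumes a per-class sigmoid classifier with binary cross-entropy, and it assumes a sigmoid-shaped mapping from the accumulated gradient ratio to the weights. The function names `gradient_weights` and `rebalanced_step` and the hyperparameters `gamma`, `mu`, and `alpha` are hypothetical and chosen for illustration.

```python
import numpy as np


def gradient_weights(pos_grad, neg_grad, gamma=12.0, mu=0.8, alpha=4.0):
    """Map accumulated per-class gradients to (positive, negative) weights.

    ASSUMPTION: the sigmoid mapping and the values of gamma, mu and alpha
    are illustrative; the abstract does not specify them.
    """
    # Ratio of accumulated positive to negative gradients, one entry per class.
    ratio = pos_grad / (neg_grad + 1e-10)
    # Squash the ratio into (0, 1): close to 0 for classes dominated by negatives.
    f = 1.0 / (1.0 + np.exp(-gamma * (ratio - mu)))
    neg_w = f                        # suppress negatives for under-trained classes
    pos_w = 1.0 + alpha * (1.0 - f)  # boost positives for under-trained classes
    return pos_w, neg_w


def rebalanced_step(logits, targets, pos_grad, neg_grad):
    """One re-balanced step for sigmoid BCE (assumed loss).

    logits, targets: arrays of shape (num_samples, num_classes); targets are 0/1.
    Returns the re-weighted gradient w.r.t. the logits and the updated
    accumulated positive/negative gradients.
    """
    pos_w, neg_w = gradient_weights(pos_grad, neg_grad)
    probs = 1.0 / (1.0 + np.exp(-logits))
    # Per-element weight: pos_w where the class is the label, neg_w elsewhere.
    weights = targets * pos_w + (1.0 - targets) * neg_w
    # Gradient of weighted BCE w.r.t. logits, treating the weights as constants.
    grad = (probs - targets) * weights
    # Accumulate gradient magnitudes separately for positives and negatives.
    new_pos = pos_grad + np.abs(grad * targets).sum(axis=0)
    new_neg = neg_grad + np.abs(grad * (1.0 - targets)).sum(axis=0)
    return grad, new_pos, new_neg
```

In this sketch, a class whose accumulated ratio is low (typically a tail class) has its negative gradients scaled down and its positive gradients scaled up on the next step, which pushes the ratio back toward balance. The accumulators are updated with the re-weighted gradients, so the weights adapt as training proceeds.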


Similar articles

1
Inverse Image Frequency for Long-Tailed Image Recognition.
IEEE Trans Image Process. 2023;32:5721-5736. doi: 10.1109/TIP.2023.3321461. Epub 2023 Oct 24.
2
Open Long-Tailed Recognition in a Dynamic World.
IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1836-1851. doi: 10.1109/TPAMI.2022.3200091. Epub 2024 Feb 6.
3
Divide and Retain: A Dual-Phase Modeling for Long-Tailed Visual Recognition.
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):13538-13549. doi: 10.1109/TNNLS.2023.3269907. Epub 2024 Oct 7.
4
Key Point Sensitive Loss for Long-Tailed Visual Recognition.
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4812-4825. doi: 10.1109/TPAMI.2022.3196044. Epub 2023 Mar 7.
5
A dual-branch model with inter- and intra-branch contrastive loss for long-tailed recognition.
Neural Netw. 2023 Nov;168:214-222. doi: 10.1016/j.neunet.2023.09.022. Epub 2023 Sep 21.
6
Rectify representation bias in vision-language models for long-tailed recognition.
Neural Netw. 2024 Apr;172:106134. doi: 10.1016/j.neunet.2024.106134. Epub 2024 Jan 17.
7
Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation.
IEEE Trans Image Process. 2024;33:4215-4230. doi: 10.1109/TIP.2024.3425148. Epub 2024 Jul 22.
8
ChatDiff: A ChatGPT-based diffusion model for long-tailed classification.
Neural Netw. 2025 Jan;181:106794. doi: 10.1016/j.neunet.2024.106794. Epub 2024 Oct 15.
9
MBNM: Multi-branch network based on memory features for long-tailed medical image recognition.
Comput Methods Programs Biomed. 2021 Nov;212:106448. doi: 10.1016/j.cmpb.2021.106448. Epub 2021 Oct 2.