CBNet：一种用于目标检测的复合主干网络架构

CBNet: A Composite Backbone Network Architecture for Object Detection.

作者信息

Liang Tingting, Chu Xiaojie, Liu Yudong, Wang Yongtao, Tang Zhi, Chu Wei, Chen Jingdong, Ling Haibin

出版信息

IEEE Trans Image Process. 2022 Oct 28;PP. doi: 10.1109/TIP.2022.3216771.

DOI:10.1109/TIP.2022.3216771

Abstract

top-performing object detectors depend heavily on backbone networks, whose advances bring consistent performance gains through exploring more effective network structures. In this paper, we propose a novel and flexible backbone framework, namely CBNet, to construct high-performance detectors using existing open-source pre-trained backbones under the pre-training fine-tuning paradigm. In particular, CBNet architecture groups multiple identical backbones, which are connected through composite connections. Specifically, it integrates the high- and low-level features of multiple identical backbone networks and gradually expands the receptive field to more effectively perform object detection. We also propose a better training strategy with auxiliary supervision for CBNet-based detectors. CBNet has strong generalization capabilities for different backbones and head designs of the detector architecture. Without additional pre-training of the composite backbone, CBNet can be adapted to various backbones (i.e., CNN-based vs. Transformer-based) and head designs of most mainstream detectors (i.e., one-stage vs. two-stage, anchor-based vs. anchor-free-based). Experiments provide strong evidence that, compared with simply increasing the depth and width of the network, CBNet introduces a more efficient, effective, and resource-friendly way to build high-performance backbone networks. Particularly, our CB-Swin-L achieves 59.4% box AP and 51.6% mask AP on COCO test-dev under the single-model and single-scale testing protocol, which are significantly better than the state-of-the-art results (i.e., 57.7% box AP and 50.2% mask AP) achieved by Swin-L, while reducing the training time by 6×. With multi-scale testing, we push the current best single model result to a new record of 60.1% box AP and 52.3% mask AP without using extra training data. Code is available at https://github.com/VDIGPKU/CBNetV2.

摘要

性能卓越的目标检测器严重依赖主干网络，其进步通过探索更有效的网络结构带来了持续的性能提升。在本文中，我们提出了一种新颖且灵活的主干框架，即CBNet，以在预训练微调范式下使用现有的开源预训练主干构建高性能检测器。具体而言，CBNet架构将多个相同的主干分组，这些主干通过复合连接相连。具体来说，它整合了多个相同主干网络的高低层特征，并逐步扩大感受野以更有效地执行目标检测。我们还为基于CBNet的检测器提出了一种带有辅助监督的更好的训练策略。CBNet对于检测器架构的不同主干和头部设计具有强大的泛化能力。无需对复合主干进行额外的预训练，CBNet就可以适应各种主干（即基于卷积神经网络的与基于Transformer的）以及大多数主流检测器的头部设计（即单阶段与两阶段、基于锚框的与无锚框的）。实验提供了有力证据，表明与简单增加网络的深度和宽度相比，CBNet引入了一种更高效、有效且资源友好的方式来构建高性能主干网络。特别是，我们的CB-Swin-L在单模型和单尺度测试协议下在COCO测试开发集上实现了59.4%的框AP和51.6%的掩码AP，显著优于Swin-L所取得的当前最优结果（即57.7%的框AP和50.2%的掩码AP），同时将训练时间减少了6倍。通过多尺度测试，我们在不使用额外训练数据的情况下将当前最佳单模型结果提升到了60.1%的框AP和52.3%的掩码AP的新记录。代码可在https://github.com/VDIGPKU/CBNetV2获取。

相似文献

CBNet: A Composite Backbone Network Architecture for Object Detection.CBNet：一种用于目标检测的复合主干网络架构

IEEE Trans Image Process. 2022 Oct 28;PP. doi: 10.1109/TIP.2022.3216771.

A Novel Underwater Image Enhancement Using Optimal Composite Backbone Network.一种基于最优复合骨干网络的新型水下图像增强方法。

Biomimetics (Basel). 2023 Jun 27;8(3):275. doi: 10.3390/biomimetics8030275.

Object Detection from Scratch with Deep Supervision.基于深度监督的目标从头检测。

IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):398-412. doi: 10.1109/TPAMI.2019.2922181. Epub 2019 Jun 11.

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network With Token Migration.快速iTPN：带令牌迁移的整体预训练变压器金字塔网络

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):9766-9779. doi: 10.1109/TPAMI.2024.3429508. Epub 2024 Nov 6.

Object detectors involving a NAS-gate convolutional module and capsule attention module.基于 NAS 门控卷积模块和胶囊注意力模块的目标探测器。

Sci Rep. 2022 Mar 10;12(1):3916. doi: 10.1038/s41598-022-07898-7.

CenterNet++ for Object Detection.用于目标检测的CenterNet++

IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3509-3521. doi: 10.1109/TPAMI.2023.3342120. Epub 2024 Apr 3.

Unsupervised Pre-Training for Detection Transformers.用于检测变压器的无监督预训练

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12772-12782. doi: 10.1109/TPAMI.2022.3216514. Epub 2023 Oct 3.

Object Detection Based on Swin Deformable Transformer-BiPAFPN-YOLOX.基于 Swin 变形 Transformer-BiPAFPN-YOLOX 的目标检测。

Comput Intell Neurosci. 2023 Mar 9;2023:4228610. doi: 10.1155/2023/4228610. eCollection 2023.

Res2Net: A New Multi-Scale Backbone Architecture.Res2Net：一种新的多尺度骨干网络架构。

IEEE Trans Pattern Anal Mach Intell. 2021 Feb;43(2):652-662. doi: 10.1109/TPAMI.2019.2938758. Epub 2021 Jan 8.

Interactive Regression and Classification for Dense Object Detector.密集目标检测器的交互式回归与分类

IEEE Trans Image Process. 2022;31:3684-3696. doi: 10.1109/TIP.2022.3174391. Epub 2022 May 26.

引用本文的文献

Motion blur aware multiscale adaptive cascade framework for ear tag dropout detection in reserve breeding pigs.用于后备种猪耳标脱落检测的运动模糊感知多尺度自适应级联框架

Sci Rep. 2025 Jul 7;15(1):24188. doi: 10.1038/s41598-025-09679-4.

Aircraft Wake Vortex Recognition Method Based on Improved Inception-VGG16 Hybrid Network.基于改进型Inception-VGG16混合网络的飞机尾流涡识别方法

Sensors (Basel). 2025 May 4;25(9):2909. doi: 10.3390/s25092909.

Cup and Disc Segmentation in Smartphone Handheld Ophthalmoscope Images with a Composite Backbone and Double Decoder Architecture.基于复合主干和双解码器架构的智能手机手持检眼镜图像中的视杯和视盘分割

Vision (Basel). 2025 Apr 11;9(2):32. doi: 10.3390/vision9020032.

A Review of Machine Learning and Deep Learning Methods for Person Detection, Tracking and Identification, and Face Recognition with Applications.用于人员检测、跟踪与识别以及人脸识别应用的机器学习和深度学习方法综述

Sensors (Basel). 2025 Feb 26;25(5):1410. doi: 10.3390/s25051410.

Attention-based scale sequence network for small object detection.用于小目标检测的基于注意力的尺度序列网络。

Heliyon. 2024 Jun 19;10(12):e32931. doi: 10.1016/j.heliyon.2024.e32931. eCollection 2024 Jun 30.

A Novel Underwater Image Enhancement Using Optimal Composite Backbone Network.一种基于最优复合骨干网络的新型水下图像增强方法。

Biomimetics (Basel). 2023 Jun 27;8(3):275. doi: 10.3390/biomimetics8030275.

ssFPN: Scale Sequence () Feature-Based Feature Pyramid Network for Object Detection.ssFPN：基于尺度序列（Scale Sequence）特征的目标检测特征金字塔网络。

Sensors (Basel). 2023 Apr 30;23(9):4432. doi: 10.3390/s23094432.

Exploring Soybean Flower and Pod Variation Patterns During Reproductive Period Based on Fusion Deep Learning.基于融合深度学习探索大豆生殖期花荚变异模式

Front Plant Sci. 2022 Jul 13;13:922030. doi: 10.3389/fpls.2022.922030. eCollection 2022.

Detection of plane in remote sensing images using super-resolution.利用超分辨率技术检测遥感图像中的飞机。

PLoS One. 2022 Apr 21;17(4):e0265503. doi: 10.1371/journal.pone.0265503. eCollection 2022.

A review on modern defect detection models using DCNNs - Deep convolutional neural networks.基于 DCNN 的现代缺陷检测模型综述 - 深度卷积神经网络。

J Adv Res. 2021 Apr 23;35:33-48. doi: 10.1016/j.jare.2021.03.015. eCollection 2022 Jan.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

CBNet：一种用于目标检测的复合主干网络架构

CBNet: A Composite Backbone Network Architecture for Object Detection.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献