• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于自动驾驶车辆中鲁棒视觉目标检测的双阶段特征专业化网络。

Dual-stage feature specialization network for robust visual object detection in autonomous vehicles.

作者信息

Liu Ze, Wu Junhua, Cai Yingfeng, Wang Hai, Chen Long, Liu Qingchao

机构信息

Automotive Engineering Research Institute, Jiangsu University, Zhenjiang, 212013, Jiangsu, China.

School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang, 212013, Jiangsu, China.

出版信息

Sci Rep. 2025 May 3;15(1):15501. doi: 10.1038/s41598-025-99363-4.

DOI:10.1038/s41598-025-99363-4
PMID:40319138
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12049487/
Abstract

Efficient feature representation is critical for accurate visual perception in autonomous vehicles. Existing two-stage object detection methods often suffer from feature interference between candidate region generation and classification regression tasks, leading to suboptimal performance in complex scenes. To address this, we propose a Dual-Stage Feature Specialization Network (DSFSN) that decouples feature extraction: MobileNetV3 is employed for lightweight candidate region generation, while ResNet-FPN enhances multi-scale feature fusion for precise classification. Extensive experiments on PASCAL VOC and MS COCO datasets demonstrate state-of-the-art performance, achieving 81.6% mAP (9.3% higher than Faster R-CNN) and 29.3% AP on MS COCO, with a 14.9% improvement in small object detection. Real-world tests under diverse conditions (e.g., rain, night) validate the robustness of our method for autonomous driving applications. This work provides a novel framework for balancing accuracy and efficiency in visual perception systems.

摘要

高效的特征表示对于自动驾驶车辆中的准确视觉感知至关重要。现有的两阶段目标检测方法通常在候选区域生成和分类回归任务之间存在特征干扰,导致在复杂场景中的性能次优。为了解决这个问题,我们提出了一种双阶段特征专业化网络(DSFSN),它将特征提取解耦:使用MobileNetV3进行轻量级候选区域生成,而ResNet-FPN增强多尺度特征融合以进行精确分类。在PASCAL VOC和MS COCO数据集上进行的大量实验证明了其具有领先的性能,在MS COCO上实现了81.6%的平均精度均值(mAP)(比Faster R-CNN高9.3%)和29.3%的平均精度(AP),在小目标检测方面有14.9%的提升。在不同条件(如下雨、夜间)下的实际测试验证了我们的方法在自动驾驶应用中的鲁棒性。这项工作为视觉感知系统中平衡准确性和效率提供了一个新颖的框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/b2d13d206403/41598_2025_99363_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/d9c7862a2f3d/41598_2025_99363_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/c02707b9b86e/41598_2025_99363_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/21fa5ca4a96d/41598_2025_99363_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/c7b86621804c/41598_2025_99363_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/ec994425e7ee/41598_2025_99363_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/0fcb50b556c5/41598_2025_99363_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/9ebea9d8d8d8/41598_2025_99363_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/4027251a8bb9/41598_2025_99363_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/b2d13d206403/41598_2025_99363_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/d9c7862a2f3d/41598_2025_99363_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/c02707b9b86e/41598_2025_99363_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/21fa5ca4a96d/41598_2025_99363_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/c7b86621804c/41598_2025_99363_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/ec994425e7ee/41598_2025_99363_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/0fcb50b556c5/41598_2025_99363_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/9ebea9d8d8d8/41598_2025_99363_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/4027251a8bb9/41598_2025_99363_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b207/12049487/b2d13d206403/41598_2025_99363_Fig9_HTML.jpg

相似文献

1
Dual-stage feature specialization network for robust visual object detection in autonomous vehicles.用于自动驾驶车辆中鲁棒视觉目标检测的双阶段特征专业化网络。
Sci Rep. 2025 May 3;15(1):15501. doi: 10.1038/s41598-025-99363-4.
2
ssFPN: Scale Sequence () Feature-Based Feature Pyramid Network for Object Detection.ssFPN:基于尺度序列(Scale Sequence)特征的目标检测特征金字塔网络。
Sensors (Basel). 2023 Apr 30;23(9):4432. doi: 10.3390/s23094432.
3
Lightweight multi-scale network for small object detection.用于小目标检测的轻量级多尺度网络。
PeerJ Comput Sci. 2022 Nov 8;8:e1145. doi: 10.7717/peerj-cs.1145. eCollection 2022.
4
Improved Mask R-CNN Multi-Target Detection and Segmentation for Autonomous Driving in Complex Scenes.改进的 Mask R-CNN 多目标检测与分割在复杂场景下的自动驾驶。
Sensors (Basel). 2023 Apr 10;23(8):3853. doi: 10.3390/s23083853.
5
Lightweight Feature Enhancement Network for Single-Shot Object Detection.轻量级特征增强网络用于单目标检测。
Sensors (Basel). 2021 Feb 4;21(4):1066. doi: 10.3390/s21041066.
6
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
7
DPSSD: Dual-Path Single-Shot Detector.DPSSD:双路径单发探测器。
Sensors (Basel). 2022 Jun 18;22(12):4616. doi: 10.3390/s22124616.
8
DPNet: Dual-Path Network for Real-Time Object Detection With Lightweight Attention.DPNet:用于实时目标检测的带轻量级注意力机制的双路径网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4504-4518. doi: 10.1109/TNNLS.2024.3376563. Epub 2025 Feb 28.
9
IV-YOLO: A Lightweight Dual-Branch Object Detection Network.IV-YOLO:一种轻量级双分支目标检测网络。
Sensors (Basel). 2024 Sep 24;24(19):6181. doi: 10.3390/s24196181.
10
Diversity-Learning Block: Conquer Feature Homogenization of Multibranch.多样性学习模块:克服多分支的特征同质化
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7563-7576. doi: 10.1109/TNNLS.2022.3214993. Epub 2024 Jun 3.

本文引用的文献

1
A strategy for tough and fatigue-resistant hydrogels via loose cross-linking and dense dehydration-induced entanglements.一种通过松散交联和致密脱水诱导缠结制备坚韧且抗疲劳水凝胶的策略。
Nat Commun. 2024 Jul 13;15(1):5896. doi: 10.1038/s41467-024-50364-3.
2
Several Feature Extraction Methods Combined with Near-Infrared Spectroscopy for Identifying the Geographical Origins of Milk.几种结合近红外光谱的特征提取方法用于鉴定牛奶的地理来源
Foods. 2024 Jun 6;13(11):1783. doi: 10.3390/foods13111783.
3
Multi-Scale Safety Helmet Detection Based on RSSE-YOLOv3.
基于 RSSE-YOLOv3 的多尺度安全头盔检测。
Sensors (Basel). 2022 Aug 13;22(16):6061. doi: 10.3390/s22166061.
4
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
5
Fast and robust object detection using asymmetric totally corrective boosting.使用非对称完全校正提升实现快速稳健的目标检测。
IEEE Trans Neural Netw Learn Syst. 2012 Jan;23(1):33-46. doi: 10.1109/TNNLS.2011.2178324.
6
Yeast go the whole HOG for the hyperosmotic response.酵母在高渗应激反应中全力以赴地激活高渗甘油(HOG)途径。
Trends Genet. 2002 Aug;18(8):405-12. doi: 10.1016/s0168-9525(02)02723-3.