• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用语义分割网络的快速R-CNN用于稳健行人检测

Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network.

作者信息

Liu Tianrui, Stathaki Tania

机构信息

Department of Electrical and Electronic Engineering, Imperial College London, London, United Kingdom.

出版信息

Front Neurorobot. 2018 Oct 5;12:64. doi: 10.3389/fnbot.2018.00064. eCollection 2018.

DOI:10.3389/fnbot.2018.00064
PMID:30344486
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6182048/
Abstract

Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. However, it is generally difficult to reduce false positives on hard negative samples such as tree leaves, traffic lights, poles, etc. Some of these hard negatives can be removed by making use of high level semantic vision cues. In this paper, we propose a region-based CNN method which makes use of semantic cues for better pedestrian detection. Our method extends the Faster R-CNN detection framework by adding a branch of network for semantic image segmentation. The semantic network aims to compute complementary higher level semantic features to be integrated with the convolutional features. We make use of multi-resolution feature maps extracted from different network layers in order to ensure good detection accuracy for pedestrians at different scales. Boosted forest is used for training the integrated features in a cascaded manner for hard negatives mining. Experiments on the Caltech pedestrian dataset show improvements on detection accuracy with the semantic network. With the deep VGG16 model, our pedestrian detection method achieves robust detection performance on the Caltech dataset.

摘要

卷积神经网络(CNN)凭借其强大的特征表示能力,在行人检测方面取得了显著进展。然而,要减少对诸如树叶、交通信号灯、电线杆等困难负样本的误报通常很困难。其中一些困难负样本可以通过利用高级语义视觉线索来去除。在本文中,我们提出了一种基于区域的CNN方法,该方法利用语义线索来更好地进行行人检测。我们的方法通过添加一个用于语义图像分割的网络分支来扩展Faster R-CNN检测框架。语义网络旨在计算互补的高级语义特征,以便与卷积特征集成。我们利用从不同网络层提取的多分辨率特征图,以确保对不同尺度的行人具有良好的检测精度。增强森林用于以级联方式训练集成特征,以进行困难负样本挖掘。在加州理工学院行人数据集上的实验表明,使用语义网络后检测精度有所提高。使用深度VGG16模型,我们的行人检测方法在加州理工学院数据集上实现了强大的检测性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/d548631f7a90/fnbot-12-00064-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/e0b03f6b1a66/fnbot-12-00064-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/de21ffc1779f/fnbot-12-00064-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/d5daaa8d8b17/fnbot-12-00064-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/25aeffd0cb94/fnbot-12-00064-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/19d2f15d7c51/fnbot-12-00064-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/d548631f7a90/fnbot-12-00064-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/e0b03f6b1a66/fnbot-12-00064-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/de21ffc1779f/fnbot-12-00064-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/d5daaa8d8b17/fnbot-12-00064-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/25aeffd0cb94/fnbot-12-00064-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/19d2f15d7c51/fnbot-12-00064-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17f7/6182048/d548631f7a90/fnbot-12-00064-g0006.jpg

相似文献

1
Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network.使用语义分割网络的快速R-CNN用于稳健行人检测
Front Neurorobot. 2018 Oct 5;12:64. doi: 10.3389/fnbot.2018.00064. eCollection 2018.
2
Pedestrian Detection Using Integrated Aggregate Channel Features and Multitask Cascaded Convolutional Neural-Network-Based Face Detectors.基于集成聚合通道特征和多任务级联卷积神经网络的人脸检测器的行人检测。
Sensors (Basel). 2022 May 7;22(9):3568. doi: 10.3390/s22093568.
3
Accurate Pedestrian Detection by Human Pose Regression.通过人体姿态回归实现精确的行人检测
IEEE Trans Image Process. 2019 Sep 26. doi: 10.1109/TIP.2019.2942686.
4
Detection Method of Citrus Psyllids With Field High-Definition Camera Based on Improved Cascade Region-Based Convolution Neural Networks.基于改进的基于区域的级联卷积神经网络的田间高清相机检测柑橘木虱方法
Front Plant Sci. 2022 Jan 24;12:816272. doi: 10.3389/fpls.2021.816272. eCollection 2021.
5
Learning Multilayer Channel Features for Pedestrian Detection.学习用于行人检测的多层通道特征。
IEEE Trans Image Process. 2017 Jul;26(7):3210-3220. doi: 10.1109/TIP.2017.2694224. Epub 2017 Apr 26.
6
Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining.基于区域卷积神经网络和难负样本挖掘的航空图像车辆检测
Sensors (Basel). 2017 Feb 10;17(2):336. doi: 10.3390/s17020336.
7
Butterfly network: a convolutional neural network with a new architecture for multi-scale semantic segmentation of pedestrians.蝴蝶网络:一种具有全新架构的卷积神经网络,用于行人的多尺度语义分割。
J Real Time Image Process. 2023;20(1):9. doi: 10.1007/s11554-023-01273-z. Epub 2023 Feb 2.
8
Coupled Network for Robust Pedestrian Detection With Gated Multi-Layer Feature Extraction and Deformable Occlusion Handling.具有门控多层特征提取和可变形遮挡处理的稳健行人检测的耦合网络。
IEEE Trans Image Process. 2021;30:754-766. doi: 10.1109/TIP.2020.3038371. Epub 2020 Dec 4.
9
Pedestrian Detection with Semantic Regions of Interest.基于语义感兴趣区域的行人检测
Sensors (Basel). 2017 Nov 22;17(11):2699. doi: 10.3390/s17112699.
10
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.

引用本文的文献

1
Cascade contour-enhanced panoptic segmentation for robotic vision perception.用于机器人视觉感知的级联轮廓增强全景分割
Front Neurorobot. 2024 Oct 21;18:1489021. doi: 10.3389/fnbot.2024.1489021. eCollection 2024.
2
The Development of a Skin Cancer Classification System for Pigmented Skin Lesions Using Deep Learning.利用深度学习开发用于色素性皮肤病变的皮肤癌分类系统。
Biomolecules. 2020 Jul 29;10(8):1123. doi: 10.3390/biom10081123.
3
EDSSA: An Encoder-Decoder Semantic Segmentation Networks Accelerator on OpenCL-Based FPGA Platform.

本文引用的文献

1
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
2
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
3
Fully Convolutional Networks for Semantic Segmentation.
EDSSA:基于OpenCL的FPGA平台上的编解码器语义分割网络加速器。
Sensors (Basel). 2020 Jul 17;20(14):3969. doi: 10.3390/s20143969.
4
Detection of Mild Cognitive Impairment Using Convolutional Neural Network: Temporal-Feature Maps of Functional Near-Infrared Spectroscopy.使用卷积神经网络检测轻度认知障碍:功能近红外光谱的时间特征图
Front Aging Neurosci. 2020 May 21;12:141. doi: 10.3389/fnagi.2020.00141. eCollection 2020.
全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.
4
Fast Feature Pyramids for Object Detection.快速目标检测特征金字塔。
IEEE Trans Pattern Anal Mach Intell. 2014 Aug;36(8):1532-45. doi: 10.1109/TPAMI.2014.2300479.
5
Fast image interpolation via random forests.基于随机森林的快速图像插值。
IEEE Trans Image Process. 2015 Oct;24(10):3232-45. doi: 10.1109/TIP.2015.2440751.
6
Visual-Patch-Attention-Aware Saliency Detection.基于视觉注意感知的显著目标检测。
IEEE Trans Cybern. 2015 Aug;45(8):1575-86. doi: 10.1109/TCYB.2014.2356200. Epub 2014 Oct 1.
7
Layered object models for image segmentation.分层目标模型的图像分割。
IEEE Trans Pattern Anal Mach Intell. 2012 Sep;34(9):1731-43. doi: 10.1109/TPAMI.2011.208.
8
Object detection with discriminatively trained part-based models.基于判别式训练的部件模型的目标检测。
IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1627-45. doi: 10.1109/TPAMI.2009.167.