• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于改进的 YOLOX-s 网络的野生动物目标检测。

Wildlife target detection based on improved YOLOX-s network.

机构信息

School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou, 310018, Zhejiang, China.

School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou, 310018, Zhejiang, China.

出版信息

Sci Rep. 2024 Oct 9;14(1):23608. doi: 10.1038/s41598-024-73631-1.

DOI:10.1038/s41598-024-73631-1
PMID:39384881
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11464500/
Abstract

To addresse the problem of poor detection accuracy or even false detection of wildlife caused by rainy environment at night. In this paper, a wildlife target detection algorithm based on improved YOLOX-s network is proposed. Our algorithm comprises the MobileViT-Pooling module, the Dynamic Head module, and the Focal-IoU module.First, the MobileViT-Pooling module is introduced.It is based on the MobileViT attention mechanism, which uses a spatial pooling operator with no parameters as a token mixer module to reduce the number of network parameters. This module performs feature extraction on three feature layers of the backbone network output respectively, senses the global information and strengthens the weight of the effective information. Second, the Dynamic Head module is used on the downstream task of network detection, which fuses the information of scale sensing, spatial sensing, and task sensing and improves the representation ability of the target detection head. Lastly, the Focal idea is utilized to improve the IoU loss function, which balances the learning of high and low quality IoU for the network. Experimental results reveal that our algorithm achieves a notable performance boost with mAP@0.5 reaching 87.8% (an improvement of 7.9%) and mAP@0.5:0.95 reaching 62.0% (an improvement of 5.3%). This advancement significantly augments the night-time wildlife detection accuracy under rainy conditions, concurrently diminishing false detections in such challenging environments.

摘要

为了解决夜间雨天环境导致野生动物检测精度差甚至误检的问题。本文提出了一种基于改进的 YOLOX-s 网络的野生动物目标检测算法。我们的算法包括 MobileViT-Pooling 模块、Dynamic Head 模块和 Focal-IoU 模块。首先,引入 MobileViT-Pooling 模块。它基于 MobileViT 注意力机制,使用无参数的空间池化算子作为令牌混合器模块,减少网络参数数量。该模块分别对骨干网络输出的三个特征层进行特征提取,感知全局信息并增强有效信息的权重。其次,在网络检测的下游任务中使用 Dynamic Head 模块,融合尺度感知、空间感知和任务感知的信息,提高目标检测头的表示能力。最后,利用 Focal 思想改进 IoU 损失函数,平衡网络对高质量和低质量 IoU 的学习。实验结果表明,我们的算法在 mAP@0.5 达到 87.8%(提高了 7.9%)和 mAP@0.5:0.95 达到 62.0%(提高了 5.3%)方面取得了显著的性能提升。这一进步显著提高了夜间雨天环境下的野生动物检测精度,同时减少了此类挑战性环境下的误检。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/25b8538ec098/41598_2024_73631_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/40727961440d/41598_2024_73631_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/f984ea795d2b/41598_2024_73631_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/f861f219509e/41598_2024_73631_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/81dd50a2d5c4/41598_2024_73631_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/46061511569f/41598_2024_73631_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/c76b93794e2a/41598_2024_73631_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/25b8538ec098/41598_2024_73631_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/40727961440d/41598_2024_73631_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/f984ea795d2b/41598_2024_73631_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/f861f219509e/41598_2024_73631_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/81dd50a2d5c4/41598_2024_73631_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/46061511569f/41598_2024_73631_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/c76b93794e2a/41598_2024_73631_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e01/11464500/25b8538ec098/41598_2024_73631_Fig7_HTML.jpg

相似文献

1
Wildlife target detection based on improved YOLOX-s network.基于改进的 YOLOX-s 网络的野生动物目标检测。
Sci Rep. 2024 Oct 9;14(1):23608. doi: 10.1038/s41598-024-73631-1.
2
A Domestic Trash Detection Model Based on Improved YOLOX.基于改进 YOLOX 的家庭垃圾检测模型。
Sensors (Basel). 2022 Sep 15;22(18):6974. doi: 10.3390/s22186974.
3
YOLOv8-RMDA: Lightweight YOLOv8 Network for Early Detection of Small Target Diseases in Tea.YOLOv8-RMDA:用于茶中早期检测小目标疾病的轻量级 YOLOv8 网络。
Sensors (Basel). 2024 May 1;24(9):2896. doi: 10.3390/s24092896.
4
Enhanced Lightweight YOLOX for Small Object Wildfire Detection in UAV Imagery.用于无人机图像中小目标野火检测的增强型轻量级YOLOX
Sensors (Basel). 2024 Apr 24;24(9):2710. doi: 10.3390/s24092710.
5
Multiple Attention Mechanism Enhanced YOLOX for Remote Sensing Object Detection.多注意力机制增强 YOLOX 用于遥感目标检测。
Sensors (Basel). 2023 Jan 22;23(3):1261. doi: 10.3390/s23031261.
6
Research on vehicle detection based on improved YOLOX_S.基于改进YOLOX_S的车辆检测研究。
Sci Rep. 2023 Dec 27;13(1):23081. doi: 10.1038/s41598-023-50306-x.
7
Research on Safety Helmet Detection Algorithm Based on Improved YOLOv5s.基于改进 YOLOv5s 的安全头盔检测算法研究。
Sensors (Basel). 2023 Jun 22;23(13):5824. doi: 10.3390/s23135824.
8
Small object detection algorithm incorporating swin transformer for tea buds.用于茶芽的融合 Swin 变换小目标检测算法。
PLoS One. 2024 Mar 21;19(3):e0299902. doi: 10.1371/journal.pone.0299902. eCollection 2024.
9
Weed target detection at seedling stage in paddy fields based on YOLOX.基于 YOLOX 的稻田苗期杂草目标检测。
PLoS One. 2023 Dec 13;18(12):e0294709. doi: 10.1371/journal.pone.0294709. eCollection 2023.
10
Improved Algorithm for Insulator and Its Defect Detection Based on YOLOX.基于 YOLOX 的绝缘子及其缺陷检测改进算法。
Sensors (Basel). 2022 Aug 18;22(16):6186. doi: 10.3390/s22166186.

本文引用的文献

1
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
2
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
3
Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna.
《塞伦盖蒂快照》,非洲热带稀树草原 40 种哺乳动物高频标注的相机陷阱图像。
Sci Data. 2015 Jun 9;2:150026. doi: 10.1038/sdata.2015.26. eCollection 2015.