• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鸟类物种检测网络:基于使用双特征混合器提取局部细节和全局信息的鸟类物种检测

Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer.

作者信息

Li Chaoyang, He Zhipeng, Lu Kai, Fang Chaoyang

机构信息

Jiangxi Protected Area Construction Center, Nanchang 330006, China.

Key Laboratory of Poyang Lake Wetland and Watershed Research, Ministry of Education, Jiangxi Normal University, Nanchang 330022, China.

出版信息

Sensors (Basel). 2025 Jan 6;25(1):291. doi: 10.3390/s25010291.

DOI:10.3390/s25010291
PMID:39797082
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11723449/
Abstract

Bird species detection is critical for applications such as the analysis of bird population dynamics and species diversity. However, this task remains challenging due to local structural similarities and class imbalances among bird species. Currently, most deep learning algorithms focus on designing local feature extraction modules while ignoring the importance of global information. However, this global information is essential for accurate bird species detection. To address this limitation, we propose BSD-Net, a bird species detection network. BSD-Net efficiently learns local and global information in pixels to accurately detect bird species. BSD-Net consists of two main components: a dual-branch feature mixer (DBFM) and a prediction balancing module (PBM). The dual-branch feature mixer extracts features from dichotomous feature segments using global attention and deep convolution, expanding the network's receptive field and achieving a strong inductive bias, allowing the network to distinguish between similar local details. The prediction balance module balances the difference in feature space based on the pixel values of each category, thereby resolving category imbalances and improving the network's detection accuracy. The experimental results using two public benchmarks and a self-constructed Poyang Lake Bird dataset demonstrate that BSD-Net outperforms existing methods, achieving 45.71% and 80.00% mAP50 with the CUB-200-2011 and Poyang Lake Bird datasets, respectively, and 66.03% AP with FBD-SV-2024, allowing for more accurate location and species information for bird detection tasks in video surveillance.

摘要

鸟类物种检测对于诸如鸟类种群动态分析和物种多样性分析等应用至关重要。然而,由于鸟类物种之间存在局部结构相似性和类别不平衡,这项任务仍然具有挑战性。目前,大多数深度学习算法专注于设计局部特征提取模块,而忽略了全局信息的重要性。然而,这种全局信息对于准确的鸟类物种检测至关重要。为了解决这一局限性,我们提出了BSD-Net,一种鸟类物种检测网络。BSD-Net能够有效地学习像素中的局部和全局信息,以准确检测鸟类物种。BSD-Net由两个主要组件组成:双分支特征混合器(DBFM)和预测平衡模块(PBM)。双分支特征混合器使用全局注意力和深度卷积从二分特征段中提取特征,扩大网络的感受野并实现强大的归纳偏差,使网络能够区分相似的局部细节。预测平衡模块基于每个类别的像素值平衡特征空间中的差异,从而解决类别不平衡问题并提高网络的检测精度。使用两个公共基准和自行构建的鄱阳湖鸟类数据集进行的实验结果表明,BSD-Net优于现有方法,在CUB-200-2011和鄱阳湖鸟类数据集上分别实现了45.71%和80.00%的mAP50,在FBD-SV-2024上实现了66.03%的AP,能够为视频监控中的鸟类检测任务提供更准确的位置和物种信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/db03116c7112/sensors-25-00291-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/6f08d027fd36/sensors-25-00291-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/ebdf9ce0b8ad/sensors-25-00291-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/8d5b2aca82be/sensors-25-00291-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/cdd1f856d4d4/sensors-25-00291-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/b34c6198d576/sensors-25-00291-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/0f54f6e4daee/sensors-25-00291-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/b4eed714421d/sensors-25-00291-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/db03116c7112/sensors-25-00291-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/6f08d027fd36/sensors-25-00291-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/ebdf9ce0b8ad/sensors-25-00291-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/8d5b2aca82be/sensors-25-00291-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/cdd1f856d4d4/sensors-25-00291-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/b34c6198d576/sensors-25-00291-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/0f54f6e4daee/sensors-25-00291-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/b4eed714421d/sensors-25-00291-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdc2/11723449/db03116c7112/sensors-25-00291-g008.jpg

相似文献

1
Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer.鸟类物种检测网络:基于使用双特征混合器提取局部细节和全局信息的鸟类物种检测
Sensors (Basel). 2025 Jan 6;25(1):291. doi: 10.3390/s25010291.
2
An Improved Bird Detection Method Using Surveillance Videos from Poyang Lake Based on YOLOv8.一种基于YOLOv8的利用鄱阳湖监控视频的改进鸟类检测方法。
Animals (Basel). 2024 Nov 21;14(23):3353. doi: 10.3390/ani14233353.
3
TransXNet: Learning Both Global and Local Dynamics With a Dual Dynamic Token Mixer for Visual Recognition.TransXNet:通过用于视觉识别的双动态令牌混合器学习全局和局部动态。
IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):11534-11547. doi: 10.1109/TNNLS.2025.3550979.
4
A Deep Learning Framework for Real-Time Bird Detection and Its Implications for Reducing Bird Strike Incidents.用于实时鸟类检测的深度学习框架及其减少鸟撞事件的意义。
Sensors (Basel). 2024 Aug 23;24(17):5455. doi: 10.3390/s24175455.
5
ASD-Net: a novel U-Net based asymmetric spatial-channel convolution network for precise kidney and kidney tumor image segmentation.ASD-Net:一种新颖的基于 U-Net 的非对称空间-通道卷积网络,用于精确的肾脏和肾肿瘤图像分割。
Med Biol Eng Comput. 2024 Jun;62(6):1673-1687. doi: 10.1007/s11517-024-03025-y. Epub 2024 Feb 8.
6
ILR-Net: Low-light image enhancement network based on the combination of iterative learning mechanism and Retinex theory.ILR-Net:基于迭代学习机制与Retinex理论相结合的低光照图像增强网络。
PLoS One. 2025 Feb 13;20(2):e0314541. doi: 10.1371/journal.pone.0314541. eCollection 2025.
7
IEA-Net: Internal and External Dual-Attention Medical Segmentation Network with High-Performance Convolutional Blocks.IEA-Net:具有高性能卷积块的内外双注意力医学分割网络
J Imaging Inform Med. 2025 Feb;38(1):602-614. doi: 10.1007/s10278-024-01217-4. Epub 2024 Aug 6.
8
A novel low light object detection method based on the YOLOv5 fusion feature enhancement.一种基于YOLOv5融合特征增强的新型低光照目标检测方法。
Sci Rep. 2024 Feb 23;14(1):4486. doi: 10.1038/s41598-024-54428-8.
9
Expansive Receptive Field and Local Feature Extraction Network: Advancing Multiscale Feature Fusion for Breast Fibroadenoma Segmentation in Sonography.扩张感受野与局部特征提取网络:推进超声检查中乳腺纤维瘤分割的多尺度特征融合
J Imaging Inform Med. 2024 Dec;37(6):2810-2824. doi: 10.1007/s10278-024-01142-6. Epub 2024 May 31.
10
DepITCM: an audio-visual method for detecting depression.DepITCM:一种检测抑郁症的视听方法。
Front Psychiatry. 2025 Jan 23;15:1466507. doi: 10.3389/fpsyt.2024.1466507. eCollection 2024.

引用本文的文献

1
Preliminary Analysis and Proof-of-Concept Validation of a Neuronally Controlled Visual Assistive Device Integrating Computer Vision with EEG-Based Binary Control.一种将计算机视觉与基于脑电图的二元控制相结合的神经控制视觉辅助设备的初步分析与概念验证
Sensors (Basel). 2025 Aug 21;25(16):5187. doi: 10.3390/s25165187.

本文引用的文献

1
Research on Fine-Grained Image Recognition of Birds Based on Improved YOLOv5.基于改进的 YOLOv5 的鸟类细粒度图像识别研究。
Sensors (Basel). 2023 Sep 30;23(19):8204. doi: 10.3390/s23198204.
2
An Efficient Time-Domain End-to-End Single-Channel Bird Sound Separation Network.一种高效的时域端到端单通道鸟鸣声分离网络。
Animals (Basel). 2022 Nov 11;12(22):3117. doi: 10.3390/ani12223117.
3
Imbalance Problems in Object Detection: A Review.目标检测中的不平衡问题:综述
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3388-3415. doi: 10.1109/TPAMI.2020.2981890. Epub 2021 Sep 2.
4
Deep Imbalanced Learning for Face Recognition and Attribute Prediction.深度不平衡学习在人脸识别和属性预测中的应用。
IEEE Trans Pattern Anal Mach Intell. 2020 Nov;42(11):2781-2794. doi: 10.1109/TPAMI.2019.2914680. Epub 2019 May 7.
5
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.