• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DAM-Faster RCNN:基于双重注意力机制的木材少样本缺陷检测方法

DAM-Faster RCNN: few-shot defect detection method for wood based on dual attention mechanism.

作者信息

Tong Xingyu, Liang Zhihong, Qin Mingming, Liu Fangrong, Yang Jiayu, Xiao Hengjiang, Dai Wei

机构信息

College of Big Data and Intelligent Engineering, Southwest Forestry University, Kunming, 650224, China.

出版信息

Sci Rep. 2025 Jul 2;15(1):22860. doi: 10.1038/s41598-025-05479-y.

DOI:10.1038/s41598-025-05479-y
PMID:40593082
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12219072/
Abstract

In wood defect detection, factors such as few-shot sample scarcity, diverse defect types, and complex background interference severely limit the model's recognition accuracy and generalization ability. To address the above issues, this paper proposes an improved Faster RCNN model based on a dual attention mechanism (DAM). The model integrates cross-attention and spatial attention modules to enhance the expression of key region features, suppresses texture noise interference; the improved Wood-Region Proposal Network (WRPs) module utilizes feature mean pooling and cross-layer fusion strategies to significantly improve the quality and robustness of candidate box generation; in addition, the Wood-Feature Reconstruction Head (WFRH) module effectively enhances the adaptability to new classes and few-shot defects through multi-branch classification and weighted fusion mechanisms. After synergistic optimization of all modules, the model demonstrates superior detection accuracy and category discrimination capability. Experimental results show that the proposed method achieves state-of-the-art performance on the PASCAL VOC and FSOD datasets, particularly in the identification of 17 types of wood defects, where AP50 and AP75 are improved by 25% and 7.9%, respectively, validating the significant advantages of the proposed DAM mechanism under few-shot and complex background conditions. The findings of this study provide practical technical references for intelligent and efficient few-shot detection in real-world wood quality inspection tasks.

摘要

在木材缺陷检测中,少样本稀缺、缺陷类型多样以及复杂背景干扰等因素严重限制了模型的识别精度和泛化能力。为解决上述问题,本文提出了一种基于双注意力机制(DAM)的改进型Faster RCNN模型。该模型集成了交叉注意力和空间注意力模块,以增强关键区域特征的表达,抑制纹理噪声干扰;改进后的木材区域提议网络(WRPs)模块采用特征均值池化和跨层融合策略,显著提高了候选框生成的质量和鲁棒性;此外,木材特征重构头(WFRH)模块通过多分支分类和加权融合机制,有效增强了对新类别和少样本缺陷的适应性。经过所有模块的协同优化,该模型展现出卓越的检测精度和类别区分能力。实验结果表明,所提方法在PASCAL VOC和FSOD数据集上取得了领先性能,尤其在17种木材缺陷的识别中,AP50和AP75分别提高了25%和7.9%,验证了所提DAM机制在少样本和复杂背景条件下的显著优势。本研究结果为实际木材质量检测任务中的智能高效少样本检测提供了实用的技术参考。

相似文献

1
DAM-Faster RCNN: few-shot defect detection method for wood based on dual attention mechanism.DAM-Faster RCNN:基于双重注意力机制的木材少样本缺陷检测方法
Sci Rep. 2025 Jul 2;15(1):22860. doi: 10.1038/s41598-025-05479-y.
2
Few-shot object detection for pest insects via features aggregation and contrastive learning.通过特征聚合和对比学习实现害虫的少样本目标检测
Front Plant Sci. 2025 Jun 19;16:1522510. doi: 10.3389/fpls.2025.1522510. eCollection 2025.
3
iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet:一种用于可解释抗癌肽识别的双池因果扩张卷积网络。
Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.
4
A fake news detection model using the integration of multimodal attention mechanism and residual convolutional network.一种融合多模态注意力机制和残差卷积网络的假新闻检测模型。
Sci Rep. 2025 Jul 1;15(1):20544. doi: 10.1038/s41598-025-05702-w.
5
Few-shot network intrusion detection method based on multi-domain fusion and cross-attention.基于多域融合与交叉注意力的少样本网络入侵检测方法
PLoS One. 2025 Jul 2;20(7):e0327161. doi: 10.1371/journal.pone.0327161. eCollection 2025.
6
DASNet a dual branch multi level attention sheep counting network.DASNet是一种双分支多级注意力羊只计数网络。
Sci Rep. 2025 Jul 2;15(1):23228. doi: 10.1038/s41598-025-97929-w.
7
Spatial correlation guided cross scale feature fusion for age and gender estimation.用于年龄和性别估计的空间相关性引导的跨尺度特征融合
Sci Rep. 2025 Jul 1;15(1):20873. doi: 10.1038/s41598-025-03081-w.
8
HGCS-Det: A Deep Learning-Based Solution for Localizing and Recognizing Household Garbage in Complex Scenarios.HGCS-Det:一种基于深度学习的复杂场景下家庭垃圾定位与识别解决方案。
Sensors (Basel). 2025 Jun 14;25(12):3726. doi: 10.3390/s25123726.
9
A dual-domain perception gate-controlled adaptive fusion algorithm for road crack detection.一种用于道路裂缝检测的双域感知门控自适应融合算法。
Sci Rep. 2025 Jul 1;15(1):21485. doi: 10.1038/s41598-025-07610-5.
10
Sparse-view spectral CT reconstruction via a coupled subspace representation and score-based generative model.基于耦合子空间表示和基于分数的生成模型的稀疏视图光谱CT重建
Quant Imaging Med Surg. 2025 Jun 6;15(6):5474-5495. doi: 10.21037/qims-24-2226. Epub 2025 May 28.

本文引用的文献

1
DG-GAN: A High Quality Defect Image Generation Method for Defect Detection.DG-GAN:一种用于缺陷检测的高质量缺陷图像生成方法。
Sensors (Basel). 2023 Jun 26;23(13):5922. doi: 10.3390/s23135922.
2
SmartWoodID-an image collection of large end-grain surfaces to support wood identification systems.SmartWoodID-一个大型横截面表面图像集合,用于支持木材识别系统。
Database (Oxford). 2023 May 13;2023. doi: 10.1093/database/baad034.
3
Using Deep Learning to Identify Costa Rican Native Tree Species From Wood Cut Images.利用深度学习从木材切割图像中识别哥斯达黎加本土树种。
Front Plant Sci. 2022 Apr 1;13:789227. doi: 10.3389/fpls.2022.789227. eCollection 2022.
4
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning.ProtTrans:通过自监督学习理解生命语言。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7112-7127. doi: 10.1109/TPAMI.2021.3095381. Epub 2022 Sep 14.
5
Meta-Learning in Neural Networks: A Survey.元学习在神经网络中的研究进展综述
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5149-5169. doi: 10.1109/TPAMI.2021.3079209. Epub 2022 Aug 4.
6
Multimodal Transformer for Unaligned Multimodal Language Sequences.用于未对齐多模态语言序列的多模态变换器
Proc Conf Assoc Comput Linguist Meet. 2019 Jul;2019:6558-6569. doi: 10.18653/v1/p19-1656.
7
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.