• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多模态深度学习网络的 RGB-D 路面废弃物检测与识别

Multi-modal deep learning networks for RGB-D pavement waste detection and recognition.

机构信息

School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, Shaanxi, China.

出版信息

Waste Manag. 2024 Apr 1;177:125-134. doi: 10.1016/j.wasman.2024.01.047. Epub 2024 Feb 6.

DOI:10.1016/j.wasman.2024.01.047
PMID:38325013
Abstract

To create a clean living environment, governments around the world have hired a large number of workers to clean up waste on pavements, which is inefficient for waste management. To better alleviate this problem, relevant scholars have proposed several deep learning methods based on RGB images to achieve waste detection and recognition. Considering the limitations of color images, we propose an efficient multi-modal learning solution for pavement waste detection and recognition. Specifically, we construct a high-quality outdoor pavement waste dataset called OPWaste, which is more in line with real needs. Compared to other waste datasets, OPWaste dataset not only has the advantages of rich background and high diversity, but also provides color and depth images. Meanwhile, we explore six different multi-modal fusion methods and propose a novel multi-modal multi-scale network (MM-Net) for RGB-D waste detection and recognition. MM-Net introduces a novel multi-scale refinement module (MRM) and multi-scale interaction module (MIM). MRM can effectively refine critical features using attention mechanisms. MIM can gradually realize information interaction between hierarchical features. In addition, we select several representative methods and perform comparative experiments. Experimental results show that MM-Net based on the image addition fusion method outperforms other deep learning models and reaches 97.3% and 84.4% on mAP and AR metrics. In fact, multi-modal learning plays an important role in intelligent waste recycling. As a promising auxiliary tool, our solution can be applied to intelligent cleaning robots for automatic outdoor waste management.

摘要

为了创造一个清洁的生活环境,世界各国政府已经雇佣了大量工人清理人行道上的垃圾,但这种方式在垃圾管理方面效率低下。为了更好地缓解这个问题,相关学者提出了几种基于 RGB 图像的深度学习方法,以实现废物检测和识别。考虑到彩色图像的局限性,我们提出了一种高效的多模态学习解决方案,用于路面废物检测和识别。具体来说,我们构建了一个名为 OPWaste 的高质量户外路面废物数据集,它更符合实际需求。与其他废物数据集相比,OPWaste 数据集不仅具有丰富背景和高度多样性的优势,还提供了彩色和深度图像。同时,我们探索了六种不同的多模态融合方法,并提出了一种新颖的 RGB-D 废物检测和识别的多模态多尺度网络(MM-Net)。MM-Net 引入了一种新颖的多尺度细化模块(MRM)和多尺度交互模块(MIM)。MRM 可以使用注意力机制有效地细化关键特征。MIM 可以逐步实现分层特征之间的信息交互。此外,我们选择了几种有代表性的方法进行对比实验。实验结果表明,基于图像相加融合方法的 MM-Net 优于其他深度学习模型,在 mAP 和 AR 指标上分别达到 97.3%和 84.4%。事实上,多模态学习在智能废物回收中起着重要作用。作为一种有前途的辅助工具,我们的解决方案可以应用于智能清洁机器人,实现自动户外废物管理。

相似文献

1
Multi-modal deep learning networks for RGB-D pavement waste detection and recognition.基于多模态深度学习网络的 RGB-D 路面废弃物检测与识别
Waste Manag. 2024 Apr 1;177:125-134. doi: 10.1016/j.wasman.2024.01.047. Epub 2024 Feb 6.
2
RGB-D Object Recognition Using Multi-Modal Deep Neural Network and DS Evidence Theory.基于多模态深度神经网络和证据理论的 RGB-D 目标识别。
Sensors (Basel). 2019 Jan 27;19(3):529. doi: 10.3390/s19030529.
3
Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images.基于RGB-D图像的多模态深度学习用于麦田杂草检测
Front Plant Sci. 2021 Nov 5;12:732968. doi: 10.3389/fpls.2021.732968. eCollection 2021.
4
SLMSF-Net: A Semantic Localization and Multi-Scale Fusion Network for RGB-D Salient Object Detection.SLMSF-Net:用于RGB-D显著目标检测的语义定位与多尺度融合网络
Sensors (Basel). 2024 Feb 8;24(4):1117. doi: 10.3390/s24041117.
5
Optimally leveraging depth features to enhance segmentation of recyclables from cluttered construction and demolition waste streams.优化利用深度特征以增强从杂乱的建筑和拆除废物流中对可回收物的分割。
J Environ Manage. 2024 Mar;354:120313. doi: 10.1016/j.jenvman.2024.120313. Epub 2024 Feb 16.
6
RGB-D fusion models for construction and demolition waste detection.基于 RGB-D 融合的建筑和拆除垃圾检测模型。
Waste Manag. 2022 Feb 15;139:96-104. doi: 10.1016/j.wasman.2021.12.021. Epub 2021 Dec 23.
7
RGB-D based multi-modal deep learning for spacecraft and debris recognition.基于RGB-D的多模态深度学习用于航天器与碎片识别。
Sci Rep. 2022 Mar 10;12(1):3924. doi: 10.1038/s41598-022-07846-5.
8
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
9
GCDN-Net: Garbage classifier deep neural network for recyclable urban waste management.GCDN-Net:用于可回收城市废物管理的垃圾分类器深度神经网络。
Waste Manag. 2024 Feb 15;174:439-450. doi: 10.1016/j.wasman.2023.12.014. Epub 2023 Dec 19.
10
Waste image classification based on transfer learning and convolutional neural network.基于迁移学习和卷积神经网络的废图像分类。
Waste Manag. 2021 Nov;135:150-157. doi: 10.1016/j.wasman.2021.08.038. Epub 2021 Sep 8.