Suppr超能文献

通过混合模态融合优化骨干网络:一种垃圾分类的新策略。

Optimizing Backbone Networks Through Hybrid-Modal Fusion: A New Strategy for Waste Classification.

作者信息

Zhou Houkui, Ding Qifeng, Chen Chang, Liao Qinqin, Wang Qun, Yu Huimin, Hu Haoji, Zhang Guangqun, Hu Junguo, He Tao

机构信息

College of Mathematics and Computer Science, Zhejiang A & F University, Hangzhou 311300, China.

Zhejiang Provincial Key Laboratory of Forestry Intelligent Monitoring and Information Technology, Hangzhou 311300, China.

出版信息

Sensors (Basel). 2025 May 21;25(10):3241. doi: 10.3390/s25103241.

Abstract

With rapid urbanization, effective waste classification is a critical challenge. Traditional manual methods are time-consuming, labor-intensive, costly, and error-prone, resulting in reduced accuracy. Deep learning has revolutionized this field. Convolutional neural networks such as VGG and ResNet have dramatically improved automated sorting efficiency, and Transformer architectures like the Swin Transformer have further enhanced performance and adaptability in complex sorting scenarios. However, these approaches still struggle in complex environments and with diverse waste types, often suffering from limited recognition accuracy, poor generalization, or prohibitive computational demands. To overcome these challenges, we propose an efficient hybrid-modal fusion method, the Hybrid-modal Fusion Waste Classification Network (HFWC-Net), for precise waste image classification. HFWC-Net leverages a Transformer-based hierarchical architecture that integrates CNNs and Transformers, enhancing feature capture and fusion across varied image types for superior scalability and flexibility. By incorporating advanced techniques such as the Agent Attention mechanism and the LionBatch optimization strategy, HFWC-Net not only improves classification accuracy but also significantly reduces classification time. Comparative experimental results on the public datasets Garbage Classification, TrashNet, and our self-built MixTrash dataset demonstrate that HFWC-Net achieves Top-1 accuracy rates of 98.89%, 96.88%, and 94.35%, respectively. These findings indicate that HFWC-Net attains the highest accuracy among current methods, offering significant advantages in accelerating classification efficiency and supporting automated waste management applications.

摘要

随着城市化的快速发展,有效的垃圾分类是一项严峻的挑战。传统的人工方法耗时、费力、成本高且容易出错,导致准确率降低。深度学习给这个领域带来了变革。诸如VGG和ResNet等卷积神经网络极大地提高了自动分类效率,而像Swin Transformer这样的Transformer架构在复杂分类场景中进一步提升了性能和适应性。然而,这些方法在复杂环境和面对多样的垃圾类型时仍然存在困难,常常面临识别准确率有限、泛化能力差或计算需求过高的问题。为了克服这些挑战,我们提出了一种高效的混合模态融合方法,即混合模态融合垃圾分类网络(HFWC-Net),用于精确的垃圾图像分类。HFWC-Net利用基于Transformer的分层架构,将卷积神经网络和Transformer集成在一起,增强了对各种图像类型的特征捕捉和融合,具有卓越的可扩展性和灵活性。通过融入智能注意力机制和LionBatch优化策略等先进技术,HFWC-Net不仅提高了分类准确率,还显著缩短了分类时间。在公共数据集垃圾分类、TrashNet以及我们自建的MixTrash数据集上的对比实验结果表明,HFWC-Net的Top-1准确率分别达到了98.89%、96.88%和94.35%。这些结果表明,HFWC-Net在当前方法中达到了最高的准确率,在加快分类效率和支持自动化垃圾管理应用方面具有显著优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08b4/12115457/40f6d76acb1a/sensors-25-03241-g001.jpg

相似文献

1
2
Enhanced Pneumonia Detection in Chest X-Rays Using Hybrid Convolutional and Vision Transformer Networks.
Curr Med Imaging. 2025;21:e15734056326685. doi: 10.2174/0115734056326685250101113959.
3
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
4
GCDN-Net: Garbage classifier deep neural network for recyclable urban waste management.
Waste Manag. 2024 Feb 15;174:439-450. doi: 10.1016/j.wasman.2023.12.014. Epub 2023 Dec 19.
6
BiU-net: A dual-branch structure based on two-stage fusion strategy for biomedical image segmentation.
Comput Methods Programs Biomed. 2024 Jul;252:108235. doi: 10.1016/j.cmpb.2024.108235. Epub 2024 May 18.
7
Lightweight hybrid transformers-based dyslexia detection using cross-modality data.
Sci Rep. 2025 May 16;15(1):17054. doi: 10.1038/s41598-025-01235-4.
8
SwinConvNeXt: a fused deep learning architecture for Real-time garbage image classification.
Sci Rep. 2025 Mar 7;15(1):7995. doi: 10.1038/s41598-025-91302-7.
10
Improved deep learning image classification algorithm based on Swin Transformer V2.
PeerJ Comput Sci. 2023 Oct 30;9:e1665. doi: 10.7717/peerj-cs.1665. eCollection 2023.

本文引用的文献

1
GCDN-Net: Garbage classifier deep neural network for recyclable urban waste management.
Waste Manag. 2024 Feb 15;174:439-450. doi: 10.1016/j.wasman.2023.12.014. Epub 2023 Dec 19.
3
MSWNet: A visual deep machine learning method adopting transfer learning based upon ResNet 50 for municipal solid waste sorting.
Front Environ Sci Eng. 2023;17(6):77. doi: 10.1007/s11783-023-1677-1. Epub 2023 Jan 1.
4
Waste image classification based on transfer learning and convolutional neural network.
Waste Manag. 2021 Nov;135:150-157. doi: 10.1016/j.wasman.2021.08.038. Epub 2021 Sep 8.
5
Spillover of different regulatory policies for waste sorting: Potential influence on energy-saving policy acceptability.
Waste Manag. 2021 Apr 15;125:112-121. doi: 10.1016/j.wasman.2021.02.008. Epub 2021 Mar 5.
6
Application of machine learning methods for the prediction of organic solid waste treatment and recycling processes: A review.
Bioresour Technol. 2021 Jan;319:124114. doi: 10.1016/j.biortech.2020.124114. Epub 2020 Sep 11.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验