• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 StyleGAN-DSAD 的高质量煤炭异物图像生成方法。

High Quality Coal Foreign Object Image Generation Method Based on StyleGAN-DSAD.

机构信息

School of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an 710054, China.

Shaanxi Provincial Key Laboratory of Intelligent Testing of Mine Mechanical and Electrical Equipment, Xi'an 710054, China.

出版信息

Sensors (Basel). 2022 Dec 29;23(1):374. doi: 10.3390/s23010374.

DOI:10.3390/s23010374
PMID:36616972
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9823712/
Abstract

Research on coal foreign object detection based on deep learning is of great significance to safe, efficient, and green production of coal mines. However, the foreign object image dataset is scarce due to collection conditions, which brings an enormous challenge to coal foreign object detection. To achieve augmentation of foreign object datasets, a high-quality coal foreign object image generation method based on improved StyleGAN is proposed. Firstly, the dual self-attention module is introduced into the generator to strengthen the long-distance dependence of features between spatial and channel, refine the details of the generated images, accurately distinguish the front background information, and improve the quality of the generated images. Secondly, the depthwise separable convolution is introduced into the discriminator to solve the problem of low efficiency caused by the large number of parameters of multi-stage convolutional networks, to realize the lightweight model, and to accelerate the training speed. Experimental results show that the improved model has significant advantages over several classical GANS and original StyleGAN in terms of quality and diversity of the generated images, with an average improvement of 2.52 in IS and a decrease of 5.80 in FID for each category. As for the model complexity, the parameters and training time of the improved model are reduced to 44.6% and 58.8% of the original model without affecting the generated images quality. Finally, the results of applying different data augmentation methods to the foreign object detection task show that our image generation method is more effective than the traditional methods, and that, under the optimal conditions, it improves AP by 5.8% and AP by 4.5%.

摘要

基于深度学习的煤炭异物检测研究对煤矿的安全、高效、绿色生产具有重要意义。然而,由于采集条件的限制,异物图像数据集稀缺,这给煤炭异物检测带来了巨大的挑战。为了实现异物数据集的扩充,提出了一种基于改进 StyleGAN 的高质量煤炭异物图像生成方法。首先,在生成器中引入双自注意力模块,增强特征在空间和通道之间的长距离依赖关系,细化生成图像的细节,准确区分前后背景信息,提高生成图像的质量。其次,在鉴别器中引入深度可分离卷积,解决多阶段卷积网络参数较多导致的效率低下问题,实现轻量化模型,加快训练速度。实验结果表明,改进后的模型在生成图像的质量和多样性方面与几个经典的 GANs 和原始 StyleGAN 相比具有显著优势,每个类别的 IS 平均提高了 2.52,FID 降低了 5.80。就模型复杂度而言,改进后的模型的参数和训练时间分别减少到原始模型的 44.6%和 58.8%,而不影响生成图像的质量。最后,将不同的数据增强方法应用于异物检测任务的结果表明,我们的图像生成方法比传统方法更有效,在最优条件下,AP 提高了 5.8%,AP 提高了 4.5%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/dfb91aff3988/sensors-23-00374-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/3d5bea00de6b/sensors-23-00374-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/eca10fe1a423/sensors-23-00374-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/378c6b55d932/sensors-23-00374-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/66f00563164d/sensors-23-00374-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/bd4d2b254c4b/sensors-23-00374-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/7348ea98c5ab/sensors-23-00374-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a596f2c46f6d/sensors-23-00374-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/c040ff7cb769/sensors-23-00374-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/4cc15488bf9b/sensors-23-00374-g009a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/d86369f0a8ff/sensors-23-00374-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a27bd9541574/sensors-23-00374-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a23f258f710a/sensors-23-00374-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a6277fb1112d/sensors-23-00374-g013a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/d8dc70bf553c/sensors-23-00374-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/742df0a27771/sensors-23-00374-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/f1cfb18ff35d/sensors-23-00374-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/865316684add/sensors-23-00374-g017a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/dfb91aff3988/sensors-23-00374-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/3d5bea00de6b/sensors-23-00374-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/eca10fe1a423/sensors-23-00374-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/378c6b55d932/sensors-23-00374-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/66f00563164d/sensors-23-00374-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/bd4d2b254c4b/sensors-23-00374-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/7348ea98c5ab/sensors-23-00374-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a596f2c46f6d/sensors-23-00374-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/c040ff7cb769/sensors-23-00374-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/4cc15488bf9b/sensors-23-00374-g009a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/d86369f0a8ff/sensors-23-00374-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a27bd9541574/sensors-23-00374-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a23f258f710a/sensors-23-00374-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/a6277fb1112d/sensors-23-00374-g013a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/d8dc70bf553c/sensors-23-00374-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/742df0a27771/sensors-23-00374-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/f1cfb18ff35d/sensors-23-00374-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/865316684add/sensors-23-00374-g017a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f31a/9823712/dfb91aff3988/sensors-23-00374-g018.jpg

相似文献

1
High Quality Coal Foreign Object Image Generation Method Based on StyleGAN-DSAD.基于 StyleGAN-DSAD 的高质量煤炭异物图像生成方法。
Sensors (Basel). 2022 Dec 29;23(1):374. doi: 10.3390/s23010374.
2
A Faster and Lighter Detection Method for Foreign Objects in Coal Mine Belt Conveyors.一种用于煤矿带式输送机异物的更快、更轻量级检测方法。
Sensors (Basel). 2023 Jul 10;23(14):6276. doi: 10.3390/s23146276.
3
Foreign object detection in urban rail transit based on deep differentiation segmentation neural network.基于深度差分分割神经网络的城市轨道交通异物检测
Heliyon. 2024 Aug 28;10(17):e37072. doi: 10.1016/j.heliyon.2024.e37072. eCollection 2024 Sep 15.
4
High-Speed Railway Intruding Object Image Generating with Generative Adversarial Networks.基于生成对抗网络的高速铁路入侵物体图像生成
Sensors (Basel). 2019 Jul 11;19(14):3075. doi: 10.3390/s19143075.
5
A Swin transformer encoder-based StyleGAN for unbalanced endoscopic image enhancement.基于 Swin 变压器编码器的不平衡内窥镜图像增强 StyleGAN。
Comput Biol Med. 2024 Jun;175:108472. doi: 10.1016/j.compbiomed.2024.108472. Epub 2024 Apr 16.
6
A Lightweight Dangerous Liquid Detection Method Based on Depthwise Separable Convolution for X-Ray Security Inspection.基于深度可分离卷积的 X 射线安检轻量级危险品液体检测方法
Comput Intell Neurosci. 2022 Jan 18;2022:5371350. doi: 10.1155/2022/5371350. eCollection 2022.
7
Foreign Object Detection in Railway Images Based on an Efficient Two-Stage Convolutional Neural Network.基于高效两阶段卷积神经网络的铁路图像中的异物检测。
Comput Intell Neurosci. 2022 Aug 28;2022:3749635. doi: 10.1155/2022/3749635. eCollection 2022.
8
Coal Flow Foreign Body Classification Based on ESCBAM and Multi-Channel Feature Fusion.基于ESCBAM和多通道特征融合的煤流异物分类
Sensors (Basel). 2023 Jul 31;23(15):6831. doi: 10.3390/s23156831.
9
Improved YOLOv7 Network Model for Gangue Selection Robot for Gangue and Foreign Matter Detection in Coal.用于煤矸石和异物检测的矸石分选机器人的改进型 YOLOv7 网络模型。
Sensors (Basel). 2023 May 28;23(11):5140. doi: 10.3390/s23115140.
10
Facial expression recognition based on improved depthwise separable convolutional network.基于改进的深度可分离卷积网络的面部表情识别
Multimed Tools Appl. 2023;82(12):18635-18652. doi: 10.1007/s11042-022-14066-6. Epub 2022 Nov 23.

引用本文的文献

1
Multitask allocation model of the coal foreign object sorting robot based on optimal capacity and benefit.基于最优产能与效益的煤炭异物分拣机器人多任务分配模型
Sci Rep. 2025 Aug 11;15(1):29294. doi: 10.1038/s41598-025-14314-3.
2
RRBM-YOLO: Research on Efficient and Lightweight Convolutional Neural Networks for Underground Coal Gangue Identification.RRBM-YOLO:用于井下煤矸石识别的高效轻量级卷积神经网络研究
Sensors (Basel). 2024 Oct 29;24(21):6943. doi: 10.3390/s24216943.
3
Coal Flow Foreign Body Classification Based on ESCBAM and Multi-Channel Feature Fusion.

本文引用的文献

1
Defect Detection of MEMS Based on Data Augmentation, WGAN-DIV-DC, and a YOLOv5 Model.基于数据增强、WGAN-DIV-DC 和 YOLOv5 模型的 MEMS 缺陷检测。
Sensors (Basel). 2022 Dec 2;22(23):9400. doi: 10.3390/s22239400.
2
Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强
J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.
基于ESCBAM和多通道特征融合的煤流异物分类
Sensors (Basel). 2023 Jul 31;23(15):6831. doi: 10.3390/s23156831.