• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 OpenCL 加速的并行 Canny 边缘检测算法。

A parallel Canny edge detection algorithm based on OpenCL acceleration.

机构信息

College of Computer Engineering, Shangqiu Polytechnic, Shangqiu, China.

School of Civil and Architectural Engineering, Shandong University of Technology, Zibo, China.

出版信息

PLoS One. 2024 Jan 5;19(1):e0292345. doi: 10.1371/journal.pone.0292345. eCollection 2024.

DOI:10.1371/journal.pone.0292345
PMID:38180975
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10769061/
Abstract

In the process of Canny edge detection, a large number of high complexity calculations such as Gaussian filtering, gradient calculation, non-maximum suppression, and double threshold judgment need to be performed on the image, which takes up a lot of operation time, which is a great challenge to the real-time requirements of the algorithm. The traditional Canny edge detection technology mainly uses customized equipment such as DSP and FPGA, but it has some problems, such as long development cycle, difficult debugging, resource consumption, and so on. At the same time, the adopted CUDA platform has the problem of poor cross-platform. In order to solve this problem, a fine-grained parallel Canny edge detection method is proposed, which is optimized from three aspects: task partition, vector memory access, and NDRange optimization, and CPU-GPU collaborative parallelism is realized. At the same time, the parallel Canny edge detection methods based on multi-core CPU and CUDA architecture are designed. The experimental results show that OpenCL accelerated Canny edge detection algorithm (OCL_Canny) achieves 20.68 times acceleration ratio compared with CPU serial algorithm at 7452 × 8024 image resolution. At the image resolution of 3500 × 3500, the OCL_Canny algorithm achieves 3.96 times the acceleration ratio compared with the CPU multi-threaded Canny parallel algorithm. At 1024 × 1024 image resolution, the OCL_Canny algorithm achieves 1.21 times the acceleration ratio compared with the CUDA-based Canny parallel algorithm. The effectiveness and performance portability of the proposed Canny edge detection parallel algorithm are verified, and it provides a reference for the research of fast calculation of image big data.

摘要

在 Canny 边缘检测过程中,需要对图像进行大量的高复杂度计算,如高斯滤波、梯度计算、非极大值抑制和双阈值判断,这占用了大量的运算时间,对算法的实时性要求构成了巨大挑战。传统的 Canny 边缘检测技术主要采用 DSP 和 FPGA 等定制设备,但存在开发周期长、调试困难、资源消耗等问题。同时,采用的 CUDA 平台存在跨平台性差的问题。为了解决这个问题,提出了一种细粒度并行 Canny 边缘检测方法,从任务划分、向量内存访问和 NDRange 优化三个方面进行优化,实现了 CPU-GPU 协同并行。同时,设计了基于多核 CPU 和 CUDA 架构的并行 Canny 边缘检测方法。实验结果表明,在 7452×8024 图像分辨率下,与 CPU 串行算法相比,OpenCL 加速 Canny 边缘检测算法(OCL_Canny)的加速比达到 20.68 倍。在 3500×3500 的图像分辨率下,OCL_Canny 算法与 CPU 多线程 Canny 并行算法相比,加速比达到 3.96 倍。在 1024×1024 图像分辨率下,OCL_Canny 算法与基于 CUDA 的 Canny 并行算法相比,加速比达到 1.21 倍。验证了所提出的 Canny 边缘检测并行算法的有效性和性能可移植性,为图像大数据的快速计算研究提供了参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/d3fe03f0a65d/pone.0292345.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/5cf81b0befa8/pone.0292345.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/8d9348c97550/pone.0292345.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/713ec4e70cd9/pone.0292345.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/20912ed4c76a/pone.0292345.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/079665a5d3d2/pone.0292345.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/22abc3891922/pone.0292345.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/3d61c513a4bc/pone.0292345.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/bd1f6eeb14e9/pone.0292345.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/a9936b51a89c/pone.0292345.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/17a0cefa6165/pone.0292345.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/684e97febe71/pone.0292345.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/617d46215b40/pone.0292345.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/d3fe03f0a65d/pone.0292345.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/5cf81b0befa8/pone.0292345.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/8d9348c97550/pone.0292345.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/713ec4e70cd9/pone.0292345.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/20912ed4c76a/pone.0292345.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/079665a5d3d2/pone.0292345.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/22abc3891922/pone.0292345.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/3d61c513a4bc/pone.0292345.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/bd1f6eeb14e9/pone.0292345.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/a9936b51a89c/pone.0292345.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/17a0cefa6165/pone.0292345.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/684e97febe71/pone.0292345.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/617d46215b40/pone.0292345.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e611/10769061/d3fe03f0a65d/pone.0292345.g013.jpg

相似文献

1
A parallel Canny edge detection algorithm based on OpenCL acceleration.基于 OpenCL 加速的并行 Canny 边缘检测算法。
PLoS One. 2024 Jan 5;19(1):e0292345. doi: 10.1371/journal.pone.0292345. eCollection 2024.
2
Implementing a Parallel Image Edge Detection Algorithm Based on the Otsu-Canny Operator on the Hadoop Platform.基于 Otsu-Canny 算子的并行图像边缘检测算法在 Hadoop 平台上的实现。
Comput Intell Neurosci. 2018 May 13;2018:3598284. doi: 10.1155/2018/3598284. eCollection 2018.
3
A distributed Canny edge detector: algorithm and FPGA implementation.一种分布式 Canny 边缘检测器:算法与 FPGA 实现。
IEEE Trans Image Process. 2014 Jul;23(7):2944-60. doi: 10.1109/tip.2014.2311656.
4
Unsharp masking image enhancement the parallel algorithm based on cross-platform.基于跨平台的锐化掩模图像增强并行算法。
Sci Rep. 2022 Nov 23;12(1):20175. doi: 10.1038/s41598-022-21745-9.
5
Parallel Hough Transform-based straight line detection and its FPGA implementation in embedded vision.基于并行 Hough 变换的直线检测及其在嵌入式视觉中的 FPGA 实现。
Sensors (Basel). 2013 Jul 17;13(7):9223-47. doi: 10.3390/s130709223.
6
Ultrasound image edge detection based on a novel multiplicative gradient and Canny operator.基于新型乘法梯度和Canny算子的超声图像边缘检测
Ultrason Imaging. 2015 Jul;37(3):238-50. doi: 10.1177/0161734614554461. Epub 2014 Oct 13.
7
A heterogeneous computing accelerated SCE-UA global optimization method using OpenMP, OpenCL, CUDA, and OpenACC.一种使用OpenMP、OpenCL、CUDA和OpenACC的异构计算加速SCE-UA全局优化方法。
Water Sci Technol. 2017 Oct;76(7-8):1640-1651. doi: 10.2166/wst.2017.322.
8
MIMO Radar Parallel Simulation System Based on CPU/GPU Architecture.基于 CPU/GPU 架构的 MIMO 雷达并行仿真系统。
Sensors (Basel). 2022 Jan 5;22(1):396. doi: 10.3390/s22010396.
9
A fast forward projection using multithreads for multirays on GPUs in medical image reconstruction.基于 GPU 的医学图像重建中多线程快速前向投影的多射线算法。
Med Phys. 2011 Jul;38(7):4052-65. doi: 10.1118/1.3591994.
10
A Study on Weak Edge Detection of COVID-19's CT Images Based on Histogram Equalization and Improved Canny Algorithm.基于直方图均衡化和改进的 Canny 算法的 COVID-19 CT 图像弱边缘检测研究。
Comput Math Methods Med. 2021 Oct 28;2021:5208940. doi: 10.1155/2021/5208940. eCollection 2021.

引用本文的文献

1
Noise-Robust image edge detection based on multi-scale automatic anisotropic morphological Gaussian Kernels.基于多尺度自动各向异性形态学高斯核的抗噪声图像边缘检测
PLoS One. 2025 May 5;20(5):e0319852. doi: 10.1371/journal.pone.0319852. eCollection 2025.
2
Post-integration based point-line feature visual SLAM in low-texture environments.低纹理环境下基于积分后的点线特征视觉同步定位与地图构建
Sci Rep. 2025 Apr 26;15(1):14606. doi: 10.1038/s41598-025-97250-6.

本文引用的文献

1
A novel method for segmenting brain tumor using modified watershed algorithm in MRI image with FPGA.使用 FPGA 在 MRI 图像中使用改进的分水岭算法对脑肿瘤进行分割的新方法。
Biosystems. 2020 Dec;198:104226. doi: 10.1016/j.biosystems.2020.104226. Epub 2020 Aug 27.
2
OpenCLIPER: An OpenCL-Based C++ Framework for Overhead-Reduced Medical Image Processing and Reconstruction on Heterogeneous Devices.OpenCLIPER:一种基于 OpenCL 的 C++ 框架,用于异构设备上降低开销的医学图像处理和重建。
IEEE J Biomed Health Inform. 2019 Jul;23(4):1702-1709. doi: 10.1109/JBHI.2018.2869421. Epub 2018 Sep 10.
3
Implementing a Parallel Image Edge Detection Algorithm Based on the Otsu-Canny Operator on the Hadoop Platform.
基于 Otsu-Canny 算子的并行图像边缘检测算法在 Hadoop 平台上的实现。
Comput Intell Neurosci. 2018 May 13;2018:3598284. doi: 10.1155/2018/3598284. eCollection 2018.
4
A computational approach to edge detection.一种基于计算的边缘检测方法。
IEEE Trans Pattern Anal Mach Intell. 1986 Jun;8(6):679-98.