Suppr超能文献

使用OpenCL为英特尔FPGA实现一种运动估计算法。

Implementation of a motion estimation algorithm for Intel FPGAs using OpenCL.

作者信息

de Castro Manuel, Osorio Roberto R, Vilariño David L, Gonzalez-Escribano Arturo, Llanos Diego R

机构信息

Departamento de Informática, Universidad de Valladolid, Escuela de Ingeniería Informática, Campus Miguel Delibes, Paseo Belén 15, 47011 Valladolid, Spain.

CITIC, Computer Architecture Group, Universidade da Coruña, Campus de Eviña s/n, 15008 A Coruña, Spain.

出版信息

J Supercomput. 2023;79(9):9866-9888. doi: 10.1007/s11227-023-05051-3. Epub 2023 Jan 21.

Abstract

Motion Estimation is one of the main tasks behind any video encoder. It is a computationally costly task; therefore, it is usually delegated to specific or reconfigurable hardware, such as FPGAs. Over the years, multiple FPGA implementations have been developed, mainly using hardware description languages such as Verilog or VHDL. Since programming using hardware description languages is a complex task, it is desirable to use higher-level languages to develop FPGA applications.The aim of this work is to evaluate OpenCL, in terms of expressiveness, as a tool for developing this kind of FPGA applications. To do so, we present and evaluate a parallel implementation of the Block Matching Motion Estimation process using OpenCL for Intel FPGAs, usable and tested on an Intel Stratix 10 FPGA. The implementation efficiently processes Full HD frames completely inside the FPGA. In this work, we show the resource utilization when synthesizing the code on an Intel Stratix 10 FPGA, as well as a performance comparison with multiple CPU implementations with varying levels of optimization and vectorization capabilities. We also compare the proposed OpenCL implementation, in terms of resource utilization and performance, with estimations obtained from an equivalent VHDL implementation.

摘要

运动估计是任何视频编码器背后的主要任务之一。它是一项计算成本高昂的任务;因此,通常将其委托给特定的或可重构的硬件,如现场可编程门阵列(FPGA)。多年来,已经开发了多种FPGA实现方式,主要使用诸如Verilog或VHDL之类的硬件描述语言。由于使用硬件描述语言进行编程是一项复杂的任务,因此希望使用更高级别的语言来开发FPGA应用程序。这项工作的目的是评估OpenCL在表达能力方面作为开发此类FPGA应用程序的工具的情况。为此,我们展示并评估了使用OpenCL针对英特尔FPGA的块匹配运动估计过程的并行实现,该实现可在英特尔Stratix 10 FPGA上使用和测试。该实现在FPGA内部高效地处理全高清帧。在这项工作中,我们展示了在英特尔Stratix 10 FPGA上合成代码时的资源利用率,以及与具有不同优化和向量化能力的多个CPU实现的性能比较。我们还在资源利用率和性能方面,将所提出的OpenCL实现与从等效VHDL实现中获得的估计值进行比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3e3/9860217/9a5cae4d7c3a/11227_2023_5051_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验