• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

速度、能量和面积优化的早期输出准延迟不敏感阵列乘法器。

Speed, energy and area optimized early output quasi-delay-insensitive array multipliers.

机构信息

School of Computer Science and Engineering, Nanyang Technological University, Singapore.

Department of Industrial Engineering, Technical University of Sofia, Sofia, Bulgaria.

出版信息

PLoS One. 2020 Feb 3;15(2):e0228343. doi: 10.1371/journal.pone.0228343. eCollection 2020.

DOI:10.1371/journal.pone.0228343
PMID:32012180
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6996843/
Abstract

Multiplication is a widely used arithmetic operation that is frequently encountered in micro-processing and digital signal processing. Multiplication is implemented using a multiplier, and recently, QDI asynchronous array multipliers were presented in the literature utilizing delay-insensitive double-rail data encoding and four-phase return-to-zero (RTZ) handshaking and four-phase return-to-one (RTO) handshaking. In this context, this article makes two contributions: (i) the design of a new asynchronous partial product generator, and (ii) the design of a new asynchronous half adder. We analyze the usefulness of the proposed partial product generator and the proposed half adder to efficiently realize QDI array multipliers. When the new partial product generator and half adder are used along with our indicating full adder, significant reductions are achieved in the design metrics compared to the optimum QDI array multiplier reported in the literature. The cycle time is reduced by 17%, the area is reduced by 16.1%, the power is reduced by 15.3%, and the product of power and cycle time is reduced by 29.6% with respect to RTZ handshaking. On the other hand, the cycle time is reduced by 13%, the area is reduced by 16.1%, the power is reduced by 15.2%, and the product of power and cycle time is reduced by 26.1% with respect to RTO handshaking. Further, the RTO handshaking is found to be preferable to RTZ handshaking to achieve slightly improved optimizations in the design metrics. The QDI array multipliers were realized using a 32/28nm complementary metal oxide semiconductor (CMOS) process technology.

摘要

乘法是一种广泛使用的算术运算,在微处理和数字信号处理中经常遇到。乘法是通过乘法器来实现的,最近,文献中提出了 QDI 异步阵列乘法器,它利用了延迟不敏感的双轨数据编码和四相归零(RTZ)握手以及四相返回一(RTO)握手。在这种情况下,本文做出了两个贡献:(i)设计了一种新的异步部分积生成器,(ii)设计了一种新的异步半加器。我们分析了所提出的部分积生成器和所提出的半加器在有效地实现 QDI 阵列乘法器方面的有用性。当使用新的部分积生成器和半加器以及我们的指示全加器时,与文献中报告的最佳 QDI 阵列乘法器相比,设计指标有了显著的降低。与 RTZ 握手相比,时钟周期减少了 17%,面积减少了 16.1%,功率减少了 15.3%,功率和时钟周期的乘积减少了 29.6%。另一方面,与 RTO 握手相比,时钟周期减少了 13%,面积减少了 16.1%,功率减少了 15.2%,功率和时钟周期的乘积减少了 26.1%。此外,发现 RTO 握手比 RTZ 握手更有利于实现设计指标的略微改进。使用 32/28nm 互补金属氧化物半导体(CMOS)工艺技术实现了 QDI 阵列乘法器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/b19be946d17f/pone.0228343.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/7a692726b82b/pone.0228343.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/db7bb6c7e2ba/pone.0228343.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/5e95a17a43d3/pone.0228343.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/c7ea36055588/pone.0228343.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/932324969c33/pone.0228343.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/3218b5fef574/pone.0228343.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/31947b2cbb6c/pone.0228343.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/4d4e7135d6f1/pone.0228343.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/b19be946d17f/pone.0228343.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/7a692726b82b/pone.0228343.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/db7bb6c7e2ba/pone.0228343.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/5e95a17a43d3/pone.0228343.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/c7ea36055588/pone.0228343.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/932324969c33/pone.0228343.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/3218b5fef574/pone.0228343.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/31947b2cbb6c/pone.0228343.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/4d4e7135d6f1/pone.0228343.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22f/6996843/b19be946d17f/pone.0228343.g009.jpg

相似文献

1
Speed, energy and area optimized early output quasi-delay-insensitive array multipliers.速度、能量和面积优化的早期输出准延迟不敏感阵列乘法器。
PLoS One. 2020 Feb 3;15(2):e0228343. doi: 10.1371/journal.pone.0228343. eCollection 2020.
2
Speed and energy optimized quasi-delay-insensitive block carry lookahead adder.速度和能量优化的准延迟不敏感块进位先行加法器。
PLoS One. 2019 Jun 21;14(6):e0218347. doi: 10.1371/journal.pone.0218347. eCollection 2019.
3
Area/latency optimized early output asynchronous full adders and relative-timed ripple carry adders.面积/延迟优化的早期输出异步全加器和相对定时行波进位加法器。
Springerplus. 2016 Apr 12;5:440. doi: 10.1186/s40064-016-2074-z. eCollection 2016.
4
An Improved VLSI Design of the ALU Based FIR Filter for Biomedical Image Filtering Application.基于 ALU 的 FIR 滤波器的改进的 VLSI 设计用于生物医学图像处理应用。
Curr Med Imaging. 2021;17(2):276-287. doi: 10.2174/1573405616999200817101950.
5
High-speed and energy-efficient asynchronous carry look-ahead adder.高速且节能的异步超前进位加法器。
PLoS One. 2023 Oct 5;18(10):e0289569. doi: 10.1371/journal.pone.0289569. eCollection 2023.
6
Asynchronous quasi delay insensitive majority voters corresponding to quintuple modular redundancy for mission/safety-critical applications.用于任务/安全关键应用的五倍模冗余异步准延迟不敏感多数投票器。
PLoS One. 2020 Sep 22;15(9):e0239395. doi: 10.1371/journal.pone.0239395. eCollection 2020.
7
Design of efficient binary multiplier architecture using hybrid compressor with FPGA implementation.基于现场可编程门阵列(FPGA)实现的采用混合压缩器的高效二进制乘法器架构设计
Sci Rep. 2024 Apr 11;14(1):8492. doi: 10.1038/s41598-024-58482-0.
8
An Asynchronous Low Power and High Performance VLSI Architecture for Viterbi Decoder Implemented with Quasi Delay Insensitive Templates.一种采用准延迟不敏感模板实现的用于维特比译码器的异步低功耗高性能超大规模集成电路架构。
ScientificWorldJournal. 2015;2015:621012. doi: 10.1155/2015/621012. Epub 2015 Oct 7.
9
Power-Intent Systolic Array Using Modified Parallel Multiplier for Machine Learning Acceleration.基于改进并行乘法器的用于机器学习加速的功率意识脉动阵列。
Sensors (Basel). 2023 Apr 26;23(9):4297. doi: 10.3390/s23094297.
10
A modular technique of Booth encoding and Vedic multiplier for low-area and high-speed applications.一种用于低面积和高速应用的布斯编码与吠陀乘法器的模块化技术。
Sci Rep. 2023 Dec 16;13(1):22379. doi: 10.1038/s41598-023-49913-5.

引用本文的文献

1
Asynchronous quasi delay insensitive majority voters corresponding to quintuple modular redundancy for mission/safety-critical applications.用于任务/安全关键应用的五倍模冗余异步准延迟不敏感多数投票器。
PLoS One. 2020 Sep 22;15(9):e0239395. doi: 10.1371/journal.pone.0239395. eCollection 2020.

本文引用的文献

1
Speed and energy optimized quasi-delay-insensitive block carry lookahead adder.速度和能量优化的准延迟不敏感块进位先行加法器。
PLoS One. 2019 Jun 21;14(6):e0218347. doi: 10.1371/journal.pone.0218347. eCollection 2019.