Towards Program Optimization through Automated Analysis of Numerical Precision.

Author information

Linderman Michael D, Ho Matthew, Dill David L, Meng Teresa H, Nolan Garry P

Affiliations

Computer Systems Laboratory, Stanford University, Stanford, CA, USA.

Microbiology & Immunology, Stanford University, Stanford, CA, USA.

Publication information

Proc CGO. 2010 Apr;2010:230-237. doi: 10.1145/1772954.1772987.

Abstract

Reducing the arithmetic precision of a computation has real performance implications, including increased speed, decreased power consumption, and a smaller memory footprint. For some architectures, e.g., GPUs, there can be such a large performance difference that using reduced precision is effectively a requirement. The tradeoff is that the accuracy of the computation will be compromised. In this paper we describe a proof assistant and associated static analysis techniques for efficiently bounding numerical and precision-related errors. The programmer/compiler can use these bounds to numerically verify and optimize an application for different input and machine configurations. We present several case study applications that demonstrate the effectiveness of these techniques and the performance benefits that can be achieved with rigorous precision analysis.
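As an illustration of what bounding a precision-related error can look like, the sketch below compares a single-precision summation against a textbook a priori error bound. It is not the proof assistant or the static analysis described in the abstract; it uses the standard forward error bound for left-to-right summation, |fl(S) - S| <= gamma_(n-1) * sum(|x_i|) with gamma_k = k*u / (1 - k*u) and u = 2^-24 for IEEE 754 binary32. The helper names (`to_f32`, `sum_f32`, `summation_error_bound`) and the test data are assumptions made for this example.

```python
import struct
from fractions import Fraction

# Unit roundoff of IEEE 754 binary32 (round to nearest).
U32 = 2.0 ** -24


def to_f32(x: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def sum_f32(values):
    """Left-to-right summation with every partial sum rounded to binary32."""
    total = 0.0
    for v in values:
        total = to_f32(total + v)
    return total


def summation_error_bound(values, unit_roundoff=U32):
    """Textbook a priori bound gamma_(n-1) * sum(|x_i|) for recursive summation."""
    k = len(values) - 1  # number of additions that can incur a rounding error
    gamma = k * unit_roundoff / (1 - k * unit_roundoff)
    return gamma * sum(abs(v) for v in values)


# Hypothetical input data, rounded to binary32 so the inputs are exactly representable.
values = [to_f32(0.1 * i) for i in range(1, 1001)]

# Exact reference sum using rational arithmetic (no rounding at all).
exact = sum((Fraction(v) for v in values), Fraction(0))

approx = sum_f32(values)
actual_error = float(abs(Fraction(approx) - exact))
bound = summation_error_bound(values)

print(f"binary32 sum : {approx!r}")
print(f"actual error : {actual_error:.3e}")
print(f"a priori bound: {bound:.3e}")
```

A bound of this kind tends to be pessimistic, since it assumes every rounding error takes its worst-case value. If the bound is still within the accuracy an application needs, the reduced-precision version can be used with confidence; the tools described in the abstract aim to derive such bounds rigorously for whole programs.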
