Suppr超能文献

SPEC CPU2006 和 SPEC CPU2017 在英特尔至强 Skylake-SP 上的内存层次结构特征。

Memory hierarchy characterization of SPEC CPU2006 and SPEC CPU2017 on the Intel Xeon Skylake-SP.

机构信息

Departamento de Informática e Ingeniería de Sistemas - Aragón Institute for Engineering Research (I3A), Universidad de Zaragoza, Zaragoza, Spain.

出版信息

PLoS One. 2019 Aug 1;14(8):e0220135. doi: 10.1371/journal.pone.0220135. eCollection 2019.

Abstract

SPEC CPU is one of the most common benchmark suites used in computer architecture research. CPU2017 has recently been released to replace CPU2006. In this paper we present a detailed evaluation of the memory hierarchy performance for both the CPU2006 and single-threaded CPU2017 benchmarks. The experiments were executed on an Intel Xeon Skylake-SP, which is the first Intel processor to implement a mostly non-inclusive last-level cache (LLC). We present a classification of the benchmarks according to their memory pressure and analyze the performance impact of different LLC sizes. We also test all the hardware prefetchers showing they improve performance in most of the benchmarks. After comprehensive experimentation, we can highlight the following conclusions: i) almost half of SPEC CPU benchmarks have very low miss ratios in the second and third level caches, even with small LLC sizes and without hardware prefetching, ii) overall, the SPEC CPU2017 benchmarks demand even less memory hierarchy resources than the SPEC CPU2006 ones, iii) hardware prefetching is very effective in reducing LLC misses for most benchmarks, even with the smallest LLC size, and iv) from the memory hierarchy standpoint the methodologies commonly used to select benchmarks or simulation points do not guarantee representative workloads.

摘要

SPEC CPU 是计算机体系结构研究中最常用的基准套件之一。CPU2017 最近发布,以取代 CPU2006。在本文中,我们对 CPU2006 和单线程 CPU2017 基准测试的内存层次性能进行了详细评估。实验在英特尔至强 Skylake-SP 上执行,这是第一款实现大部分非包含性最后一级缓存(LLC)的英特尔处理器。我们根据内存压力对基准测试进行分类,并分析不同 LLC 大小对性能的影响。我们还测试了所有硬件预取器,发现它们在大多数基准测试中都能提高性能。经过全面的实验,我们可以得出以下结论:i)几乎一半的 SPEC CPU 基准测试在二级和三级缓存中的缺失率非常低,即使 LLC 尺寸较小且没有硬件预取,ii)总体而言,SPEC CPU2017 基准测试比 SPEC CPU2006 基准测试对内存层次资源的需求更少,iii)硬件预取对于大多数基准测试非常有效,可以减少 LLC 缺失,即使 LLC 尺寸最小,iv)从内存层次的角度来看,常用的选择基准测试或模拟点的方法并不能保证代表性的工作负载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/901b/6675054/69b177fb7c02/pone.0220135.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验