Lee Anthony, Yau Christopher, Giles Michael B, Doucet Arnaud, Holmes Christopher C
Oxford-Man Institute, Eagle House, Walton Well Road, Oxford OX2 6ED, UK.
J Comput Graph Stat. 2010 Dec 1;19(4):769-789. doi: 10.1198/jcgs.2010.10039.
We present a case-study on the utility of graphics cards to perform massively parallel simulation of advanced Monte Carlo methods. Graphics cards, containing multiple Graphics Processing Units (GPUs), are self-contained parallel computational devices that can be housed in conventional desktop and laptop computers and can be thought of as prototypes of the next generation of many-core processors. For certain classes of population-based Monte Carlo algorithms they offer massively parallel simulation, with the added advantage over conventional distributed multi-core processors that they are cheap, easily accessible, easy to maintain, easy to code, dedicated local devices with low power consumption. On a canonical set of stochastic simulation examples including population-based Markov chain Monte Carlo methods and Sequential Monte Carlo methods, we nd speedups from 35 to 500 fold over conventional single-threaded computer code. Our findings suggest that GPUs have the potential to facilitate the growth of statistical modelling into complex data rich domains through the availability of cheap and accessible many-core computation. We believe the speedup we observe should motivate wider use of parallelizable simulation methods and greater methodological attention to their design.
我们展示了一个关于利用图形卡执行高级蒙特卡罗方法大规模并行模拟的案例研究。图形卡包含多个图形处理单元(GPU),是自成一体的并行计算设备,可安装在传统台式机和笔记本电脑中,可被视为下一代多核处理器的原型。对于某些基于群体的蒙特卡罗算法类别,它们提供大规模并行模拟,相较于传统分布式多核处理器,具有价格便宜、易于获取、易于维护、易于编码、功耗低的专用本地设备等额外优势。在一组典型的随机模拟示例上,包括基于群体的马尔可夫链蒙特卡罗方法和序贯蒙特卡罗方法,我们发现相较于传统单线程计算机代码,加速比达到了35至500倍。我们的研究结果表明,通过提供廉价且易于获取的多核计算,GPU有潜力促进统计建模在复杂数据丰富领域的发展。我们认为我们观察到的加速比应促使更广泛地使用可并行化模拟方法,并在方法设计上给予更多关注。