• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于加速状态空间模型推理的粒子马尔可夫链蒙特卡罗算法及架构。

Particle MCMC algorithms and architectures for accelerating inference in state-space models.

作者信息

Mingas Grigorios, Bottolo Leonardo, Bouganis Christos-Savvas

机构信息

Department of Electrical and Electronic Engineering, Imperial College London, London, SW7 2AZ, UK.

Department of Medical Genetics, University of Cambridge, Cambridge, CB2 0QQ, UK.

出版信息

Int J Approx Reason. 2017 Apr;83:413-433. doi: 10.1016/j.ijar.2016.10.011.

DOI:10.1016/j.ijar.2016.10.011
PMID:28373744
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5362159/
Abstract

Particle Markov Chain Monte Carlo (pMCMC) is a stochastic algorithm designed to generate samples from a probability distribution, when the density of the distribution does not admit a closed form expression. pMCMC is most commonly used to sample from the Bayesian posterior distribution in State-Space Models (SSMs), a class of probabilistic models used in numerous scientific applications. Nevertheless, this task is prohibitive when dealing with complex SSMs with massive data, due to the high computational cost of pMCMC and its poor performance when the posterior exhibits multi-modality. This paper aims to address both issues by: 1) Proposing a novel pMCMC algorithm (denoted ppMCMC), which uses multiple Markov chains (instead of the one used by pMCMC) to improve sampling efficiency for multi-modal posteriors, 2) Introducing custom, parallel hardware architectures, which are tailored for pMCMC and ppMCMC. The architectures are implemented on Field Programmable Gate Arrays (FPGAs), a type of hardware accelerator with massive parallelization capabilities. The new algorithm and the two FPGA architectures are evaluated using a large-scale case study from genetics. Results indicate that ppMCMC achieves 1.96x higher sampling efficiency than pMCMC when using sequential CPU implementations. The FPGA architecture of pMCMC is 12.1x and 10.1x faster than state-of-the-art, parallel CPU and GPU implementations of pMCMC and up to 53x more energy efficient; the FPGA architecture of ppMCMC increases these speedups to 34.9x and 41.8x respectively and is 173x more power efficient, bringing previously intractable SSM-based data analyses within reach.

摘要

粒子马尔可夫链蒙特卡罗(pMCMC)是一种随机算法,旨在当概率分布的密度没有封闭形式的表达式时,从该概率分布中生成样本。pMCMC最常用于从状态空间模型(SSM)中的贝叶斯后验分布进行采样,SSM是一类在众多科学应用中使用的概率模型。然而,在处理具有大量数据的复杂SSM时,由于pMCMC的计算成本高且在后验呈现多模态时性能不佳,这项任务变得非常困难。本文旨在通过以下方式解决这两个问题:1)提出一种新颖的pMCMC算法(表示为ppMCMC),它使用多个马尔可夫链(而不是pMCMC使用的单个马尔可夫链)来提高对多模态后验的采样效率;2)引入定制的并行硬件架构,这些架构是为pMCMC和ppMCMC量身定制的。这些架构在现场可编程门阵列(FPGA)上实现,FPGA是一种具有大规模并行化能力的硬件加速器。使用来自遗传学的大规模案例研究对新算法和两种FPGA架构进行了评估。结果表明,在使用顺序CPU实现时,ppMCMC的采样效率比pMCMC高1.96倍。pMCMC的FPGA架构比pMCMC的最新并行CPU和GPU实现快12.1倍和10.1倍,并且能源效率提高多达53倍;ppMCMC的FPGA架构分别将这些加速提高到34.9倍和41.8倍,并且能源效率提高173倍,使以前难以处理的基于SSM的数据分析变得可行。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/1c2446d51d44/gr014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/1719570ce1d7/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/4b3ec87258be/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/80c12f954e6a/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/8434d96cc0a1/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/04045032729c/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/01cd2ba019af/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/fa02bf85d7cc/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/95d560b041b0/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/6375ef553115/gr009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/bb535f7f3b05/gr010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/644af04455ad/gr011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/a005f9ee4e7b/gr012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/75b56438e630/gr013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/1c2446d51d44/gr014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/1719570ce1d7/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/4b3ec87258be/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/80c12f954e6a/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/8434d96cc0a1/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/04045032729c/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/01cd2ba019af/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/fa02bf85d7cc/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/95d560b041b0/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/6375ef553115/gr009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/bb535f7f3b05/gr010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/644af04455ad/gr011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/a005f9ee4e7b/gr012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/75b56438e630/gr013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac1/5362159/1c2446d51d44/gr014.jpg

相似文献

1
Particle MCMC algorithms and architectures for accelerating inference in state-space models.用于加速状态空间模型推理的粒子马尔可夫链蒙特卡罗算法及架构。
Int J Approx Reason. 2017 Apr;83:413-433. doi: 10.1016/j.ijar.2016.10.011.
2
Using model-based proposals for fast parameter inference on discrete state space, continuous-time Markov processes.使用基于模型的提议进行离散状态空间、连续时间马尔可夫过程的快速参数推断。
J R Soc Interface. 2015 Jun 6;12(107). doi: 10.1098/rsif.2015.0225.
3
Introduction to particle Markov-chain Monte Carlo for disease dynamics modellers.粒子马尔可夫链蒙特卡罗方法在疾病动力学建模中的应用简介。
Epidemics. 2019 Dec;29:100363. doi: 10.1016/j.epidem.2019.100363. Epub 2019 Oct 3.
4
Distributed large-scale graph processing on FPGAs.基于现场可编程门阵列(FPGA)的分布式大规模图形处理
J Big Data. 2023;10(1):95. doi: 10.1186/s40537-023-00756-x. Epub 2023 Jun 4.
5
Bayesian inference for the onset time and epidemiological characteristics of emerging infectious diseases.贝叶斯推断在新发传染病发病时间和流行病学特征中的应用。
Front Public Health. 2024 May 17;12:1406566. doi: 10.3389/fpubh.2024.1406566. eCollection 2024.
6
Acceleration of Deep Neural Network Training Using Field Programmable Gate Arrays.使用现场可编程门阵列加速深度神经网络训练。
Comput Intell Neurosci. 2022 Oct 17;2022:8387364. doi: 10.1155/2022/8387364. eCollection 2022.
7
GeCo: Classification Restricted Boltzmann Machine Hardware for On-Chip Semisupervised Learning and Bayesian Inference.GeCo:用于片上半监督学习和贝叶斯推理的分类受限玻尔兹曼机硬件。
IEEE Trans Neural Netw Learn Syst. 2020 Jan;31(1):53-65. doi: 10.1109/TNNLS.2019.2899386. Epub 2019 Mar 15.
8
Extending the BEAGLE library to a multi-FPGA platform.将 BEAGLE 库扩展到多 FPGA 平台。
BMC Bioinformatics. 2013 Jan 19;14:25. doi: 10.1186/1471-2105-14-25.
9
Field Programmable Gate Arrays for Enhancing the Speed and Energy Efficiency of Quantum Dynamics Simulations.用于提高量子动力学模拟速度和能源效率的现场可编程门阵列
J Chem Theory Comput. 2020 Apr 14;16(4):2085-2098. doi: 10.1021/acs.jctc.9b01284. Epub 2020 Mar 27.
10
FPGA-based hardware accelerator for SENSE (a parallel MR image reconstruction method).基于现场可编程门阵列的灵敏度编码(一种并行磁共振图像重建方法)硬件加速器
Comput Biol Med. 2020 Feb;117:103598. doi: 10.1016/j.compbiomed.2019.103598. Epub 2020 Jan 3.

引用本文的文献

1
Identifiability analysis for stochastic differential equation models in systems biology.系统生物学中随机微分方程模型的可识别性分析。
J R Soc Interface. 2020 Dec;17(173):20200652. doi: 10.1098/rsif.2020.0652. Epub 2020 Dec 16.

本文引用的文献

1
WGBSSuite: simulating whole-genome bisulphite sequencing data and benchmarking differential DNA methylation analysis tools.WGBSSuite:模拟全基因组亚硫酸氢盐测序数据并对差异DNA甲基化分析工具进行基准测试。
Bioinformatics. 2015 Jul 15;31(14):2371-3. doi: 10.1093/bioinformatics/btv114. Epub 2015 Mar 15.
2
Parallel tempering: theory, applications, and new perspectives.并行回火:理论、应用及新视角
Phys Chem Chem Phys. 2005 Dec 7;7(23):3910-6. doi: 10.1039/b509983h.