• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于p比特并结合实际器件变异性建模的GPU加速模拟退火算法。

GPU-accelerated simulated annealing based on p-bits with real-world device-variability modeling.

作者信息

Onizawa Naoya, Hanyu Takahiro

机构信息

Research Institute of Electrical Communication, Tohoku University, Sendai, 980-8577, Japan.

出版信息

Sci Rep. 2025 Feb 19;15(1):6118. doi: 10.1038/s41598-025-90520-3.

DOI:10.1038/s41598-025-90520-3
PMID:39972041
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11840000/
Abstract

Probabilistic computing using probabilistic bits (p-bits) presents an efficient alternative to traditional CMOS logic for complex problem-solving, including simulated annealing and machine learning. Realizing p-bits with emerging devices such as magnetic tunnel junctions introduces device variability, which was expected to negatively impact computational performance. However, this study reveals an unexpected finding: device variability can not only degrade but also enhance algorithm performance, particularly by leveraging timing variability. This paper introduces a GPU-accelerated, open-source simulated annealing framework based on p-bits that models key device variability factors-timing, intensity, and offset-to reflect real-world device behavior. Through CUDA-based simulations, our approach achieves a two-order magnitude speedup over CPU implementations on the MAX-CUT benchmark with problem sizes ranging from 800 to 20,000 nodes. By providing a scalable and accessible tool, this framework aims to advance research in probabilistic computing, enabling optimization applications in diverse fields.

摘要

使用概率比特(p比特)的概率计算为解决复杂问题(包括模拟退火和机器学习)提供了一种高效的替代传统CMOS逻辑的方法。利用诸如磁性隧道结等新兴器件实现p比特会引入器件变异性,这原本预计会对计算性能产生负面影响。然而,这项研究揭示了一个意想不到的发现:器件变异性不仅会降低算法性能,还可能增强算法性能,特别是通过利用时序变异性。本文介绍了一种基于p比特的GPU加速开源模拟退火框架,该框架对关键器件变异性因素——时序、强度和偏移进行建模,以反映实际器件行为。通过基于CUDA的模拟,我们的方法在MAX-CUT基准测试中,对于节点数从800到20000的问题规模,比CPU实现方式实现了两个数量级的加速。通过提供一个可扩展且易于使用的工具,该框架旨在推动概率计算研究,实现不同领域的优化应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/fde6af62c7c7/41598_2025_90520_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/56f15ca05f18/41598_2025_90520_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/1f3d9fdc8c5d/41598_2025_90520_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/951ab4e84efa/41598_2025_90520_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/d7ed08d491a2/41598_2025_90520_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/99747bf3f2e9/41598_2025_90520_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/96f94a251bea/41598_2025_90520_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/fde6af62c7c7/41598_2025_90520_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/56f15ca05f18/41598_2025_90520_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/1f3d9fdc8c5d/41598_2025_90520_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/951ab4e84efa/41598_2025_90520_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/d7ed08d491a2/41598_2025_90520_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/99747bf3f2e9/41598_2025_90520_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/96f94a251bea/41598_2025_90520_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bea/11840000/fde6af62c7c7/41598_2025_90520_Fig6_HTML.jpg

相似文献

1
GPU-accelerated simulated annealing based on p-bits with real-world device-variability modeling.基于p比特并结合实际器件变异性建模的GPU加速模拟退火算法。
Sci Rep. 2025 Feb 19;15(1):6118. doi: 10.1038/s41598-025-90520-3.
2
Fast-Converging Simulated Annealing for Ising Models Based on Integral Stochastic Computing.基于积分随机计算的伊辛模型快速收敛模拟退火算法
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10999-11005. doi: 10.1109/TNNLS.2022.3159713. Epub 2023 Nov 30.
3
CMOS plus stochastic nanomagnets enabling heterogeneous computers for probabilistic inference and learning.互补金属氧化物半导体(CMOS)与随机纳米磁体相结合,助力异构计算机实现概率推理与学习。
Nat Commun. 2024 Mar 27;15(1):2685. doi: 10.1038/s41467-024-46645-6.
4
CMOS Single-Photon Avalanche Diode Circuits for Probabilistic Computing.用于概率计算的互补金属氧化物半导体单光子雪崩二极管电路
IEEE J Explor Solid State Comput Devices Circuits. 2024;10:49-57. doi: 10.1109/jxcdc.2024.3452030. Epub 2024 Aug 29.
5
Accelerating Spatial Cross-Matching on CPU-GPU Hybrid Platform With CUDA and OpenACC.利用CUDA和OpenACC在CPU-GPU混合平台上加速空间交叉匹配
Front Big Data. 2020 May;3. doi: 10.3389/fdata.2020.00014. Epub 2020 May 8.
6
Correlation free large-scale probabilistic computing using a true-random chaotic oscillator p-bit.使用真随机混沌振荡器p比特的无相关性大规模概率计算。
Sci Rep. 2025 Mar 7;15(1):8018. doi: 10.1038/s41598-025-93218-8.
7
Integer factorization using stochastic magnetic tunnel junctions.使用随机磁隧道结进行整数分解。
Nature. 2019 Sep;573(7774):390-393. doi: 10.1038/s41586-019-1557-9. Epub 2019 Sep 18.
8
A nonvoxel-based dose convolution/superposition algorithm optimized for scalable GPU architectures.一种针对可扩展GPU架构进行优化的基于非体素的剂量卷积/叠加算法。
Med Phys. 2014 Oct;41(10):101711. doi: 10.1118/1.4895822.
9
High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.用于可变形图像配准的高性能计算:迈向自适应放射治疗的新范式。
Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.
10
A clinically viable vendor-independent and device-agnostic solution for accelerated cardiac MRI reconstruction.一种临床可行的、与供应商无关且与设备无关的加速心脏 MRI 重建解决方案。
Comput Methods Programs Biomed. 2021 Aug;207:106143. doi: 10.1016/j.cmpb.2021.106143. Epub 2021 May 5.

本文引用的文献

1
Experimental demonstration of an on-chip p-bit core based on stochastic magnetic tunnel junctions and 2D MoS transistors.基于随机磁隧道结和二维钼化钼晶体管的片上p比特内核的实验演示。
Nat Commun. 2024 May 15;15(1):4098. doi: 10.1038/s41467-024-48152-0.
2
Enhanced convergence in p-bit based simulated annealing with partial deactivation for large-scale combinatorial optimization problems.用于大规模组合优化问题的基于p位的模拟退火算法中通过部分停用实现的增强收敛性。
Sci Rep. 2024 Jan 16;14(1):1339. doi: 10.1038/s41598-024-51639-x.
3
Quantum annealing for industry applications: introduction and review.
面向工业应用的量子退火:介绍与综述
Rep Prog Phys. 2022 Sep 21;85(10). doi: 10.1088/1361-6633/ac8c54.
4
Fast-Converging Simulated Annealing for Ising Models Based on Integral Stochastic Computing.基于积分随机计算的伊辛模型快速收敛模拟退火算法
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10999-11005. doi: 10.1109/TNNLS.2022.3159713. Epub 2023 Nov 30.
5
Nanosecond Random Telegraph Noise in In-Plane Magnetic Tunnel Junctions.面内磁隧道结中的纳秒随机电报噪声
Phys Rev Lett. 2021 Mar 19;126(11):117202. doi: 10.1103/PhysRevLett.126.117202.
6
Demonstration of Nanosecond Operation in Stochastic Magnetic Tunnel Junctions.随机磁隧道结中的纳秒级操作演示。
Nano Lett. 2021 Mar 10;21(5):2040-2045. doi: 10.1021/acs.nanolett.0c04652. Epub 2021 Feb 25.
7
Optimizing machine learning models for granular NdFeB magnets by very fast simulated annealing.通过极快速模拟退火优化用于颗粒钕铁硼磁体的机器学习模型。
Sci Rep. 2021 Feb 15;11(1):3792. doi: 10.1038/s41598-021-83315-9.
8
Integer factorization using stochastic magnetic tunnel junctions.使用随机磁隧道结进行整数分解。
Nature. 2019 Sep;573(7774):390-393. doi: 10.1038/s41586-019-1557-9. Epub 2019 Sep 18.
9
Combinatorial optimization by simulating adiabatic bifurcations in nonlinear Hamiltonian systems.通过模拟非线性哈密顿系统中的绝热分岔进行组合优化。
Sci Adv. 2019 Apr 19;5(4):eaav2372. doi: 10.1126/sciadv.aav2372. eCollection 2019 Apr.
10
Weighted p -Bits for FPGA Implementation of Probabilistic Circuits.用于概率电路FPGA实现的加权p比特
IEEE Trans Neural Netw Learn Syst. 2019 Jun;30(6):1920-1926. doi: 10.1109/TNNLS.2018.2874565. Epub 2018 Oct 30.