• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于决策的忆阻贝尔曼求解器

Memristive Bellman solver for decision-making.

作者信息

Feng Zhe, Wu Zuheng, Zou Jianxun, Cheng Lingli, Zhao Xiaolong, Zhang Xumeng, Lu Jian, Wang Cong, Wang Yilin, Wang Haochen, Guo Wenbin, Qian Zhibin, Zhu Yunlai, Xu Zuyu, Dai Yuehua, Liu Qi

机构信息

School of Integrated Circuits, Anhui University, Hefei, Anhui, China.

Frontier Institute of Chip and System, Fudan University, Shanghai, China.

出版信息

Nat Commun. 2025 May 27;16(1):4925. doi: 10.1038/s41467-025-60085-w.

DOI:10.1038/s41467-025-60085-w
PMID:40425569
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12117070/
Abstract

The Bellman equation, with a resource-consuming solving process, plays a fundamental role in formulating and solving dynamic optimization problems. The realization of the Bellman solver with memristive computing-in-memory (MCIM) technology, is significant for implementing efficient dynamic decision-making. However, the iterative nature of the Bellman equation solving process poses a challenge for efficient implementation on MCIM systems, which excel at vector-matrix multiplication (VMM) operations but are less suited for iterative algorithms. In this work, by incorporating the temporal dimension and transforming the solution into recurrent dot product operations, a memristive Bellman solver (MBS) is proposed, facilitating the implementation of the Bellman equation solving process with efficient MCIM technology. The MBS effectively reduces the iteration numbers and which further enhanced by approximated solutions leveraging memristor noise. Finally, the path planning tasks are used to verify the feasibility of the proposed MBS. The theoretical derivation and experimental results demonstrate that the MBS effectively reduces the iteration cycles, facilitating the solving efficiency. This work could be a sound of choice for developing high-efficiency decision-making systems.

摘要

贝尔曼方程在动态优化问题的制定和求解中起着基础性作用,但其求解过程会消耗资源。利用忆阻存内计算(MCIM)技术实现贝尔曼求解器,对于高效动态决策的实现具有重要意义。然而,贝尔曼方程求解过程的迭代特性给在MCIM系统上的高效实现带来了挑战,MCIM系统擅长向量矩阵乘法(VMM)运算,但不太适合迭代算法。在这项工作中,通过纳入时间维度并将解转换为循环点积运算,提出了一种忆阻贝尔曼求解器(MBS),便于利用高效的MCIM技术实现贝尔曼方程求解过程。MBS有效地减少了迭代次数,并且通过利用忆阻器噪声的近似解进一步增强。最后,使用路径规划任务来验证所提出的MBS的可行性。理论推导和实验结果表明,MBS有效地减少了迭代周期,提高了求解效率。这项工作可能是开发高效决策系统的一个不错选择。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/11d699ce69b3/41467_2025_60085_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/d39774b590b3/41467_2025_60085_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/0ddf9db90dfe/41467_2025_60085_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/6176af4edc1e/41467_2025_60085_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/11d699ce69b3/41467_2025_60085_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/d39774b590b3/41467_2025_60085_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/0ddf9db90dfe/41467_2025_60085_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/6176af4edc1e/41467_2025_60085_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4171/12117070/11d699ce69b3/41467_2025_60085_Fig4_HTML.jpg

相似文献

1
Memristive Bellman solver for decision-making.用于决策的忆阻贝尔曼求解器
Nat Commun. 2025 May 27;16(1):4925. doi: 10.1038/s41467-025-60085-w.
2
Fully analog iteration for solving matrix equations with in-memory computing.用于通过内存计算求解矩阵方程的全模拟迭代
Sci Adv. 2025 Feb 14;11(7):eadr6391. doi: 10.1126/sciadv.adr6391. Epub 2025 Feb 12.
3
Parallel multigrid method for solving inverse problems.求解逆问题的并行多重网格方法。
MethodsX. 2022 Nov 1;9:101887. doi: 10.1016/j.mex.2022.101887. eCollection 2022.
4
In-Memory-Computing Realization with a Photodiode/Memristor Based Vision Sensor.基于光电二极管/忆阻器的视觉传感器实现内存计算
Materials (Basel). 2021 Sep 10;14(18):5223. doi: 10.3390/ma14185223.
5
Model-Free λ-Policy Iteration for Discrete-Time Linear Quadratic Regulation.无模型 λ-策略迭代法用于离散时间线性二次调节。
IEEE Trans Neural Netw Learn Syst. 2023 Feb;34(2):635-649. doi: 10.1109/TNNLS.2021.3098985. Epub 2023 Feb 3.
6
Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems.策略迭代自适应动态规划算法用于离散时间非线性系统。
IEEE Trans Neural Netw Learn Syst. 2014 Mar;25(3):621-34. doi: 10.1109/TNNLS.2013.2281663.
7
Forward and Backward Bellman Equations Improve the Efficiency of the EM Algorithm for DEC-POMDP.前向和后向贝尔曼方程提高了用于去中心化部分可观测马尔可夫决策过程的期望最大化算法的效率。
Entropy (Basel). 2021 Apr 29;23(5):551. doi: 10.3390/e23050551.
8
An Efficient and Robust Partial Differential Equation Solver by Flash-Based Computing in Memory.基于内存闪存计算的高效稳健偏微分方程求解器
Micromachines (Basel). 2023 Apr 22;14(5):901. doi: 10.3390/mi14050901.
9
Approximate Policy Iteration With Deep Minimax Average Bellman Error Minimization.基于深度极小极大平均贝尔曼误差最小化的近似策略迭代
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2288-2299. doi: 10.1109/TNNLS.2023.3346992. Epub 2025 Feb 6.
10
Continuous-Time Time-Varying Policy Iteration.连续时间时变策略迭代
IEEE Trans Cybern. 2020 Dec;50(12):4958-4971. doi: 10.1109/TCYB.2019.2926631. Epub 2020 Dec 3.

本文引用的文献

1
Thousands of conductance levels in memristors integrated on CMOS.在 CMOS 上集成的数千个电导水平的忆阻器。
Nature. 2023 Mar;615(7954):823-829. doi: 10.1038/s41586-023-05759-5. Epub 2023 Mar 29.
2
Highly-scaled and fully-integrated 3-dimensional ferroelectric transistor array for hardware implementation of neural networks.用于神经网络硬件实现的高扩展和全集成三维铁电晶体管阵列。
Nat Commun. 2023 Jan 31;14(1):504. doi: 10.1038/s41467-023-36270-0.
3
Continuous-Time Fitted Value Iteration for Robust Policies.连续时间拟合值迭代法的鲁棒策略。
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5534-5548. doi: 10.1109/TPAMI.2022.3215769. Epub 2023 Apr 3.
4
Nanosecond protonic programmable resistors for analog deep learning.纳秒质子可程控电阻器用于模拟深度学习。
Science. 2022 Jul 29;377(6605):539-543. doi: 10.1126/science.abp8064. Epub 2022 Jul 28.
5
CMOS-compatible compute-in-memory accelerators based on integrated ferroelectric synaptic arrays for convolution neural networks.基于集成铁电突触阵列的用于卷积神经网络的CMOS兼容内存计算加速器。
Sci Adv. 2022 Apr 8;8(14):eabm8537. doi: 10.1126/sciadv.abm8537.
6
Evolution of the conductive filament system in HfO-based memristors observed by direct atomic-scale imaging.通过直接原子尺度成像观察到的基于HfO的忆阻器中传导细丝系统的演变。
Nat Commun. 2021 Dec 13;12(1):7232. doi: 10.1038/s41467-021-27575-z.
7
Memristive control of mutual spin Hall nano-oscillator synchronization for neuromorphic computing.用于神经形态计算的互自旋霍尔纳米振荡器同步的忆阻控制
Nat Mater. 2022 Jan;21(1):81-87. doi: 10.1038/s41563-021-01153-6. Epub 2021 Nov 29.
8
Nanoscale neural network using non-linear spin-wave interference.利用非线性自旋波干涉的纳米级神经网络。
Nat Commun. 2021 Nov 5;12(1):6422. doi: 10.1038/s41467-021-26711-z.
9
In materia reservoir computing with a fully memristive architecture based on self-organizing nanowire networks.在基于自组织纳米线网络的全忆阻器架构的材料库计算中。
Nat Mater. 2022 Feb;21(2):195-202. doi: 10.1038/s41563-021-01099-9. Epub 2021 Oct 4.
10
Spontaneous sparse learning for PCM-based memristor neural networks.基于 PCM 的忆阻器神经网络的自发稀疏学习。
Nat Commun. 2021 Jan 12;12(1):319. doi: 10.1038/s41467-020-20519-z.