• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的时空建模最优传感器布局。

Reinforcement Learning-Based Optimal Sensor Placement for Spatiotemporal Modeling.

出版信息

IEEE Trans Cybern. 2020 Jun;50(6):2861-2871. doi: 10.1109/TCYB.2019.2901897. Epub 2019 Mar 18.

DOI:10.1109/TCYB.2019.2901897
PMID:30892267
Abstract

A reinforcement learning-based method is proposed for optimal sensor placement in the spatial domain for modeling distributed parameter systems (DPSs). First, a low-dimensional subspace, derived by Karhunen-Loève decomposition, is identified to capture the dominant dynamic features of the DPS. Second, a spatial objective function is proposed for the sensor placement. This function is defined in the obtained low-dimensional subspace by exploiting the time-space separation property of distributed processes, and in turn aims at minimizing the modeling error over the entire time and space domain. Third, the sensor placement configuration is mathematically formulated as a Markov decision process (MDP) with specified elements. Finally, the sensor locations are optimized through learning the optimal policies of the MDP according to the spatial objective function. The experimental results of a simulated catalytic rod and a real snap curing oven system are provided to demonstrate the feasibility and efficiency of the proposed method in solving the combinatorial optimization problems, such as optimal sensor placement.

摘要

提出了一种基于强化学习的方法,用于在空间域中进行最优传感器布置,以对分布参数系统(DPS)进行建模。首先,通过卡恩-洛维分解,确定一个低维子空间,以捕获 DPS 的主要动态特征。其次,提出了一个用于传感器布置的空间目标函数。该函数是在获得的低维子空间中定义的,利用了分布过程的时空分离特性,并旨在最小化整个时间和空间域上的建模误差。第三,将传感器布置配置形式化地表述为具有指定元素的马尔可夫决策过程(MDP)。最后,根据空间目标函数,通过学习 MDP 的最优策略来优化传感器位置。提供了一个模拟催化棒和一个真实的热固性烤箱系统的实验结果,以证明所提出的方法在解决组合优化问题(如最优传感器布置)方面的可行性和效率。

相似文献

1
Reinforcement Learning-Based Optimal Sensor Placement for Spatiotemporal Modeling.基于强化学习的时空建模最优传感器布局。
IEEE Trans Cybern. 2020 Jun;50(6):2861-2871. doi: 10.1109/TCYB.2019.2901897. Epub 2019 Mar 18.
2
Spatiotemporal Transformation-Based Neural Network With Interpretable Structure for Modeling Distributed Parameter Systems.基于时空变换且具有可解释结构的神经网络用于分布式参数系统建模
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):729-737. doi: 10.1109/TNNLS.2023.3334764. Epub 2025 Jan 7.
3
Optimizing the Sensor Placement for Foot Plantar Center of Pressure without Prior Knowledge Using Deep Reinforcement Learning.基于深度强化学习的足底压力中心无先验知识传感器优化放置。
Sensors (Basel). 2020 Sep 29;20(19):5588. doi: 10.3390/s20195588.
4
MOO-MDP: An Object-Oriented Representation for Cooperative Multiagent Reinforcement Learning.MOO-MDP:面向协同多智能体强化学习的面向对象表示。
IEEE Trans Cybern. 2019 Feb;49(2):567-579. doi: 10.1109/TCYB.2017.2781130. Epub 2017 Dec 28.
5
Efficient Sensor Placement Optimization for Shape Deformation Sensing of Antenna Structures with Fiber Bragg Grating Strain Sensors.基于光纤布拉格光栅应变传感器的天线结构形状变形感应的高效传感器位置优化。
Sensors (Basel). 2018 Aug 1;18(8):2481. doi: 10.3390/s18082481.
6
Hierarchical approximate policy iteration with binary-tree state space decomposition.基于二叉树状态空间分解的分层近似策略迭代
IEEE Trans Neural Netw. 2011 Dec;22(12):1863-77. doi: 10.1109/TNN.2011.2168422. Epub 2011 Oct 10.
7
An Edge Server Placement Method Based on Reinforcement Learning.一种基于强化学习的边缘服务器放置方法。
Entropy (Basel). 2022 Feb 23;24(3):317. doi: 10.3390/e24030317.
8
Intelligent control of a sensor-actuator system via kernelized least-squares policy iteration.基于核最小二乘策略迭代的传感器执行器系统智能控制。
Sensors (Basel). 2012;12(3):2632-53. doi: 10.3390/s120302632. Epub 2012 Feb 28.
9
Learning-Based DoS Attack Power Allocation in Multiprocess Systems.
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):8017-8030. doi: 10.1109/TNNLS.2022.3148924. Epub 2023 Oct 5.
10
An Energy-Efficient Spectrum-Aware Reinforcement Learning-Based Clustering Algorithm for Cognitive Radio Sensor Networks.一种用于认知无线电传感器网络的基于节能频谱感知强化学习的聚类算法。
Sensors (Basel). 2015 Aug 13;15(8):19783-818. doi: 10.3390/s150819783.

引用本文的文献

1
Domain Knowledge-Based Evolutionary Reinforcement Learning for Sensor Placement.基于领域知识的进化强化学习在传感器布放中的应用。
Sensors (Basel). 2022 May 17;22(10):3799. doi: 10.3390/s22103799.