• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过强化学习估计时空场。

Estimating spatio-temporal fields through reinforcement learning.

作者信息

Padrao Paulo, Fuentes Jose, Bobadilla Leonardo, Smith Ryan N

机构信息

Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, United States.

Institute for Environment, Florida International University, Miami, FL, United States.

出版信息

Front Robot AI. 2022 Sep 5;9:878246. doi: 10.3389/frobt.2022.878246. eCollection 2022.

DOI:10.3389/frobt.2022.878246
PMID:36134337
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9483151/
Abstract

Prediction and estimation of phenomena of interest in aquatic environments are challenging since they present complex spatio-temporal dynamics. Over the past few decades, advances in machine learning and data processing contributed to ocean exploration and sampling using autonomous robots. In this work, we formulate a reinforcement learning framework to estimate spatio-temporal fields modeled by partial differential equations. The proposed framework addresses problems of the classic methods regarding the sampling process to determine the path to be used by the agent to collect samples. Simulation results demonstrate the applicability of our approach and show that the error at the end of the learning process is close to the expected error given by the fitting process due to added noise.

摘要

预测和估计水生环境中感兴趣的现象具有挑战性,因为它们呈现出复杂的时空动态。在过去几十年中,机器学习和数据处理的进步推动了使用自主机器人进行海洋探索和采样。在这项工作中,我们制定了一个强化学习框架,以估计由偏微分方程建模的时空场。所提出的框架解决了经典方法在采样过程中存在的问题,即确定智能体用于收集样本的路径。仿真结果证明了我们方法的适用性,并表明学习过程结束时的误差接近由于添加噪声而由拟合过程给出的预期误差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/5ec7e3b11b8b/frobt-09-878246-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/21955b36b61c/frobt-09-878246-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/6480eae5e87b/frobt-09-878246-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/ba3b40227679/frobt-09-878246-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/f980b61220c0/frobt-09-878246-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/0f251a9332b6/frobt-09-878246-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/5ec7e3b11b8b/frobt-09-878246-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/21955b36b61c/frobt-09-878246-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/6480eae5e87b/frobt-09-878246-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/ba3b40227679/frobt-09-878246-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/f980b61220c0/frobt-09-878246-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/0f251a9332b6/frobt-09-878246-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f76/9483151/5ec7e3b11b8b/frobt-09-878246-g006.jpg

相似文献

1
Estimating spatio-temporal fields through reinforcement learning.通过强化学习估计时空场。
Front Robot AI. 2022 Sep 5;9:878246. doi: 10.3389/frobt.2022.878246. eCollection 2022.
2
An End-to-End Deep Reinforcement Learning-Based Intelligent Agent Capable of Autonomous Exploration in Unknown Environments.一种基于端到端深度学习的智能代理,能够在未知环境中自主探索。
Sensors (Basel). 2018 Oct 22;18(10):3575. doi: 10.3390/s18103575.
3
Cognitive navigation based on nonuniform Gabor space sampling, unsupervised growing networks, and reinforcement learning.基于非均匀伽柏空间采样、无监督生长网络和强化学习的认知导航。
IEEE Trans Neural Netw. 2004 May;15(3):639-52. doi: 10.1109/TNN.2004.826221.
4
A novel framework for spatio-temporal prediction of environmental data using deep learning.一种基于深度学习的环境数据时空预测新框架。
Sci Rep. 2020 Dec 17;10(1):22243. doi: 10.1038/s41598-020-79148-7.
5
Navigation in Unknown Dynamic Environments Based on Deep Reinforcement Learning.基于深度强化学习的未知动态环境导航。
Sensors (Basel). 2019 Sep 5;19(18):3837. doi: 10.3390/s19183837.
6
Plume Tracing via Model-Free Reinforcement Learning Method.基于无模型强化学习方法的羽流追踪
IEEE Trans Neural Netw Learn Syst. 2019 Aug;30(8):2515-2527. doi: 10.1109/TNNLS.2018.2885374. Epub 2019 Jan 1.
7
Toward Benchmarking of Long-Term Spatio-Temporal Maps of Pedestrian Flows for Human-Aware Navigation.迈向用于人感知导航的行人流动长期时空地图的基准测试
Front Robot AI. 2022 Jul 4;9:890013. doi: 10.3389/frobt.2022.890013. eCollection 2022.
8
Reinforcement learning-based estimation for spatio-temporal systems.基于强化学习的时空系统估计
Sci Rep. 2024 Sep 28;14(1):22464. doi: 10.1038/s41598-024-72055-1.
9
Deep learning with 4D spatio-temporal data representations for OCT-based force estimation.基于 OCT 的力估计的 4D 时空数据表示的深度学习。
Med Image Anal. 2020 Aug;64:101730. doi: 10.1016/j.media.2020.101730. Epub 2020 May 23.
10
Learning Environmental Field Exploration with Computationally Constrained Underwater Robots: Gaussian Processes Meet Stochastic Optimal Control.利用计算受限水下机器人进行学习环境实地探索:高斯过程与随机最优控制相结合。
Sensors (Basel). 2019 May 6;19(9):2094. doi: 10.3390/s19092094.

引用本文的文献

1
A survey on autonomous environmental monitoring approaches: towards unifying active sensing and reinforcement learning.自主环境监测方法综述:迈向主动感知与强化学习的统一
Front Robot AI. 2024 Mar 12;11:1336612. doi: 10.3389/frobt.2024.1336612. eCollection 2024.

本文引用的文献

1
Reinforcement Learning-Based Tracking Control of USVs in Varying Operational Conditions.基于强化学习的无人水面艇在不同运行条件下的跟踪控制
Front Robot AI. 2020 Mar 20;7:32. doi: 10.3389/frobt.2020.00032. eCollection 2020.
2
Solving high-dimensional partial differential equations using deep learning.使用深度学习解决高维偏微分方程。
Proc Natl Acad Sci U S A. 2018 Aug 21;115(34):8505-8510. doi: 10.1073/pnas.1718942115. Epub 2018 Aug 6.
3
Parameter Estimation of Partial Differential Equation Models.偏微分方程模型的参数估计
J Am Stat Assoc. 2013;108(503). doi: 10.1080/01621459.2013.794730.