具有安全约束的贝叶斯优化：机器人领域中安全且自动的参数调整

Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics.

作者信息

Berkenkamp Felix, Krause Andreas, Schoellig Angela P

机构信息

Department of Computer Science, ETH Zurich, Zurich, Switzerland.

Institute for Aerospace Studies, University of Toronto, Toronto, Canada.

出版信息

Mach Learn. 2023;112(10):3713-3747. doi: 10.1007/s10994-021-06019-1. Epub 2021 Jun 24.

DOI:10.1007/s10994-021-06019-1

PMID:37692295

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10485113/

Abstract

Selecting the right tuning parameters for algorithms is a pravelent problem in machine learning that can significantly affect the performance of algorithms. Data-efficient optimization algorithms, such as Bayesian optimization, have been used to automate this process. During experiments on real-world systems such as robotic platforms these methods can evaluate unsafe parameters that lead to safety-critical system failures and can destroy the system. Recently, a safe Bayesian optimization algorithm, called SafeOpt, has been developed, which guarantees that the performance of the system never falls below a critical value; that is, safety is defined based on the performance function. However, coupling performance and safety is often not desirable in practice, since they are often opposing objectives. In this paper, we present a generalized algorithm that allows for multiple safety constraints separate from the objective. Given an initial set of safe parameters, the algorithm maximizes performance but only evaluates parameters that satisfy safety for all constraints with high probability. To this end, it carefully explores the parameter space by exploiting regularity assumptions in terms of a Gaussian process prior. Moreover, we show how context variables can be used to safely transfer knowledge to new situations and tasks. We provide a theoretical analysis and demonstrate that the proposed algorithm enables fast, automatic, and safe optimization of tuning parameters in experiments on a quadrotor vehicle.

摘要

为算法选择合适的调优参数是机器学习中一个普遍存在的问题，它会显著影响算法的性能。数据高效的优化算法，如贝叶斯优化，已被用于自动化这一过程。在诸如机器人平台等现实世界系统的实验中，这些方法可能会评估导致安全关键系统故障并可能破坏系统的不安全参数。最近，一种名为SafeOpt的安全贝叶斯优化算法已经被开发出来，它保证系统性能永远不会低于临界值；也就是说，安全是基于性能函数来定义的。然而，在实践中，将性能和安全耦合起来往往是不可取的，因为它们通常是相互对立的目标。在本文中，我们提出了一种广义算法，该算法允许将多个安全约束与目标分开。给定一组初始安全参数，该算法会最大化性能，但只评估极有可能满足所有约束条件下的安全性的参数。为此，它通过利用高斯过程先验中的正则性假设来仔细探索参数空间。此外，我们展示了上下文变量如何用于安全地将知识转移到新的情况和任务中。我们进行了理论分析，并证明了所提出的算法能够在四旋翼飞行器实验中快速、自动且安全地优化调优参数。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be6/10485113/0663391eda9a/10994_2021_6019_Fig1_HTML.jpg

相似文献

Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics.具有安全约束的贝叶斯优化：机器人领域中安全且自动的参数调整

Mach Learn. 2023;112(10):3713-3747. doi: 10.1007/s10994-021-06019-1. Epub 2021 Jun 24.

SAFE-OPT: a Bayesian optimization algorithm for learning optimal deep brain stimulation parameters with safety constraints.SAFE-OPT：一种贝叶斯优化算法，用于学习具有安全约束的最优深部脑刺激参数。

J Neural Eng. 2024 Aug 14;21(4). doi: 10.1088/1741-2552/ad6cf3.

Progressive sampling-based Bayesian optimization for efficient and automatic machine learning model selection.基于渐进采样的贝叶斯优化，用于高效自动的机器学习模型选择。

Health Inf Sci Syst. 2017 Sep 27;5(1):2. doi: 10.1007/s13755-017-0023-z. eCollection 2017 Dec.

Optimizing Machine Learning Algorithms for Landslide Susceptibility Mapping along the Karakoram Highway, Gilgit Baltistan, Pakistan: A Comparative Study of Baseline, Bayesian, and Metaheuristic Hyperparameter Optimization Techniques.优化巴基斯坦吉尔吉特-巴尔蒂斯坦喀喇昆仑公路沿线滑坡易发性制图的机器学习算法：基线、贝叶斯和元启发式超参数优化技术的比较研究

Sensors (Basel). 2023 Aug 1;23(15):6843. doi: 10.3390/s23156843.

Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data.使用微阵列基因表达数据的用于疾病分类的核嵌入高斯过程。

BMC Bioinformatics. 2007 Feb 28;8:67. doi: 10.1186/1471-2105-8-67.

Parameter Sensitivity Analysis for the Progressive Sampling-Based Bayesian Optimization Method for Automated Machine Learning Model Selection.用于自动化机器学习模型选择的基于渐进采样的贝叶斯优化方法的参数敏感性分析

Heterog Data Manag Polystores Anal Healthc (2020). 2021;12633:213-227. doi: 10.1007/978-3-030-71055-2_17. Epub 2021 Mar 4.

Bayesian reaction optimization as a tool for chemical synthesis.贝叶斯反应优化作为化学合成的工具。

Nature. 2021 Feb;590(7844):89-96. doi: 10.1038/s41586-021-03213-y. Epub 2021 Feb 3.

Implicit Posteriori Parameter Distribution Optimization in Reinforcement Learning.强化学习中的隐式后验参数分布优化

IEEE Trans Cybern. 2024 May;54(5):3051-3064. doi: 10.1109/TCYB.2023.3254596. Epub 2024 Apr 16.

Shielded Planning Guided Data-Efficient and Safe Reinforcement Learning.屏蔽规划引导的数据高效且安全的强化学习

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3808-3819. doi: 10.1109/TNNLS.2024.3359031. Epub 2025 Feb 6.

Funneled Bayesian Optimization for Design, Tuning and Control of Autonomous Systems.漏斗贝叶斯优化在自主系统设计、调优和控制中的应用。

IEEE Trans Cybern. 2019 Apr;49(4):1489-1500. doi: 10.1109/TCYB.2018.2805695. Epub 2018 Feb 27.

引用本文的文献

Failure modes and mitigations for Bayesian optimization of neuromodulation parameters.神经调节参数贝叶斯优化的失效模式及缓解措施。

J Neural Eng. 2025 Jun 13;22(3):036038. doi: 10.1088/1741-2552/ade189.

Safe contact-based robot active search using Bayesian optimization and control barrier functions.基于安全接触的机器人主动搜索：利用贝叶斯优化和控制障碍函数

Front Robot AI. 2024 Apr 29;11:1344367. doi: 10.3389/frobt.2024.1344367. eCollection 2024.

Bayesian optimization for demographic inference.贝叶斯优化在人口推断中的应用。

G3 (Bethesda). 2023 Jul 5;13(7). doi: 10.1093/g3journal/jkad080.

Bayesian optimization with unknown constraints in graphical skill models for compliant manipulation tasks using an industrial robot.使用工业机器人进行柔顺操作任务的图形技能模型中具有未知约束的贝叶斯优化。

Front Robot AI. 2022 Oct 14;9:993359. doi: 10.3389/frobt.2022.993359. eCollection 2022.

Bayesian optimization of distributed neurodynamical controller models for spatial navigation.用于空间导航的分布式神经动力学控制器模型的贝叶斯优化

Array (N Y). 2022 Sep;15. doi: 10.1016/j.array.2022.100218. Epub 2022 Jul 15.

A time-dependent parameter estimation framework for crop modeling.基于时间的作物模型参数估计框架。

Sci Rep. 2021 Jun 1;11(1):11437. doi: 10.1038/s41598-021-90835-x.

本文引用的文献

Reinforcement learning of motor skills with policy gradients.基于策略梯度的运动技能强化学习。

Neural Netw. 2008 May;21(4):682-97. doi: 10.1016/j.neunet.2008.02.003. Epub 2008 Apr 26.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

具有安全约束的贝叶斯优化：机器人领域中安全且自动的参数调整

Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献