Efficient and assured reinforcement learning-based building HVAC control with heterogeneous expert-guided training.

Authors

Xu Shichao, Fu Yangyang, Wang Yixuan, Yang Zhuoran, Huang Chao, O'Neill Zheng, Wang Zhaoran, Zhu Qi

Affiliations

Northwestern University, McCormick School of Engineering, Evanston, 60208, USA.

Department of Mechanical Engineering, Texas A&M University, College Station, 77843, Texas, USA.

Publication information

Sci Rep. 2025 Mar 5;15(1):7677. doi: 10.1038/s41598-025-91326-z.

DOI:10.1038/s41598-025-91326-z

PMID:40044883

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11882997/

Abstract

Building heating, ventilation, and air conditioning (HVAC) systems account for nearly half of building energy consumption and [Formula: see text] of total energy consumption in the US. Their operation is also crucial for ensuring the physical and mental health of building occupants. Compared with traditional model-based HVAC control methods, recent model-free deep reinforcement learning (DRL) based methods have shown good performance without requiring the development of detailed and costly physical models. However, these model-free DRL approaches often need long training times to reach good performance, which is a major obstacle to their practical deployment. In this work, we present a systematic approach to accelerate online reinforcement learning for HVAC control by taking full advantage of the knowledge from domain experts in various forms. Specifically, the algorithm stages include learning expert functions from existing abstract physical models and from historical data via offline reinforcement learning, integrating the expert functions with rule-based guidelines, conducting training guided by the integrated expert function, and performing policy initialization from the distilled expert function. Moreover, to ensure that the learned DRL-based HVAC controller can effectively keep room temperature within the comfortable range for occupants, we design a runtime shielding framework to reduce the temperature violation rate and incorporate the learned controller into it. Experimental results demonstrate up to 8.8X speedup in DRL training from our approach over previous methods, with a low temperature violation rate.
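The abstract does not say how the runtime shielding framework is built, so the following is only a minimal sketch of the general idea of an action shield, not the authors' design. It assumes a one-step temperature predictor and a rule-based fallback are available. The names `ComfortShield`, `toy_model`, and `toy_fallback`, the 20 to 24 °C comfort range, and the toy dynamics are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class ComfortShield:
    """Hypothetical shield: overrides actions predicted to leave the comfort range."""
    t_min: float                              # lower comfort bound, deg C (assumed)
    t_max: float                              # upper comfort bound, deg C (assumed)
    predict: Callable[[float, float], float]  # (temperature, action) -> next temperature
    fallback: Callable[[float], float]        # temperature -> backup action

    def filter(self, temp: float, proposed: float) -> Tuple[float, bool]:
        """Return (action to apply, whether the proposed action was overridden)."""
        predicted = self.predict(temp, proposed)
        if self.t_min <= predicted <= self.t_max:
            return proposed, False
        return self.fallback(temp), True


def toy_model(temp: float, action: float) -> float:
    # Assumed toy dynamics: the room drifts toward 30 deg C outdoors, and the
    # action in [-1, 1] cools (negative) or heats (positive) by up to 2 deg C.
    return temp + 0.1 * (30.0 - temp) + 2.0 * action


def toy_fallback(temp: float) -> float:
    # Rule-based backup: aim for 22 deg C under the same toy dynamics, then clip.
    needed = (22.0 - temp - 0.1 * (30.0 - temp)) / 2.0
    return max(-1.0, min(1.0, needed))


shield = ComfortShield(t_min=20.0, t_max=24.0, predict=toy_model, fallback=toy_fallback)

# A learned controller would supply these proposals; here they are hard-coded.
safe_action, safe_overridden = shield.filter(23.5, -0.5)
risky_action, risky_overridden = shield.filter(23.5, 0.5)
print(safe_action, safe_overridden)
print(risky_action, risky_overridden)
```

In this sketch the learned controller's action passes through unchanged whenever the predicted next temperature stays in range, so the shield only intervenes on predicted violations. A real deployment would replace the toy predictor with a validated building model.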

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/11882997/afb94459a36e/41598_2025_91326_Fig3_HTML.jpg
