Department of Civil, Architectural and Environmental Engineering, University of Texas at Austin, Austin, Texas, USA.
J Air Waste Manag Assoc. 2022 Dec;72(12):1381-1397. doi: 10.1080/10962247.2022.2105439. Epub 2022 Oct 21.
A variety of factors can affect a person's perception of their environment and health, but one factor that is often overlooked in indoor settings is the air quality. To address this gap, we develop and evaluate four Machine Learning (ML) models on two disparate datasets using Indoor Air Quality (IAQ) parameters as primary features and components of self-reported IAQ satisfaction and sleep quality as target variables. In each case, we compare models to each other as well as to a simple model that always predicts the majority outcome. In the first analysis, we use open-source data collected from 93 California residences to predict occupant's satisfaction with their indoor environment. Results indicate building ventilation rate, Relative Humidity (RH), and formaldehyde are most influential when predicting IAQ perception and do so with an accuracy greater than the simplified model. The second analysis uses IAQ data gathered from a field study we conducted with 20 participants over 11 weeks to train similar models. We obtain accuracy and F1 scores similar to the simplified model where PM and TVOCs represent the most important predictors. Our results underscore the ability of IAQ to affect a person's perception of their built environment and health and highlight the utility of ML models to explore the strength of these relationships.: The results from this study show that two outcome variables - occupant's indoor air quality (IAQ) satisfaction and perceived sleep quality - are related to the measured IAQ parameters but not heavily influenced by typical values measured in apartments and homes. This study highlights the ability of machine learning models as exploratory analysis tools to determine underlying relationships within and across datasets in addition to understanding the importance of certain features on the outcome variable. We compare four different models and find that the random forest classifier has the best performance in both analysis on IAQ satisfaction and perceived sleep quality. It is a suitable model for predicting IAQ-related subjective metrics and also provides value insight into the feature importance of the IAQ parameters. The accuracy of any of these machine learning models in predicting occupants' comfort or sleep quality is limited by the dataset size, how data is collected, and range of data. This study identifies the factors that are important to IAQ perception: ventilation rate, relative humidity, and concentrations of formaldehyde, NO2, and particulate matter. It indicates that sensors that can measure these variables are necessary for future, related studies that model occupants' IAQ satisfaction. However, this study does not find strong relationships between any of the IAQ parameters measured in this study and perceived sleep quality despite the logical pathway between these many pollutants and respiratory issues. A prediction model of IAQ perception or sleep quality can be integrated into home management systems to automatically adjust building operations such as ventilation rates in smart buildings. Once buildings are equipped with a network of low-cost sensors that measure concentrations of pollutants and operating conditions of the ventilation system, the prediction model can be used to predict the occupants' comfort and facilitate the control of the ventilation system.
各种因素都会影响一个人对环境和健康的感知,但在室内环境中,有一个因素经常被忽视,那就是空气质量。为了解决这一差距,我们使用室内空气质量 (IAQ) 参数作为主要特征,并将自我报告的 IAQ 满意度和睡眠质量作为目标变量,在两个不同的数据集上开发和评估了四个机器学习 (ML) 模型。在每种情况下,我们将模型相互比较,以及与总是预测多数结果的简单模型进行比较。在第一次分析中,我们使用从 93 个加利福尼亚住宅收集的开源数据来预测居住者对其室内环境的满意度。结果表明,建筑物通风率、相对湿度 (RH) 和甲醛在预测室内空气质量感知方面的影响最大,其准确性高于简化模型。第二次分析使用我们在 11 周内对 20 名参与者进行的现场研究中收集的 IAQ 数据来训练类似的模型。我们获得的准确性和 F1 分数与简化模型相似,其中 PM 和 TVOC 是最重要的预测因素。我们的结果强调了空气质量对一个人对其建筑环境和健康感知的影响,并突出了机器学习模型作为探索性分析工具的实用性,以探索这些关系的强度。这项研究表明,两个结果变量——居住者的室内空气质量 (IAQ) 满意度和感知睡眠质量——与测量的 IAQ 参数有关,但不受公寓和家庭中测量的典型值的严重影响。这项研究强调了机器学习模型作为探索性分析工具的能力,除了了解某些特征对结果变量的重要性外,还可以确定数据集内和跨数据集的潜在关系。我们比较了四个不同的模型,发现随机森林分类器在 IAQ 满意度和感知睡眠质量的分析中表现最好。它是预测与 IAQ 相关的主观指标的合适模型,还为 IAQ 参数的特征重要性提供了有价值的见解。任何这些机器学习模型在预测居住者舒适度或睡眠质量方面的准确性都受到数据集大小、数据收集方式和数据范围的限制。本研究确定了对室内空气质量感知重要的因素:通风率、相对湿度以及甲醛、二氧化氮和颗粒物的浓度。它表明,未来研究模型居住者的室内空气质量满意度,需要能够测量这些变量的传感器。然而,尽管许多污染物和呼吸问题之间存在逻辑关系,但本研究并未发现本研究中测量的任何室内空气质量参数与感知睡眠质量之间存在很强的关系。室内空气质量感知或睡眠质量的预测模型可以集成到家庭管理系统中,以便在智能建筑中自动调整建筑操作,例如通风率。一旦建筑物配备了一个低成本传感器网络,该网络可以测量污染物浓度和通风系统的运行状况,就可以使用预测模型来预测居住者的舒适度,并促进通风系统的控制。