Alkurdi Abdulrahman, He Maxine, Cerna Jonathan, Clore Jean, Sowers Richard, Hsiao-Wecksler Elizabeth T, Hernandez Manuel E
Department of Mechanical Science & Engineering, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA.
Neuroscience Program, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA.
Sensors (Basel). 2025 Feb 18;25(4):1241. doi: 10.3390/s25041241.
This study quantitatively evaluated whether and how machine learning (ML) models built by data from controlled conditions can fit real-world conditions. This study focused on feature-based models using wearable technology from real-world data collected from young adults, so as to provide insights into the models' robustness and the specific challenges posed by diverse environmental noise. Feature-based models, particularly XGBoost and Decision Trees, demonstrated considerable resilience, maintaining higher accuracy and reliability across different noise levels. This investigation included an in-depth analysis of transfer learning, highlighting its potential and limitations in adapting models developed from standard datasets, like WESAD, to complex real-world scenarios. Moreover, this study analyzed the distributed feature importance across various physiological signals, such as electrodermal activity (EDA) and electrocardiography (ECG), considering their vulnerability to environmental factors. It was found that integrating multiple physiological data types could significantly enhance model robustness. The results underscored the need for a nuanced understanding of signal contributions to model efficacy, suggesting that feature-based models showed much promise in practical applications.
本研究定量评估了基于受控条件下的数据构建的机器学习(ML)模型是否以及如何适用于现实世界的条件。本研究聚焦于使用从年轻人收集的现实世界数据中的可穿戴技术构建的基于特征的模型,以便深入了解模型的稳健性以及各种环境噪声带来的具体挑战。基于特征的模型,特别是XGBoost和决策树,展现出了相当的韧性,在不同噪声水平下保持了较高的准确性和可靠性。这项调查包括对迁移学习的深入分析,突出了其在使从标准数据集(如WESAD)开发的模型适应复杂现实世界场景方面的潜力和局限性。此外,本研究分析了各种生理信号(如皮肤电活动(EDA)和心电图(ECG))的分布式特征重要性,考虑到它们对环境因素的敏感性。研究发现,整合多种生理数据类型可以显著提高模型的稳健性。结果强调了对信号对模型效能的贡献进行细致理解的必要性,表明基于特征的模型在实际应用中很有前景。