Ghiasi S, Patane A, Greco A, Laurenti L, Scilingo E P, Kwiatkowska M
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:54-57. doi: 10.1109/EMBC44109.2020.9176437.
While machine learning algorithms are able to detect subtle patterns of interest in data, expert knowledge may contain crucial information that is not easily extracted from a given dataset, especially when the latter is small or noisy. In this paper we investigate the suitability of Gaussian Process Classification (GPC) as an effective model to implement the domain knowledge in an algorithm's training phase. Building on their Bayesian nature, we proceed by injecting problem- specific domain knowledge in the form of an a-priori distribution on the GPC latent function. We do this by extracting handcrafted features from the input data, and correlating them to the logits of the classification problem through fitting a prior function informed by the physiology of the problem. The physiologically-informed prior of the GPC is then updated through the Bayes formula using the available dataset. We apply the methods discussed here to a two-class classification problem associated to a dataset comprising Heart Rate Variability (HRV) and Electrodermal Activity (EDA) signals collected from 26 subjects who were exposed to a physical stressor aimed at altering their autonomic nervous systems dynamics. We provide comparative computational experiments on the selection of appropriate physiologically-inspired GPC prior functions. We find that the recognition of the presence of the physical stressor is significantly enhanced when the physiologically-inspired prior knowledge is injected into the GPC model.
虽然机器学习算法能够检测数据中感兴趣的微妙模式,但专家知识可能包含从给定数据集中不易提取的关键信息,尤其是当数据集较小或存在噪声时。在本文中,我们研究高斯过程分类(GPC)作为一种有效模型在算法训练阶段实施领域知识的适用性。基于其贝叶斯性质,我们通过以GPC潜在函数的先验分布形式注入特定问题的领域知识来进行研究。我们通过从输入数据中提取手工特征,并通过拟合由问题生理学提供信息的先验函数,将它们与分类问题的对数几率相关联来实现这一点。然后,通过使用可用数据集通过贝叶斯公式更新GPC的生理学信息先验。我们将这里讨论的方法应用于与一个数据集相关的两类分类问题,该数据集包含从26名受试者收集的心率变异性(HRV)和皮肤电活动(EDA)信号,这些受试者暴露于旨在改变其自主神经系统动态的身体应激源。我们提供了关于选择合适的生理学启发的GPC先验函数的比较计算实验。我们发现,当将生理学启发的先验知识注入GPC模型时,对身体应激源存在的识别显著增强。