IEEE Trans Cybern. 2021 Jul;51(7):3455-3468. doi: 10.1109/TCYB.2019.2947622. Epub 2021 Jun 23.
Soft sensors have been widely accepted for online estimating key quality-related variables in industrial processes. The Gaussian mixture models (GMM) is one of the most popular soft sensing methods for the non-Gaussian industrial processes. However, in industrial applications, the quantity of samples with known labels is usually quite limited because of the technical limitations or economical reasons. Traditional GMM-based soft sensor models solely depending on labeled samples may easily suffer from singular covariances, overfitting, and difficulties in model selection, which results in the performance deterioration. To tackle these issues, we propose a semisupervised Bayesian GMM (SBGMM). In the SBGMM, we first propose a semisupervised fully Bayesian model, which enables learning from both the labeled and unlabeled datasets for remedying the deficiency of infrequent labeled samples. Subsequently, a general framework of weighted variational inference is developed to train the SBGMM, such that the rate of learning from unlabeled samples can be controlled by penalizing the unlabeled dataset. Case studies are carried out to evaluate the performance of the SBGMM through a numerical example and two real-world industrial processes, which demonstrate the effectiveness and reliability of the proposed approach.
软传感器已被广泛接受,用于在线估计工业过程中的关键质量相关变量。高斯混合模型(GMM)是最流行的非高斯工业过程软传感方法之一。然而,在工业应用中,由于技术限制或经济原因,通常只有有限数量的带标签样本。传统的基于 GMM 的软传感器模型仅依赖于带标签的样本,可能容易受到奇异协方差、过拟合和模型选择困难的影响,从而导致性能下降。为了解决这些问题,我们提出了一种半监督贝叶斯 GMM(SBGMM)。在 SBGMM 中,我们首先提出了一种半监督全贝叶斯模型,该模型能够从带标签和无标签数据集进行学习,以弥补带标签样本不足的问题。随后,开发了一种通用的加权变分推理框架来训练 SBGMM,以便通过惩罚无标签数据集来控制从无标签样本中学习的速度。通过数值示例和两个实际工业过程进行了案例研究,以评估 SBGMM 的性能,结果表明了所提出方法的有效性和可靠性。