Kelemen János Z, Kertész-Farkas Attila, Kocsor András, Puskás László G
Laboratory of Functional Genomics, Biological Research Centre, Hungarian Academy of Sciences, Szeged Temesvári krt. 62, H-6726, Hungary.
Bioinformatics. 2006 Dec 15;22(24):3047-53. doi: 10.1093/bioinformatics/btl545. Epub 2006 Oct 25.
In this paper, we propose using the Kalman filter (KF) as a pre-processing step in microarray-based molecular diagnosis. Incorporating the expression covariance between genes is important in such classification problems, since this represents the functional relationships that govern tissue state. Failing to fulfil such requirements may result in biologically implausible class prediction models. Here, we show that employing the KF to remove noise (while retaining meaningful covariance and thus being able to estimate the underlying biological state from microarray measurements) yields linearly separable data suitable for most classification algorithms.
We demonstrate the utility and performance of the KF as a robust disease-state estimator on publicly available binary and multi-class microarray datasets in combination with the most widely used classification methods to date. Moreover, using popular graphical representation schemes we show that our filtered datasets also have an improved visualization capability.
在本文中,我们提议将卡尔曼滤波器(KF)用作基于微阵列的分子诊断中的预处理步骤。在这类分类问题中纳入基因间的表达协方差很重要,因为这代表了控制组织状态的功能关系。未能满足此类要求可能会导致生物学上不合理的类别预测模型。在此,我们表明使用KF去除噪声(同时保留有意义的协方差,从而能够从微阵列测量中估计潜在的生物学状态)会产生适用于大多数分类算法的线性可分数据。
我们结合迄今为止使用最广泛的分类方法,在公开可用的二元和多类微阵列数据集上证明了KF作为稳健疾病状态估计器的效用和性能。此外,使用流行的图形表示方案,我们表明我们的滤波后数据集还具有改进的可视化能力。