Department of Teoría de la Señal y Comunicaciones, University of Valladolid, Valladolid, Spain.
IEEE Trans Biomed Eng. 2010 Dec;57(12):2850-60. doi: 10.1109/TBME.2010.2080679. Epub 2010 Sep 27.
We present a method for supervised, automatic, and reliable classification of healthy controls, patients with bipolar disorder, and patients with schizophrenia using brain imaging data. The method uses four supervised classification learning machines trained with a stochastic gradient learning rule based on the minimization of Kullback-Leibler divergence and an optimal model complexity search through posterior probability estimation. Prior to classification, given the high dimensionality of functional MRI (fMRI) data, a dimension reduction stage comprising two steps is performed: first, a one-sample univariate t-test mean-difference T(score) approach is used to reduce the number of significant discriminative functional activated voxels, and then singular value decomposition is performed to further reduce the dimension of the input patterns to a number comparable to the limited number of subjects available for each of the three classes. Experimental results using functional brain imaging (fMRI) data include receiver operation characteristic curves for the three-way classifier with area under curve values around 0.82, 0.89, and 0.90 for healthy control versus nonhealthy, bipolar disorder versus nonbipolar, and schizophrenia patients versus nonschizophrenia binary problems, respectively. The average three-way correct classification rate (CCR) is in the range of 70%-72%, for the test set, remaining close to the estimated Bayesian optimal CCR theoretical upper bound of about 80% , estimated from the one nearest-neighbor classifier over the same data.
我们提出了一种使用脑成像数据对健康对照组、双相情感障碍患者和精神分裂症患者进行监督、自动和可靠分类的方法。该方法使用四种基于最小化 Kullback-Leibler 散度的随机梯度学习规则和通过后验概率估计进行最优模型复杂度搜索的监督分类学习机进行训练。在分类之前,鉴于功能磁共振成像(fMRI)数据的高维性,执行包含两个步骤的降维阶段:首先,使用单样本单变量 t 检验均值差异 T(得分)方法来减少显著区分功能激活体素的数量,然后进行奇异值分解,以进一步将输入模式的维度降低到与每个三类中可用的有限数量的受试者数量可比的数量。使用功能脑成像(fMRI)数据的实验结果包括三分类器的接收者操作特征曲线,其曲线下面积分别约为 0.82、0.89 和 0.90,用于健康对照组与非健康对照组、双相情感障碍组与非双相情感障碍组、精神分裂症患者与非精神分裂症患者的二元问题。对于测试集,平均三分类器正确分类率(CCR)在 70%-72%范围内,接近估计的贝叶斯最优 CCR 理论上限 80%左右,该值是根据同一数据上的最近邻分类器估计得出的。