Department of Electrical and System Engineering, University of Pennsylvania, Philadelphia, PA 19104, USA.
IEEE Trans Med Imaging. 2012 Jan;31(1):51-69. doi: 10.1109/TMI.2011.2162961. Epub 2011 Jul 25.
This paper presents a novel dimensionality reduction method for classification in medical imaging. The goal is to transform very high-dimensional input (typically, millions of voxels) to a low-dimensional representation (small number of constructed features) that preserves discriminative signal and is clinically interpretable. We formulate the task as a constrained optimization problem that combines generative and discriminative objectives and show how to extend it to the semi-supervised learning (SSL) setting. We propose a novel large-scale algorithm to solve the resulting optimization problem. In the fully supervised case, we demonstrate accuracy rates that are better than or comparable to state-of-the-art algorithms on several datasets while producing a representation of the group difference that is consistent with prior clinical reports. Effectiveness of the proposed algorithm for SSL is evaluated with both benchmark and medical imaging datasets. In the benchmark datasets, the results are better than or comparable to the state-of-the-art methods for SSL. For evaluation of the SSL setting in medical datasets, we use images of subjects with mild cognitive impairment (MCI), which is believed to be a precursor to Alzheimer's disease (AD), as unlabeled data. AD subjects and normal control (NC) subjects are used as labeled data, and we try to predict conversion from MCI to AD on follow-up. The semi-supervised extension of this method not only improves the generalization accuracy for the labeled data (AD/NC) slightly but is also able to predict subjects which are likely to converge to AD.
本文提出了一种新的医学影像分类降维方法。目标是将非常高维的输入(通常是数百万体素)转换为低维表示(少量构建的特征),同时保留有判别信号且具有临床可解释性。我们将任务表述为一个结合生成和判别目标的约束优化问题,并展示了如何将其扩展到半监督学习(SSL)设置中。我们提出了一种新颖的大规模算法来解决由此产生的优化问题。在完全监督的情况下,我们在几个数据集上展示了比最先进的算法更好或相当的准确率,同时产生的组间差异表示与先前的临床报告一致。我们还使用基准数据集和医学影像数据集评估了所提出算法在 SSL 中的有效性。在基准数据集上,结果优于或与 SSL 的最先进方法相当。对于医学数据集的 SSL 设置评估,我们使用轻度认知障碍(MCI)患者的图像作为未标记数据,MCI 被认为是阿尔茨海默病(AD)的前兆。AD 患者和正常对照组(NC)患者被用作标记数据,我们尝试预测随访中从 MCI 到 AD 的转换。该方法的半监督扩展不仅略微提高了标记数据(AD/NC)的泛化准确性,而且还能够预测可能向 AD 收敛的患者。