Learned-Miller Erik G
Department of Computer Science, University of Massachusetts, Amherst, 140 Governor's Drive, Amherst, MA 01003, USA.
IEEE Trans Pattern Anal Mach Intell. 2006 Feb;28(2):236-50. doi: 10.1109/TPAMI.2006.34.
This paper presents a family of techniques that we call congealing for modeling image classes from data. The idea is to start with a set of images and make them appear as similar as possible by removing variability along the known axes of variation. This technique can be used to eliminate "nuisance" variables such as affine deformations from handwritten digits or unwanted bias fields from magnetic resonance images. In addition to separating and modeling the latent images-i.e., the images without the nuisance variables-we can model the nuisance variables themselves, leading to factorized generative image models. When nuisance variable distributions are shared between classes, one can share the knowledge learned in one task with another task, leading to efficient learning. We demonstrate this process by building a handwritten digit classifier from just a single example of each class. In addition to applications in handwritten character recognition, we describe in detail the application of bias removal from magnetic resonance images. Unlike previous methods, we use a separate, nonparametric model for the intensity values at each pixel. This allows us to leverage the data from the MR images of different patients to remove bias from each other. Only very weak assumptions are made about the distributions of intensity values in the images. In addition to the digit and MR applications, we discuss a number of other uses of congealing and describe experiments about the robustness and consistency of the method.
本文提出了一类我们称为“凝聚”的技术,用于从数据中对图像类别进行建模。其思路是从一组图像开始,通过消除沿已知变化轴的变异性,使它们尽可能相似。该技术可用于消除“干扰”变量,如手写数字中的仿射变形或磁共振图像中不需要的偏置场。除了分离和建模潜在图像(即没有干扰变量的图像)之外,我们还可以对干扰变量本身进行建模,从而得到因式分解的生成图像模型。当干扰变量分布在不同类别之间共享时,一个人可以在一个任务中学到的知识与另一个任务共享,从而实现高效学习。我们通过仅从每个类别的单个示例构建手写数字分类器来演示这个过程。除了在手写字符识别中的应用,我们还详细描述了从磁共振图像中去除偏置的应用。与以前的方法不同,我们对每个像素的强度值使用单独的非参数模型。这使我们能够利用来自不同患者的磁共振图像数据相互去除偏置。对于图像中强度值的分布只做了非常弱的假设。除了数字和磁共振应用之外,我们还讨论了凝聚的一些其他用途,并描述了关于该方法的鲁棒性和一致性的实验。