Wang Wei, Slepčev Dejan, Basu Saurav, Ozolek John A, Rohde Gustavo K
Center for Bioimage Informatics, Department of Biomedical Engineering, Carnegie Mellon University, Pittsburgh, PA, 15213 USA.
Int J Comput Vis. 2013 Jan 1;101(2):254-269. doi: 10.1007/s11263-012-0566-z.
Transportation-based metrics for comparing images have long been applied to analyze images, especially where one can interpret the pixel intensities (or derived quantities) as a distribution of 'mass' that can be transported without strict geometric constraints. Here we describe a new transportation-based framework for analyzing sets of images. More specifically, we describe a new transportation-related distance between pairs of images, which we denote as linear optimal transportation (LOT). The LOT can be used directly on pixel intensities, and is based on a linearized version of the Kantorovich-Wasserstein metric (an optimal transportation distance, as is the earth mover's distance). The new framework is especially well suited for computing all pairwise distances for a large database of images efficiently, and thus it can be used for pattern recognition in sets of images. In addition, the new LOT framework also allows for an isometric linear embedding, greatly facilitating the ability to visualize discriminant information in different classes of images. We demonstrate the application of the framework to several tasks such as discriminating nuclear chromatin patterns in cancer cells, decoding differences in facial expressions, galaxy morphologies, as well as sub cellular protein distributions.
长期以来,基于传输的图像比较指标一直被用于分析图像,特别是在人们可以将像素强度(或派生量)解释为一种“质量”分布的情况下,这种分布可以在没有严格几何约束的情况下进行传输。在此,我们描述一种用于分析图像集的新的基于传输的框架。更具体地说,我们描述了一种新的图像对之间与传输相关的距离,我们将其称为线性最优传输(LOT)。LOT可以直接用于像素强度,并且基于 Kantorovich-Wasserstein 度量(一种最优传输距离,如推土机距离)的线性化版本。这个新框架特别适合高效地计算大型图像数据库中所有图像对之间的距离,因此可用于图像集的模式识别。此外,新的LOT框架还允许进行等距线性嵌入,极大地促进了在不同类别的图像中可视化判别信息的能力。我们展示了该框架在多个任务中的应用,如区分癌细胞中的核染色质模式、解读面部表情差异、星系形态以及亚细胞蛋白质分布。