Baraldi Andrea, Bruzzone Lorenzo, Blonda Palma
Istituto di Studi su Sistemi Intelligenti per l'Automazione, Consiglio Nazionale delle Ricerche, 70126 Bari, Italy.
IEEE Trans Image Process. 2006 Aug;15(8):2208-25. doi: 10.1109/tip.2006.875220.
This paper deals with the problem of badly posed image classification. Although underestimated in practice, bad-posedness is likely to affect many real-world image classification tasks, where reference samples are difficult to collect (e.g., in remote sensing (RS) image mapping) and/or spatial autocorrelation is relevant. In an image classification context affected by a lack of reference samples, an original inductive learning multiscale image classifier, termed multiscale semisupervised expectation maximization (MSEM), is proposed. The rationale behind MSEM is to combine useful complementary properties of two alternative data mapping procedures recently published outside of image processing literature, namely, the multiscale modified Pappas adaptive clustering (MPAC) algorithm and the sample-based semisupervised expectation maximization (SEM) classifier. To demonstrate its potential utility, MSEM is compared against nonstandard classifiers, such as MPAC, SEM and the single-scale contextual SEM (CSEM) classifier, besides against well-known standard classifiers in two RS image classification problems featuring few reference samples and modestly useful texture information. These experiments yield weak (subjective) but numerous quantitative map quality indexes that are consistent with both theoretical considerations and qualitative evaluations by expert photointerpreters. According to these quantitative results, MSEM is competitive in terms of overall image mapping performance at the cost of a computational overhead three to six times superior to that of its most interesting rival, SEM. More in general, our experiments confirm that, even if they rely on heavy class-conditional normal distribution assumptions that may not be true in many real-world problems (e.g., in highly textured images), semisupervised classifiers based on the iterative expectation maximization Gaussian mixture model solution can be very powerful in practice when: 1) there is a lack of reference samples with respect to the problem/model complexity and 2) texture information is considered negligible (i.e., a piecewise constant image model holds).
本文探讨不适定图像分类问题。尽管在实践中未得到充分重视,但不适定性可能会影响许多现实世界中的图像分类任务,在这些任务中,参考样本难以收集(例如在遥感(RS)图像映射中)和/或空间自相关性是相关的。在受参考样本缺乏影响的图像分类背景下,提出了一种原始的归纳学习多尺度图像分类器,称为多尺度半监督期望最大化(MSEM)。MSEM背后的基本原理是结合最近在图像处理文献之外发表的两种替代数据映射程序的有用互补特性,即多尺度修正帕帕斯自适应聚类(MPAC)算法和基于样本的半监督期望最大化(SEM)分类器。为了证明其潜在效用,除了在两个具有少量参考样本和适度有用纹理信息的RS图像分类问题中与著名的标准分类器进行比较外,还将MSEM与非标准分类器(如MPAC、SEM和单尺度上下文SEM(CSEM)分类器)进行了比较。这些实验产生了微弱(主观)但众多的定量地图质量指标,这些指标与理论考虑以及专家图像判读员的定性评估一致。根据这些定量结果,MSEM在整体图像映射性能方面具有竞争力,但其计算开销比最具竞争力的对手SEM高出三到六倍。更一般地说,我们的实验证实,即使它们依赖于在许多现实世界问题(例如在纹理丰富的图像中)可能不成立的重类条件正态分布假设,但基于迭代期望最大化高斯混合模型解决方案的半监督分类器在以下情况下在实践中可能非常强大:1)相对于问题/模型复杂性缺乏参考样本,以及2)纹理信息被认为可以忽略不计(即分段常数图像模型成立)。