School of Computer Engineering, The Nanyang Technological University, Singapore 639798.
IEEE Trans Image Process. 2010 Jan;19(1):174-84. doi: 10.1109/TIP.2009.2032939.
Biologically inspired feature (BIF) and its variations have been demonstrated to be effective and efficient for scene classification. It is unreasonable to measure the dissimilarity between two BIFs based on their Euclidean distance. This is because BIFs are extrinsically very high dimensional and intrinsically low dimensional, i.e., BIFs are sampled from a low-dimensional manifold and embedded in a high-dimensional space. Therefore, it is essential to find the intrinsic structure of a set of BIFs, obtain a suitable mapping to implement the dimensionality reduction, and measure the dissimilarity between two BIFs in the low-dimensional space based on their Euclidean distance. In this paper, we study the manifold constructed by a set of BIFs utilized for scene classification, form a new dimensionality reduction algorithm by preserving both the geometry of intra BIFs and the discriminative information inter BIFs termed Discriminative and Geometry Preserving Projections (DGPP), and construct a new framework for scene classification. In this framework, we represent an image based on a new BIF, which combines the intensity channel, the color channel, and the C1 unit of a color image; then we project the high-dimensional BIF to a low-dimensional space based on DGPP; and, finally, we conduct the classification based on the multiclass support vector machine (SVM). Thorough empirical studies based on the USC scene dataset demonstrate that the proposed framework improves the classification rates around 100% relatively and the training speed 60 times for different sites in comparing with previous gist proposed by Siagian and Itti in 2007.
受生物启发的特征(BIF)及其变体已被证明在场景分类中非常有效和高效。基于欧几里得距离来衡量两个 BIF 之间的不相似性是不合理的。这是因为 BIF 在外在维度上非常高,而内在维度上非常低,即 BIF 是从低维流形中采样的,并嵌入在高维空间中。因此,找到一组 BIF 的内在结构,找到合适的映射来实现降维,并基于它们的欧几里得距离在低维空间中衡量两个 BIF 之间的不相似性是至关重要的。在本文中,我们研究了用于场景分类的一组 BIF 所构造的流形,通过同时保留 BIF 内的几何形状和 BIF 间的判别信息,形成了一种新的降维算法,称为判别和几何保持投影(DGPP),并构建了一个新的场景分类框架。在这个框架中,我们基于一个新的 BIF 来表示一个图像,这个 BIF 结合了强度通道、颜色通道和彩色图像的 C1 单元;然后,我们根据 DGPP 将高维 BIF 投影到低维空间;最后,我们基于多类支持向量机(SVM)进行分类。基于 USC 场景数据集的深入实证研究表明,与 2007 年 Siagian 和 Itti 提出的先前的概图相比,所提出的框架相对提高了约 100%的分类率,并将训练速度提高了 60 倍。