Destrero Augusto, De Mol Christine, Odone Francesca, Verri Alessandro
Department of Computer and Information Sciences, Università di Genova, Genova, Italy.
IEEE Trans Image Process. 2009 Jan;18(1):188-201. doi: 10.1109/TIP.2008.2007610.
In this paper, we propose a new trainable system for selecting face features from over-complete dictionaries of image measurements. The starting point is an iterative thresholding algorithm which provides sparse solutions to linear systems of equations. Although the proposed methodology is quite general and could be applied to various image classification tasks, we focus here on the case study of face and eyes detection. For our initial representation, we adopt rectangular features in order to allow straightforward comparisons with existing techniques. For computational efficiency and memory saving requirements, instead of implementing the full optimization scheme on tenths of thousands of features, we propose a three-stage architecture which consists of finding first intermediate solutions to smaller size optimization problems, then merging the obtained results, and next applying further selection procedures. The devised system requires the solution of a number of independent problems, and, hence, the necessary computations could be implemented in parallel. Experimental results obtained on both benchmark and newly acquired face and eyes images indicate that our method is a serious competitor to other feature selection schemes recently popularized in computer vision for dealing with problems of real-time object detection. A major advantage of the proposed system is that it performs well even with relatively small training sets.
在本文中,我们提出了一种新的可训练系统,用于从图像测量的超完备字典中选择面部特征。起点是一种迭代阈值算法,它为线性方程组提供稀疏解。尽管所提出的方法相当通用,可应用于各种图像分类任务,但我们在此专注于面部和眼睛检测的案例研究。对于我们的初始表示,我们采用矩形特征,以便能与现有技术进行直接比较。出于计算效率和内存节省的要求,我们不是在数以万计的特征上实施完整的优化方案,而是提出一种三阶段架构,该架构首先为由较小规模优化问题找到中间解,然后合并所得结果,接着应用进一步的选择程序。所设计的系统需要解决多个独立问题,因此,必要的计算可以并行实现。在基准图像以及新获取的面部和眼睛图像上获得的实验结果表明,我们的方法是计算机视觉中最近流行的用于处理实时目标检测问题的其他特征选择方案的有力竞争对手。所提出系统的一个主要优点是,即使训练集相对较小,它也能表现良好。