Google Inc., Mountain View, CA 94043, USA.
IEEE Trans Pattern Anal Mach Intell. 2012 Nov;34(11):2259-73. doi: 10.1109/TPAMI.2012.21.
Machine learning techniques for computer vision applications like object recognition, scene classification, etc., require a large number of training samples for satisfactory performance. Especially when classification is to be performed over many categories, providing enough training samples for each category is infeasible. This paper describes new ideas in multiclass active learning to deal with the training bottleneck, making it easier to train large multiclass image classification systems. First, we propose a new interaction modality for training which requires only yes-no type binary feedback instead of a precise category label. The modality is especially powerful in the presence of hundreds of categories. For the proposed modality, we develop a Value-of-Information (VOI) algorithm that chooses informative queries while also considering user annotation cost. Second, we propose an active selection measure that works with many categories and is extremely fast to compute. This measure is employed to perform a fast seed search before computing VOI, resulting in an algorithm that scales linearly with dataset size. Third, we use locality sensitive hashing to provide a very fast approximation to active learning, which gives sublinear time scaling, allowing application to very large datasets. The approximation provides up to two orders of magnitude speedups with little loss in accuracy. Thorough empirical evaluation of classification accuracy, noise sensitivity, imbalanced data, and computational performance on a diverse set of image datasets demonstrates the strengths of the proposed algorithms.
机器学习技术在计算机视觉应用中,如目标识别、场景分类等,需要大量的训练样本才能达到满意的性能。特别是在需要对多个类别进行分类时,为每个类别提供足够的训练样本是不可行的。本文描述了用于处理训练瓶颈的多类主动学习的新思路,使训练大型多类图像分类系统变得更加容易。首先,我们提出了一种新的训练交互模式,只需要是/否类型的二进制反馈,而不需要精确的类别标签。在存在数百个类别的情况下,这种模式特别强大。对于所提出的模式,我们开发了一种信息价值(VOI)算法,该算法在选择信息量丰富的查询的同时还考虑用户的注释成本。其次,我们提出了一种适用于多类别的主动选择度量标准,并且计算速度极快。该度量标准用于在计算 VOI 之前进行快速种子搜索,从而导致算法的规模与数据集的大小呈线性关系。第三,我们使用局部敏感哈希来提供非常快速的主动学习近似,这可以实现次线性时间缩放,从而可以应用于非常大的数据集。该近似提供了高达两个数量级的加速,而精度损失很小。在各种图像数据集上对分类准确性、噪声敏感性、不平衡数据和计算性能进行了彻底的实证评估,证明了所提出算法的优势。