Suppr超能文献

多类图像分类的可扩展主动学习。

Scalable active learning for multiclass image classification.

机构信息

Google Inc., Mountain View, CA 94043, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2012 Nov;34(11):2259-73. doi: 10.1109/TPAMI.2012.21.

Abstract

Machine learning techniques for computer vision applications like object recognition, scene classification, etc., require a large number of training samples for satisfactory performance. Especially when classification is to be performed over many categories, providing enough training samples for each category is infeasible. This paper describes new ideas in multiclass active learning to deal with the training bottleneck, making it easier to train large multiclass image classification systems. First, we propose a new interaction modality for training which requires only yes-no type binary feedback instead of a precise category label. The modality is especially powerful in the presence of hundreds of categories. For the proposed modality, we develop a Value-of-Information (VOI) algorithm that chooses informative queries while also considering user annotation cost. Second, we propose an active selection measure that works with many categories and is extremely fast to compute. This measure is employed to perform a fast seed search before computing VOI, resulting in an algorithm that scales linearly with dataset size. Third, we use locality sensitive hashing to provide a very fast approximation to active learning, which gives sublinear time scaling, allowing application to very large datasets. The approximation provides up to two orders of magnitude speedups with little loss in accuracy. Thorough empirical evaluation of classification accuracy, noise sensitivity, imbalanced data, and computational performance on a diverse set of image datasets demonstrates the strengths of the proposed algorithms.

摘要

机器学习技术在计算机视觉应用中,如目标识别、场景分类等,需要大量的训练样本才能达到满意的性能。特别是在需要对多个类别进行分类时,为每个类别提供足够的训练样本是不可行的。本文描述了用于处理训练瓶颈的多类主动学习的新思路,使训练大型多类图像分类系统变得更加容易。首先,我们提出了一种新的训练交互模式,只需要是/否类型的二进制反馈,而不需要精确的类别标签。在存在数百个类别的情况下,这种模式特别强大。对于所提出的模式,我们开发了一种信息价值(VOI)算法,该算法在选择信息量丰富的查询的同时还考虑用户的注释成本。其次,我们提出了一种适用于多类别的主动选择度量标准,并且计算速度极快。该度量标准用于在计算 VOI 之前进行快速种子搜索,从而导致算法的规模与数据集的大小呈线性关系。第三,我们使用局部敏感哈希来提供非常快速的主动学习近似,这可以实现次线性时间缩放,从而可以应用于非常大的数据集。该近似提供了高达两个数量级的加速,而精度损失很小。在各种图像数据集上对分类准确性、噪声敏感性、不平衡数据和计算性能进行了彻底的实证评估,证明了所提出算法的优势。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验