IEEE Rev Biomed Eng. 2017;10:213-234. doi: 10.1109/RBME.2017.2651164. Epub 2017 Jan 10.
Multiple-instance learning (MIL) is a recent machine-learning paradigm that is particularly well suited to medical image and video analysis (MIVA) tasks. Based solely on class labels assigned globally to images or videos, MIL algorithms learn to detect relevant patterns locally in images or videos. These patterns are then used for classification at a global level. Because supervision relies on global labels, manual segmentations are not needed to train MIL algorithms, unlike traditional single-instance learning (SIL) algorithms. Consequently, these solutions are attracting increasing interest from the MIVA community: since the term was coined by Dietterich et al. in 1997, 73 research papers about MIL have been published in the MIVA literature. This paper reviews the existing strategies for modeling MIVA tasks as MIL problems, recommends general-purpose MIL algorithms for each type of MIVA tasks, and discusses MIVA-specific MIL algorithms. Various experiments performed in medical image and video datasets are compiled in order to back up these discussions. This meta-analysis shows that, besides being more convenient than SIL solutions, MIL algorithms are also more accurate in many cases. In other words, MIL is the ideal solution for many MIVA tasks. Recent trends are discussed, and future directions are proposed for this emerging paradigm.
多示例学习(MIL)是一种新兴的机器学习范例,特别适合医学图像和视频分析(MIVA)任务。MIL 算法仅基于全局分配给图像或视频的类别标签,学习在图像或视频中局部检测相关模式。然后,这些模式用于全局分类。由于监督依赖于全局标签,因此与传统的单示例学习(SIL)算法不同,不需要手动分割来训练 MIL 算法。因此,这些解决方案越来越受到 MIVA 社区的关注:自 1997 年 Dietterich 等人提出该术语以来,已有 73 篇关于 MIL 的研究论文发表在 MIVA 文献中。本文回顾了将 MIVA 任务建模为 MIL 问题的现有策略,为每种类型的 MIVA 任务推荐了通用的 MIL 算法,并讨论了特定于 MIVA 的 MIL 算法。为了支持这些讨论,编译了在医学图像和视频数据集上进行的各种实验。这项荟萃分析表明,除了比 SIL 解决方案更方便之外,在许多情况下,MIL 算法也更准确。换句话说,MIL 是许多 MIVA 任务的理想解决方案。讨论了最近的趋势,并为这一新兴范例提出了未来的方向。