Heidelberg University, HCI, Speyerer Str. 6, D-69115 Heidelberg, Germany.
Comput Med Imaging Graph. 2015 Jun;42:44-50. doi: 10.1016/j.compmedimag.2014.11.010. Epub 2014 Nov 20.
Supervised machine learning is a powerful tool frequently used in computer-aided diagnosis (CAD) applications. The bottleneck of this technique is its demand for fine grained expert annotations, which are tedious for medical image analysis applications. Furthermore, information is typically localized in diagnostic images, which makes representation of an entire image by a single feature set problematic. The multiple instance learning framework serves as a remedy to these two problems by allowing labels to be provided for groups of observations, called bags, and assuming the group label to be the maximum of the instance labels within the bag. This setup can effectively be applied to CAD by splitting a given diagnostic image into a Cartesian grid, treating each grid element (patch) as an instance by representing it with a feature set, and grouping instances belonging to the same image into a bag. We quantify the power of existing multiple instance learning methods by evaluating their performance on two distinct CAD applications: (i) Barrett's cancer diagnosis and (ii) diabetic retinopathy screening. In the experiments, mi-Graph appears as the best-performing method in bag-level prediction (i.e. diagnosis) for both of these applications that have drastically different visual characteristics. For instance-level prediction (i.e. disease localization), mi-SVM ranks as the most accurate method.
监督机器学习是一种常用于计算机辅助诊断 (CAD) 应用的强大工具。该技术的瓶颈在于其对精细的专家注释的需求,这对于医学图像分析应用来说是繁琐的。此外,信息通常在诊断图像中本地化,这使得通过单个特征集表示整个图像成为问题。多实例学习框架通过允许为称为袋子的观察分组提供标签,并假设组标签是袋子内实例标签的最大值,从而解决了这两个问题。通过将给定的诊断图像分割成笛卡尔网格,将每个网格元素(补丁)表示为一个实例,并使用特征集表示它,然后将属于同一图像的实例分组到一个袋子中,可以有效地将这种设置应用于 CAD。我们通过在两个不同的 CAD 应用程序上评估现有多实例学习方法的性能来量化它们的能力:(i) Barrett 癌症诊断和(ii)糖尿病性视网膜病变筛查。在实验中,mi-Graph 似乎是这两种具有明显不同视觉特征的应用程序中在袋级预测(即诊断)方面表现最好的方法。对于实例级预测(即疾病定位),mi-SVM 是最准确的方法。