Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital/Harvard Medical School, MA 02129, USA.
Med Image Anal. 2013 Oct;17(7):790-804. doi: 10.1016/j.media.2013.04.013. Epub 2013 May 10.
Leveraging available annotated data is an essential component of many modern methods for medical image analysis. In particular, approaches making use of the "neighbourhood" structure between images for this purpose have shown significant potential. Such techniques achieve high accuracy in analysing an image by propagating information from its immediate "neighbours" within an annotated database. Despite their success in certain applications, wide use of these methods is limited due to the challenging task of determining the neighbours for an out-of-sample image. This task is either computationally expensive due to large database sizes and costly distance evaluations, or infeasible due to distance definitions over semantic information, such as ground truth annotations, which is not available for out-of-sample images. This article introduces Neighbourhood Approximation Forests (NAFs), a supervised learning algorithm providing a general and efficient approach for the task of approximate nearest neighbour retrieval for arbitrary distances. Starting from an image training database and a user-defined distance between images, the algorithm learns to use appearance-based features to cluster images approximating the neighbourhood structured induced by the distance. NAF is able to efficiently infer nearest neighbours of an out-of-sample image, even when the original distance is based on semantic information. We perform experimental evaluation in two different scenarios: (i) age prediction from brain MRI and (ii) patch-based segmentation of unregistered, arbitrary field of view CT images. The results demonstrate the performance, computational benefits, and potential of NAF for different image analysis applications.
利用现有的标注数据是许多医学图像分析现代方法的重要组成部分。特别是,为实现这一目标而利用图像之间“邻域”结构的方法显示出了巨大的潜力。这些技术通过从标注数据库中图像的直接“邻居”传播信息,从而在分析图像时达到了高精度。尽管这些方法在某些应用中取得了成功,但由于确定样本外图像的邻居的任务具有挑战性,因此它们的广泛使用受到了限制。这个任务要么由于数据库规模大且距离评估成本高而计算成本高,要么由于距离定义在语义信息上而变得不可行,例如地面真实标注,而样本外图像则无法获得这些信息。本文介绍了邻域逼近森林(NAF),这是一种监督学习算法,为任意距离的近似最近邻检索任务提供了一种通用且高效的方法。该算法从图像训练数据库和用户定义的图像之间的距离开始,学习使用基于外观的特征来聚类图像,从而近似由距离诱导的邻域结构。即使原始距离基于语义信息,NAF 也能够有效地推断样本外图像的最近邻。我们在两个不同的场景中进行了实验评估:(i)基于脑 MRI 的年龄预测,(ii)基于未注册、任意视场 CT 图像的基于补丁的分割。结果表明了 NAF 在不同的图像分析应用中的性能、计算优势和潜力。