Karimi Davood
Computational Radiology Laboratory, Department of Radiology, Boston Children's Hospital, and Harvard Medical School, Boston, MA, USA.
Med Image Comput Comput Assist Interv. 2024 Oct;15012:421-431. doi: 10.1007/978-3-031-72390-2_40. Epub 2024 Oct 23.
Existing machine learning methods for brain image analysis are mostly based on supervised training. They require large labeled datasets, which can be costly or impossible to obtain. Moreover, the trained models are useful only for the narrow task defined by the labels. In this work, we developed a new method, based on the concept of foundation models, to overcome these limitations. Our model is an attention-based neural network that is trained using a novel self-supervised approach. Specifically, the model is trained to generate brain images in a patch-wise manner, thereby learning the brain structure. To facilitate learning of image details, we propose a new method that encodes high-frequency information using convolutional kernels with random weights. We trained our model on a pool of 10 public datasets. We then applied the model on five independent datasets to perform segmentation, lesion detection, denoising, and brain age estimation. Results showed that the foundation model achieved competitive or better results on all tasks, while significantly reducing the required amount of labeled training data. Our method enables leveraging large unlabeled neuroimaging datasets to effectively address diverse brain image analysis tasks and reduce the time and cost requirements of acquiring labels.
现有的用于脑图像分析的机器学习方法大多基于监督训练。它们需要大量带标签的数据集,而获取这些数据集可能成本高昂或根本无法实现。此外,训练好的模型仅对标签所定义的狭窄任务有用。在这项工作中,我们基于基础模型的概念开发了一种新方法,以克服这些局限性。我们的模型是一个基于注意力的神经网络,使用一种新颖的自监督方法进行训练。具体而言,该模型被训练以逐块的方式生成脑图像,从而学习脑结构。为了便于学习图像细节,我们提出了一种新方法,该方法使用具有随机权重的卷积核来编码高频信息。我们在10个公共数据集的集合上训练了我们的模型。然后,我们将该模型应用于五个独立数据集,以执行分割、病变检测、去噪和脑年龄估计。结果表明,基础模型在所有任务上都取得了具有竞争力的或更好的结果,同时显著减少了所需的带标签训练数据量。我们的方法能够利用大量未标记的神经影像数据集来有效解决各种脑图像分析任务,并降低获取标签的时间和成本要求。