Sghirripa Sabrina, Bhalerao Gaurav, Griffanti Ludovica, Gillis Grace, Mackay Clare, Voets Natalie, Wong Stephanie, Jenkinson Mark
Australian Institute for Machine Learning, School of Computer and Mathematical Sciences, The University of Adelaide, Adelaide, South Australia, Australia.
Hopwood Centre of Neurobiology, Lifelong Health Theme, South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia.
Hum Brain Mapp. 2025 Apr 1;46(5):e70200. doi: 10.1002/hbm.70200.
Given the relationship between hippocampal atrophy and cognitive impairment in various pathological conditions, hippocampus segmentation from MRI is an important task in neuroimaging. Manual segmentation, though considered the gold standard, is time-consuming and error-prone, leading to the development of numerous automatic segmentation methods. However, no study has yet independently compared the performance of traditional, deep learning-based and hippocampal subfield segmentation methods within a single investigation. We evaluated 10 automatic hippocampal segmentation methods (FreeSurfer, SynthSeg, FastSurfer, FIRST, e2dhipseg, Hippmapper, Hippodeep, FreeSurfer-Subfields, HippUnfold and HSF) across 3 datasets with manually segmented hippocampus labels. Performance metrics included overlap with manual labels, correlations between manual and automatic volumes, volume similarity, diagnostic group differentiation and systematically located false positives and negatives. Most methods, especially deep learning-based ones that were trained on manual labels, performed well on public datasets but showed more error and variability on clinical data. Many methods tended to over-segment, particularly at the anterior hippocampus border, but were able to distinguish between healthy controls, MCI, and dementia patients based on hippocampal volume. Our findings highlight the challenges in hippocampal segmentation from MRI and the need for more publicly accessible datasets with manual labels across diverse ages and pathological conditions.
鉴于在各种病理状况下海马萎缩与认知障碍之间的关系,从磁共振成像(MRI)中进行海马分割是神经影像学中的一项重要任务。手动分割虽被视为金标准,但耗时且容易出错,这促使了众多自动分割方法的发展。然而,尚无研究在单一调查中独立比较传统方法、基于深度学习的方法以及海马亚区分割方法的性能。我们在3个带有手动分割海马标签的数据集上评估了10种自动海马分割方法(FreeSurfer、SynthSeg、FastSurfer、FIRST、e2dhipseg、Hippmapper、Hippodeep、FreeSurfer-Subfields、HippUnfold和HSF)。性能指标包括与手动标签的重叠度、手动和自动体积之间的相关性、体积相似性、诊断组区分以及系统性定位的假阳性和假阴性。大多数方法,尤其是基于在手动标签上训练的深度学习方法,在公共数据集上表现良好,但在临床数据上显示出更多误差和变异性。许多方法倾向于过度分割,特别是在海马前部边界,但能够根据海马体积区分健康对照、轻度认知障碍(MCI)和痴呆患者。我们的研究结果突出了从MRI进行海马分割的挑战,以及需要更多跨不同年龄和病理状况且带有手动标签的可公开获取数据集。