Hosseini Mohammad-Parsa, Nazem-Zadeh Mohammad-Reza, Pompili Dario, Jafari-Khouzani Kourosh, Elisevich Kost, Soltanian-Zadeh Hamid
Department of Electrical and Computer Engineering, Rutgers University, New Brunswick, New Jersey 08854 and Medical Image Analysis Laboratory, Departments of Radiology and Research Administration, Henry Ford Health System, Detroit, Michigan 48202.
Medical Image Analysis Laboratory, Departments of Radiology and Research Administration, Henry Ford Health System, Detroit, Michigan 48202.
Med Phys. 2016 Jan;43(1):538. doi: 10.1118/1.4938411.
Segmentation of the hippocampus from magnetic resonance (MR) images is a key task in the evaluation of mesial temporal lobe epilepsy (mTLE) patients. Several automated algorithms have been proposed although manual segmentation remains the benchmark. Choosing a reliable algorithm is problematic since structural definition pertaining to multiple edges, missing and fuzzy boundaries, and shape changes varies among mTLE subjects. Lack of statistical references and guidance for quantifying the reliability and reproducibility of automated techniques has further detracted from automated approaches. The purpose of this study was to develop a systematic and statistical approach using a large dataset for the evaluation of automated methods and establish a method that would achieve results better approximating those attained by manual tracing in the epileptogenic hippocampus.
A template database of 195 (81 males, 114 females; age range 32-67 yr, mean 49.16 yr) MR images of mTLE patients was used in this study. Hippocampal segmentation was accomplished manually and by two well-known tools (FreeSurfer and hammer) and two previously published methods developed at their institution [Automatic brain structure segmentation (ABSS) and LocalInfo]. To establish which method was better performing for mTLE cases, several voxel-based, distance-based, and volume-based performance metrics were considered. Statistical validations of the results using automated techniques were compared with the results of benchmark manual segmentation. Extracted metrics were analyzed to find the method that provided a more similar result relative to the benchmark.
Among the four automated methods, ABSS generated the most accurate results. For this method, the Dice coefficient was 5.13%, 14.10%, and 16.67% higher, Hausdorff was 22.65%, 86.73%, and 69.58% lower, precision was 4.94%, -4.94%, and 12.35% higher, and the root mean square (RMS) was 19.05%, 61.90%, and 65.08% lower than LocalInfo, FreeSurfer, and hammer, respectively. The Bland-Altman similarity analysis revealed a low bias for the ABSS and LocalInfo techniques compared to the others.
The ABSS method for automated hippocampal segmentation outperformed other methods, best approximating what could be achieved by manual tracing. This study also shows that four categories of input data can cause automated segmentation methods to fail. They include incomplete studies, artifact, low signal-to-noise ratio, and inhomogeneity. Different scanner platforms and pulse sequences were considered as means by which to improve reliability of the automated methods. Other modifications were specially devised to enhance a particular method assessed in this study.
从磁共振(MR)图像中分割海马体是评估内侧颞叶癫痫(mTLE)患者的一项关键任务。尽管手动分割仍是基准方法,但已经提出了几种自动算法。由于mTLE患者之间与多个边缘、缺失和模糊边界以及形状变化相关的结构定义各不相同,选择一种可靠的算法存在问题。缺乏用于量化自动技术可靠性和可重复性的统计参考和指导进一步削弱了自动方法的应用。本研究的目的是开发一种系统的统计方法,使用大型数据集评估自动方法,并建立一种能在致痫海马体中获得更接近手动追踪结果的方法。
本研究使用了一个包含195例mTLE患者MR图像的模板数据库(81例男性,114例女性;年龄范围32 - 67岁,平均49.16岁)。海马体分割通过手动以及两种知名工具(FreeSurfer和hammer)和该机构之前发表的两种方法[自动脑结构分割(ABSS)和LocalInfo]来完成。为确定哪种方法在mTLE病例中表现更好,考虑了几种基于体素、基于距离和基于体积的性能指标。将使用自动技术得到的结果与基准手动分割结果进行统计验证比较。对提取的指标进行分析,以找出相对于基准提供更相似结果的方法。
在四种自动方法中,ABSS产生了最准确的结果。对于该方法,与LocalInfo、FreeSurfer和hammer相比,Dice系数分别高出5.13%、14.10%和16.67%,豪斯多夫距离分别低22.65%、86.73%和69.58%,精度分别高4.94%、 - 4.94%和12.35%,均方根(RMS)分别低19.05%、61.90%和65.08%。Bland - Altman相似性分析显示,与其他方法相比,ABSS和LocalInfo技术的偏差较低。
用于自动海马体分割的ABSS方法优于其他方法,最接近手动追踪所能达到的效果。本研究还表明,四类输入数据可能导致自动分割方法失败。它们包括不完整的研究、伪影、低信噪比和不均匀性。考虑了不同的扫描仪平台和脉冲序列,作为提高自动方法可靠性的手段。还专门设计了其他修改措施来增强本研究中评估的特定方法。