使用分类森林（CF）、卷积神经网络（CNN）和多图谱（MA）方法对正常全身磁共振成像（MRI）进行全自动多器官分割。

Fully automatic, multiorgan segmentation in normal whole body magnetic resonance imaging (MRI), using classification forests (CFs), convolutional neural networks (CNNs), and a multi-atlas (MA) approach.

机构信息

Imperial College Comprehensive Cancer Imaging Centre (C.C.I.C.), Hammersmith Campus, Commonwealth Building Main Office, Ground Floor, Du Cane Road, London, W12 0NN, UK.

Biomedical Image Analysis Group, Department of Computing, Imperial College London, Huxley Building, 180 Queen's Gate, London, SW7 2AZ, UK.

出版信息

Med Phys. 2017 Oct;44(10):5210-5220. doi: 10.1002/mp.12492. Epub 2017 Aug 31.

DOI:10.1002/mp.12492

PMID:28756622

Abstract

PURPOSE

As part of a program to implement automatic lesion detection methods for whole body magnetic resonance imaging (MRI) in oncology, we have developed, evaluated, and compared three algorithms for fully automatic, multiorgan segmentation in healthy volunteers.

METHODS

The first algorithm is based on classification forests (CFs), the second is based on 3D convolutional neural networks (CNNs) and the third algorithm is based on a multi-atlas (MA) approach. We examined data from 51 healthy volunteers, scanned prospectively with a standardized, multiparametric whole body MRI protocol at 1.5 T. The study was approved by the local ethics committee and written consent was obtained from the participants. MRI data were used as input data to the algorithms, while training was based on manual annotation of the anatomies of interest by clinical MRI experts. Fivefold cross-validation experiments were run on 34 artifact-free subjects. We report three overlap and three surface distance metrics to evaluate the agreement between the automatic and manual segmentations, namely the dice similarity coefficient (DSC), recall (RE), precision (PR), average surface distance (ASD), root-mean-square surface distance (RMSSD), and Hausdorff distance (HD). Analysis of variances was used to compare pooled label metrics between the three algorithms and the DSC on a 'per-organ' basis. A Mann-Whitney U test was used to compare the pooled metrics between CFs and CNNs and the DSC on a 'per-organ' basis, when using different imaging combinations as input for training.

RESULTS

All three algorithms resulted in robust segmenters that were effectively trained using a relatively small number of datasets, an important consideration in the clinical setting. Mean overlap metrics for all the segmented structures were: CFs: DSC = 0.70 ± 0.18, RE = 0.73 ± 0.18, PR = 0.71 ± 0.14, CNNs: DSC = 0.81 ± 0.13, RE = 0.83 ± 0.14, PR = 0.82 ± 0.10, MA: DSC = 0.71 ± 0.22, RE = 0.70 ± 0.34, PR = 0.77 ± 0.15. Mean surface distance metrics for all the segmented structures were: CFs: ASD = 13.5 ± 11.3 mm, RMSSD = 34.6 ± 37.6 mm and HD = 185.7 ± 194.0 mm, CNNs; ASD = 5.48 ± 4.84 mm, RMSSD = 17.0 ± 13.3 mm and HD = 199.0 ± 101.2 mm, MA: ASD = 4.22 ± 2.42 mm, RMSSD = 6.13 ± 2.55 mm, and HD = 38.9 ± 28.9 mm. The pooled performance of CFs improved when all imaging combinations (T2w + T1w + DWI) were used as input, while the performance of CNNs deteriorated, but in neither case, significantly. CNNs with T2w images as input, performed significantly better than CFs with all imaging combinations as input for all anatomical labels, except for the bladder.

CONCLUSIONS

Three state-of-the-art algorithms were developed and used to automatically segment major organs and bones in whole body MRI; good agreement to manual segmentations performed by clinical MRI experts was observed. CNNs perform favorably, when using T2w volumes as input. Using multimodal MRI data as input to CNNs did not improve the segmentation performance.

摘要

目的

作为在肿瘤学中实施全身磁共振成像（MRI）自动病变检测方法计划的一部分，我们开发、评估和比较了三种用于健康志愿者多器官自动分割的算法。

方法

第一种算法基于分类森林（CFs），第二种算法基于 3D 卷积神经网络（CNNs），第三种算法基于多图谱（MA）方法。我们检查了 51 名健康志愿者的数据，这些志愿者前瞻性地在 1.5T 下进行了标准化的多参数全身 MRI 扫描。该研究得到了当地伦理委员会的批准，并获得了参与者的书面同意。MRI 数据被用作算法的输入数据，而训练则基于临床 MRI 专家对感兴趣解剖结构的手动注释。在 34 个无伪影的受试者上进行了五次交叉验证实验。我们报告了三种重叠和三种表面距离指标来评估自动分割与手动分割之间的一致性，即骰子相似系数（DSC）、召回率（RE）、精度（PR）、平均表面距离（ASD）、均方根表面距离（RMSSD）和 Hausdorff 距离（HD）。方差分析用于比较三种算法和 DSC 在“逐个器官”基础上的汇总标签指标。当使用不同的成像组合作为训练输入时，使用 Mann-Whitney U 检验比较 CFs 和 CNNs 与 DSC 之间的汇总指标。

结果

所有三种算法都得到了稳健的分割器，它们使用相对较少的数据集进行了有效训练，这在临床环境中是一个重要的考虑因素。所有分割结构的平均重叠指标为：CFs：DSC=0.70±0.18，RE=0.73±0.18，PR=0.71±0.14，CNNs：DSC=0.81±0.13，RE=0.83±0.14，PR=0.82±0.10，MA：DSC=0.71±0.22，RE=0.70±0.34，PR=0.77±0.15。所有分割结构的平均表面距离指标为：CFs：ASD=13.5±11.3mm，RMSSD=34.6±37.6mm和 HD=185.7±194.0mm，CNNs；ASD=5.48±4.84mm，RMSSD=17.0±13.3mm和 HD=199.0±101.2mm，MA：ASD=4.22±2.42mm，RMSSD=6.13±2.55mm，HD=38.9±28.9mm。当将所有成像组合（T2w+T1w+DWI）用作输入时，CFs 的性能得到了提高，而 CNNs 的性能则恶化，但在这两种情况下都没有显著提高。当输入 T2w 图像时，CNN 的性能明显优于将所有成像组合作为输入的 CFs，除了膀胱。