IEEE J Biomed Health Inform. 2021 Sep;25(9):3541-3553. doi: 10.1109/JBHI.2021.3064353. Epub 2021 Sep 3.
Automatic quantification of the left ventricle (LV) from cardiac magnetic resonance (CMR) images plays an important role in making the diagnosis procedure efficient, reliable, and alleviating the laborious reading work for physicians. Considerable efforts have been devoted to LV quantification using different strategies that include segmentation-based (SG) methods and the recent direct regression (DR) methods. Although both SG and DR methods have obtained great success for the task, a systematic platform to benchmark them remains absent because of differences in label information during model learning. In this paper, we conducted an unbiased evaluation and comparison of cardiac LV quantification methods that were submitted to the Left Ventricle Quantification (LVQuan) challenge, which was held in conjunction with the Statistical Atlases and Computational Modeling of the Heart (STACOM) workshop at the MICCAI 2018. The challenge was targeted at the quantification of 1) areas of LV cavity and myocardium, 2) dimensions of the LV cavity, 3) regional wall thicknesses (RWT), and 4) the cardiac phase, from mid-ventricle short-axis CMR images. First, we constructed a public quantification dataset Cardiac-DIG with ground truth labels for both the myocardium mask and these quantification targets across the entire cardiac cycle. Then, the key techniques employed by each submission were described. Next, quantitative validation of these submissions were conducted with the constructed dataset. The evaluation results revealed that both SG and DR methods can offer good LV quantification performance, even though DR methods do not require densely labeled masks for supervision. Among the 12 submissions, the DR method LDAMT offered the best performance, with a mean estimation error of 301 mm for the two areas, 2.15 mm for the cavity dimensions, 2.03 mm for RWTs, and a 9.5% error rate for the cardiac phase classification. Three of the SG methods also delivered comparable performances. Finally, we discussed the advantages and disadvantages of SG and DR methods, as well as the unsolved problems in automatic cardiac quantification for clinical practice applications.
从心脏磁共振(CMR)图像自动量化左心室(LV)在提高诊断程序的效率、可靠性和减轻医生的繁琐阅读工作方面发挥着重要作用。已经投入了相当大的努力来使用不同的策略进行 LV 量化,包括基于分割(SG)的方法和最近的直接回归(DR)方法。尽管 SG 和 DR 方法在该任务上都取得了巨大的成功,但由于模型学习过程中标签信息的差异,仍然缺乏一个系统的基准平台来对它们进行评估。在本文中,我们对提交到左心室量化(LVQuan)挑战赛的心脏 LV 量化方法进行了无偏评估和比较,该挑战赛是在 2018 年 MICCAI 的统计图谱和心脏计算建模(STACOM)研讨会上举办的。挑战赛的目标是量化 1)LV 腔和心肌的区域,2)LV 腔的尺寸,3)局部壁厚度(RWT),以及 4)从中心室短轴 CMR 图像获取的心脏相位。首先,我们构建了一个公共量化数据集 Cardiac-DIG,其中包含整个心动周期内心肌掩模和这些量化目标的真实标签。然后,描述了每个提交的关键技术。接下来,使用构建的数据集对这些提交进行了定量验证。评估结果表明,SG 和 DR 方法都可以提供良好的 LV 量化性能,即使 DR 方法不需要密集标记的掩模进行监督。在 12 个提交中,DR 方法 LDAMT 表现最好,两个区域的平均估计误差为 301mm,腔尺寸的平均估计误差为 2.15mm,RWT 的平均估计误差为 2.03mm,心脏相位分类的平均错误率为 9.5%。三个 SG 方法的表现也相当。最后,我们讨论了 SG 和 DR 方法的优缺点,以及自动心脏量化在临床实践应用中尚未解决的问题。