Nieves-Vazquez Heriberto A, Ozkaya Efe, Meinhold Waiman, Geahchan Amine, Bane Octavia, Ueda Jun, Taouli Bachir
Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, Georgia, USA.
BioMedical Engineering and Imaging Institute, Icahn School of Medicine Mount Sinai, New York, New York, USA.
J Magn Reson Imaging. 2025 Feb;61(2):985-994. doi: 10.1002/jmri.29490. Epub 2024 Jun 22.
Several factors can impair image quality and reliability of liver magnetic resonance elastography (MRE), such as inadequate driver positioning, insufficient wave propagation and patient-related factors.
To report initial results on automatic classification of liver MRE image quality using various deep learning (DL) architectures.
Retrospective, single center, IRB-approved human study.
Ninety patients (male = 51, mean age 52.8 ± 14.1 years).
FIELD STRENGTHS/SEQUENCES: 1.5 T and 3 T MRI, 2D GRE, and 2D SE-EPI.
The curated dataset was comprised of 914 slices obtained from 149 MRE exams in 90 patients. Two independent observers examined the confidence map overlaid elastograms (CMOEs) for liver stiffness measurement and assigned a quality score (non-diagnostic vs. diagnostic) for each slice. Several DL architectures (ResNet18, ResNet34, ResNet50, SqueezeNet, and MobileNetV2) for binary quality classification of individual CMOE slice inputs were evaluated, using an 8-fold stratified cross-validation (800 slices) and a test dataset (114 slices). A majority vote ensemble combining the models' predictions of the highest-performing architecture was evaluated.
The inter-observer agreement and the agreement between DL models and one observer were assessed using Cohen's unweighted Kappa coefficient. Accuracy, precision, and recall of the cross-validation and the ensemble were calculated for the test dataset.
The average accuracy across the eight models trained using each architecture ranged from 0.692 to 0.851 for the test dataset. The ensemble of the best performing architecture (SqueezeNet) yielded an accuracy of 0.921. The inter-observer agreement was excellent (Kappa 0.896 [95% CI 0.845-0.947]). The agreement between observer 1 and the predictions of each SqueezeNet model was slight to almost perfect (Kappa range: 0.197-0.831) and almost perfect for the ensemble (Kappa: 0.833).
Our initial study demonstrates an automated DL-based approach for classifying liver 2D MRE diagnostic quality with an average accuracy of 0.851 (range 0.675-0.921) across the SqueezeNet models.
4 TECHNICAL EFFICACY: Stage 1.
多种因素会损害肝脏磁共振弹性成像(MRE)的图像质量和可靠性,如驱动定位不当、波传播不足以及与患者相关的因素。
报告使用各种深度学习(DL)架构对肝脏MRE图像质量进行自动分类的初步结果。
回顾性、单中心、经机构审查委员会批准的人体研究。
90名患者(男性51名,平均年龄52.8±14.1岁)。
场强/序列:1.5T和3T MRI、二维梯度回波(2D GRE)以及二维自旋回波平面回波成像(2D SE-EPI)。
精心整理的数据集由90名患者149次MRE检查获得的914个切片组成。两名独立观察者检查叠加弹性图的置信度图(CMOE)以测量肝脏硬度,并为每个切片分配质量评分(非诊断性与诊断性)。使用8倍分层交叉验证(800个切片)和测试数据集(114个切片)评估了几种用于对单个CMOE切片输入进行二元质量分类的DL架构(ResNet18、ResNet34、ResNet50、SqueezeNet和MobileNetV2)。评估了结合表现最佳架构的模型预测的多数投票集成方法。
使用科恩无加权kappa系数评估观察者间的一致性以及DL模型与一名观察者之间的一致性。计算测试数据集的交叉验证和集成方法的准确性、精确性和召回率。
使用每种架构训练的八个模型在测试数据集上的平均准确率范围为0.69 ~ 0.851。表现最佳的架构(SqueezeNet)的集成方法准确率为0.921。观察者间的一致性极佳(kappa 0.896 [95% CI 0.845 - 0.947])。观察者1与每个SqueezeNet模型预测之间的一致性为轻微到几乎完美(kappa范围:0.197 - 0.831),而集成方法几乎完美(kappa:0.833)。
我们的初步研究展示了一种基于DL的自动方法,用于对肝脏二维MRE诊断质量进行分类,在SqueezeNet模型中平均准确率为0.851(范围0.675 - 0.921)。
4 技术效能:1级。