Institute of Applied Mathematics, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany.
From the Department of Neuroradiology, University Medical Center Bonn, Rheinische Friedrich-Wilhelms-Universität Bonn.
Invest Radiol. 2023 Jun 1;58(6):420-430. doi: 10.1097/RLI.0000000000000955. Epub 2023 Jan 28.
The purpose of this study was to implement a state-of-the-art convolutional neural network used to synthesize artificial T1-weighted (T1w) full-dose images from corresponding noncontrast and low-dose images (using various settings of input sequences) and test its performance on a patient population acquired prospectively.
In this monocentric, institutional review board-approved study, a total of 138 participants were included who received an adapted imaging protocol with acquisition of a T1w low dose after administration of 10% of the standard dose and acquisition of a T1w full dose after administration of the remaining 90% of the standard dose of a gadolinium-containing contrast agent. A total of 83 participants formed the training sample (51.7 ± 16.5 years, 36 women), 25 the validation sample (55.3 ± 16.4 years, 11 women), and 30 the test sample (55.0 ± 15.0 years, 9 women). Four input settings were differentiated: only the T1w noncontrast and T1w low-dose images (standard setting), only the T1w noncontrast and T1w low-dose images with a prolonged postinjection time of 5 minutes (5-minute setting), multiple noncontrast sequences (T1w, T2w, diffusion) and the T1w low-dose images (extended setting), and only noncontrast sequences (T1w, T2w, diffusion) were used (zero-dose setting). For each setting, a deep neural network was trained to synthesize artificial T1w full-dose images, which were assessed on the test sample using an objective evaluation based on quantitative metrics and a subjective evaluation through a reader-based study. Three readers scored the overall image quality, the interchangeability in regard to the clinical conclusion compared with the true T1w full-dose sequence, the contrast enhancement of lesions, and their conformity to the respective references in the true T1w full dose.
Quantitative analysis of the artificial T1w full-dose images of the standard setting provided a peak signal-to-noise ratio of 33.39 ± 0.62 (corresponding to an average improvement of the low-dose sequences of 5.2 dB) and a structural similarity index measure of 0.938 ± 0.005. In the 4-fold cross-validation, the extended setting yielded similar performance to the standard setting in terms of peak signal-to-noise ratio ( P = 0.20), but a slight improvement in structural similarity index measure ( P < 0.0001). For all settings, the reader study found comparable overall image quality between the original and artificial T1w full-dose images. The proportion of scans scored as fully or mostly interchangeable was 55%, 58%, 43%, and 3% and the average counts of false positives per case were 0.42 ± 0.83, 0.34 ± 0.71, 0.82 ± 1.15, and 2.00 ± 1.07 for the standard, 5-minute, extended, and zero-dose setting, respectively. Using a 5-point Likert scale (0 to 4, 0 being the worst), all settings of synthesized full-dose images showed significantly poorer contrast enhancement of lesions compared with the original full-dose sequence (difference of average degree of contrast enhancement-standard: -0.97 ± 0.83, P = <0.001; 5-minute: -0.93 ± 0.91, P = <0.001; extended: -0.96 ± 0.97, P = <0.001; zero-dose: -2.39 ± 1.14, P = <0.001). The average scores of conformity of the lesions compared with the original full-dose sequence were 2.25 ± 1.21, 2.22 ± 1.27, 2.24 ± 1.25, and 0.73 ± 0.93 for the standard, 5-minute, extended, and zero-dose setting, respectively.
The tested deep learning algorithm for synthesis of artificial T1w full-dose sequences based on images after administration of only 10% of the standard dose of a gadolinium-based contrast agent showed very good quantitative performance. Despite good image quality in all settings, both false-negative and false-positive signals resulted in significantly limited interchangeability of the synthesized sequences with the original full-dose sequences.
本研究旨在应用最先进的卷积神经网络,从相应的非对比和低剂量图像(使用各种输入序列设置)中合成人工 T1 加权(T1w)全剂量图像,并在前瞻性采集的患者人群中测试其性能。
在这项单中心、机构审查委员会批准的研究中,共纳入 138 名参与者,他们接受了改良的成像方案,在给予标准剂量 10%的钆基造影剂后,采集 T1w 低剂量图像,在给予标准剂量剩余 90%的钆基造影剂后,采集 T1w 全剂量图像。共有 83 名参与者构成训练样本(51.7±16.5 岁,36 名女性),25 名参与者构成验证样本(55.3±16.4 岁,11 名女性),30 名参与者构成测试样本(55.0±15.0 岁,9 名女性)。区分了 4 种输入设置:仅 T1w 非对比和 T1w 低剂量图像(标准设置),仅 T1w 非对比和 T1w 低剂量图像,注射后延长 5 分钟(5 分钟设置),多个非对比序列(T1w、T2w、扩散)和 T1w 低剂量图像(扩展设置),以及仅使用非对比序列(T1w、T2w、扩散)(零剂量设置)。对于每个设置,训练一个深度神经网络来合成人工 T1w 全剂量图像,在测试样本上使用基于定量指标的客观评估和基于读者的研究进行主观评估。三位读者对整体图像质量、与真实 T1w 全剂量序列相比的临床结论可互换性、病变的对比增强以及与真实 T1w 全剂量的相应参考的一致性进行评分。
标准设置的人工 T1w 全剂量图像的定量分析提供了 33.39±0.62 的峰值信噪比(对应于低剂量序列的平均提高 5.2dB)和 0.938±0.005 的结构相似性指数测量值。在 4 倍交叉验证中,扩展设置在峰值信噪比方面与标准设置具有相似的性能(P=0.20),但结构相似性指数测量值略有提高(P<0.0001)。对于所有设置,读者研究发现原始和人工 T1w 全剂量图像之间的整体图像质量相当。评分完全或大部分可互换的扫描比例分别为 55%、58%、43%和 3%,平均每个病例的假阳性数分别为 0.42±0.83、0.34±0.71、0.82±1.15 和 2.00±1.07,用于标准、5 分钟、扩展和零剂量设置。使用 5 分制(0 到 4,0 为最差),所有全剂量合成图像的设置显示病变的对比增强明显较差,与原始全剂量序列相比(平均对比增强标准差异:-0.97±0.83,P<0.001;5 分钟:-0.93±0.91,P<0.001;扩展:-0.96±0.97,P<0.001;零剂量:-2.39±1.14,P<0.001)。与原始全剂量序列相比,病变的一致性平均评分分别为 2.25±1.21、2.22±1.27、2.24±1.25 和 0.73±0.93,用于标准、5 分钟、扩展和零剂量设置。
基于仅给予标准剂量 10%的钆基造影剂后采集的图像,用于合成人工 T1w 全剂量序列的测试深度学习算法显示出非常好的定量性能。尽管在所有设置中都具有良好的图像质量,但假阴性和假阳性信号都导致合成序列与原始全剂量序列的可互换性受到显著限制。