Lu Yuan-Chiao, Zuo Lianrui, Chou Yi-Yu, Dewey Blake E, Remedios Samuel, Shinohara Russell T, Steele Sonya U, Nair Govind, Reich Daniel S, Prince Jerry L, Pham Dzung L
The Henry M Jackson Foundation for the Advancement of Military Medicine, Inc., Bethesda, MD, United States.
Department of Radiology and Bioengineering, Uniformed Services University of the Health Sciences, Bethesda, MD, United States.
Imaging Neurosci (Camb). 2025 Jul 14;3. doi: 10.1162/IMAG.a.73. eCollection 2025.
Volumetric analysis of magnetic resonance brain images is often complicated by variations in scanner hardware, software, and acquisition settings. Over the past several years, there has been an increase in the use of retrospective harmonization techniques for addressing these variations. This research evaluates three image harmonization methods-neuroCombat (a statistical batch correction tool), DeepHarmony (a supervised deep learning method based on image-to-image translation), and HACA3 (an unsupervised deep learning image translation approach). The study focuses on their effectiveness in achieving consistent brain volume measurements across differing T1-weighted acquisitions (GRE and MPRAGE) and their ability to detect simulated atrophy changes in the same acquisitions. While all three methods notably enhance the consistency of regional brain volumes compared with unharmonized images, HACA3 demonstrated the lowest measurement variation in terms of absolute volume difference percentage (AVDP) across all brain regions (<3%). It also demonstrated the highest agreement between the coefficient of variation (CV) measurements of GRE and MPRAGE images, evidenced by the smallest mean difference (0.12) and the narrowest 95% confidence intervals ([-1.04, 1.28]), alongside achieving the highest intra-class correlation (ICC) values across all regions (ICC >0.9). In the atrophy simulation experiments, HACA3 consistently achieved the smallest AVDPs across most unchanged brain regions, while DeepHarmony showed significant improvements in several regions, and neuroCombat exhibited higher variability. Additionally, using neuroCombat with training data effectively detected hippocampal atrophy, whereas without training, neuroCombat could not differentiate between images with and without atrophy, highlighting a potential limitation in its ability to detect subtle brain volume changes when training data are unavailable. In most metrics, HACA3 was found to be the most effective for harmonizing MRI data, followed by DeepHarmony, with neuroCombat showing more measurement variability but still offering improvements over unharmonized data.
磁共振脑图像的容积分析常常因扫描仪硬件、软件及采集设置的差异而变得复杂。在过去几年中,用于解决这些差异的回顾性协调技术的使用有所增加。本研究评估了三种图像协调方法——NeuroCombat(一种统计批量校正工具)、DeepHarmony(一种基于图像到图像转换的监督深度学习方法)和HACA3(一种无监督深度学习图像转换方法)。该研究聚焦于它们在不同T1加权采集(GRE和MPRAGE)中实现一致脑容量测量的有效性,以及在相同采集中检测模拟萎缩变化的能力。虽然与未协调的图像相比,所有这三种方法都显著提高了区域脑容量的一致性,但HACA3在所有脑区的绝对体积差异百分比(AVDP)方面显示出最低的测量变化(<3%)。它还在GRE和MPRAGE图像的变异系数(CV)测量之间显示出最高的一致性,表现为最小的平均差异(0.12)和最窄的95%置信区间([-1.04, 1.28]),同时在所有区域实现了最高的组内相关系数(ICC)值(ICC>0.9)。在萎缩模拟实验中,HACA3在大多数未改变的脑区始终实现最小的AVDP,而DeepHarmony在几个区域显示出显著改善,NeuroCombat表现出更高的变异性。此外,使用带有训练数据的NeuroCombat能有效检测海马萎缩,而没有训练时,NeuroCombat无法区分有无萎缩的图像,这突出了在没有训练数据时其检测细微脑容量变化能力的潜在局限性。在大多数指标中,发现HACA3在协调MRI数据方面最有效,其次是DeepHarmony,NeuroCombat显示出更多的测量变异性,但仍比未协调的数据有所改进。