Lab for Artificial Intelligence in Medical Imaging (AI-Med), Department of Child and Adolescent Psychiatry, University Hospital, LMU München, Germany.
Umeå Center for Functional Brain Imaging, Department of Radiation Sciences, Umeå University.
Med Image Anal. 2021 Jan;67:101879. doi: 10.1016/j.media.2020.101879. Epub 2020 Oct 21.
The desire to train complex machine learning algorithms and to increase the statistical power in association studies drives neuroimaging research to use ever-larger datasets. The most obvious way to increase sample size is by pooling scans from independent studies. However, simple pooling is often ill-advised as selection, measurement, and confounding biases may creep in and yield spurious correlations. In this work, we combine 35,320 magnetic resonance images of the brain from 17 studies to examine bias in neuroimaging. In the first experiment, Name That Dataset, we provide empirical evidence for the presence of bias by showing that scans can be correctly assigned to their respective dataset with 71.5% accuracy. Given such evidence, we take a closer look at confounding bias, which is often viewed as the main shortcoming in observational studies. In practice, we neither know all potential confounders nor do we have data on them. Hence, we model confounders as unknown, latent variables. Kolmogorov complexity is then used to decide whether the confounded or the causal model provides the simplest factorization of the graphical model. Finally, we present methods for dataset harmonization and study their ability to remove bias in imaging features. In particular, we propose an extension of the recently introduced ComBat algorithm to control for global variation across image features, inspired by adjusting for unknown population stratification in genetics. Our results demonstrate that harmonization can reduce dataset-specific information in image features. Further, confounding bias can be reduced and even turned into a causal relationship. However, harmonization also requires caution as it can easily remove relevant subject-specific information. Code is available at https://github.com/ai-med/Dataset-Bias.
人们希望训练复杂的机器学习算法并提高关联研究中的统计能力,这推动神经影像学研究使用越来越大的数据集。增加样本量最明显的方法是汇集来自独立研究的扫描。然而,简单的汇集通常是不明智的,因为选择、测量和混杂偏差可能会潜入并产生虚假的相关性。在这项工作中,我们结合了来自 17 项研究的 35320 张大脑磁共振图像,以检查神经影像学中的偏差。在第一个实验“命名数据集”中,我们通过显示扫描可以以 71.5%的准确率正确分配给其各自的数据集,提供了存在偏差的经验证据。有了这样的证据,我们仔细研究了混杂偏差,这通常被认为是观察性研究的主要缺点。在实践中,我们既不知道所有潜在的混杂因素,也没有关于它们的数据。因此,我们将混杂因素建模为未知的潜在变量。然后使用柯尔莫哥洛夫复杂性来确定混杂模型还是因果模型为图形模型提供了最简单的因子分解。最后,我们提出了数据集协调的方法,并研究了它们去除成像特征偏差的能力。特别是,我们提出了一种最近引入的 ComBat 算法的扩展,以控制图像特征中的全局变化,这受到遗传学中调整未知群体分层的启发。我们的结果表明,协调可以减少图像特征中特定于数据集的信息。此外,可以减少混杂偏差,甚至可以将其转变为因果关系。然而,协调也需要谨慎,因为它很容易去除相关的个体特定信息。代码可在 https://github.com/ai-med/Dataset-Bias 获得。