Department of Computer Science and Automation, Technische Universität Ilmenau, Ilmenau, Germany.
Department of Neurology, Jena University Hospital, Jena, Germany.
Hum Brain Mapp. 2023 Oct 1;44(14):4848-4858. doi: 10.1002/hbm.26417. Epub 2023 Jul 17.
Deep learning is increasingly being proposed for detecting neurological and psychiatric diseases from electroencephalogram (EEG) data but the method is prone to inadvertently incorporate biases from training data and exploit illegitimate patterns. The recent demonstration that deep learning can detect the sex from EEG implies potential sex-related biases in deep learning-based disease detectors for the many diseases with unequal prevalence between males and females. In this work, we present the male- and female-typical patterns used by a convolutional neural network that detects the sex from clinical EEG (81% accuracy in a separate test set with 142 patients). We considered neural sources, anatomical differences, and non-neural artifacts as sources of differences in the EEG curves. Using EEGs from 1140 patients, we found electrocardiac artifacts to be leaking into the supposedly brain activity-based classifiers. Nevertheless, the sex remained detectable after rejecting heart-related and other artifacts. In the cleaned data, EEG topographies were critical to detect the sex, but waveforms and frequencies were not. None of the traditional frequency bands was particularly important for sex detection. We were able to determine the sex even from EEGs with shuffled time points and therewith completely destroyed waveforms. Researchers should consider neural and non-neural sources as potential origins of sex differences in their data, they should maintain best practices of artifact rejection, even when datasets are large, and they should test their classifiers for sex biases.
深度学习越来越多地被提议用于从脑电图 (EEG) 数据中检测神经和精神疾病,但该方法容易无意中从训练数据中纳入偏差,并利用非法模式。最近的研究表明,深度学习可以从 EEG 中检测到性别,这意味着对于许多男性和女性患病率不等的疾病,基于深度学习的疾病探测器可能存在潜在的与性别相关的偏差。在这项工作中,我们展示了一个卷积神经网络用于从临床 EEG 中检测性别的男性和女性典型模式(在一个包含 142 名患者的独立测试集中准确率为 81%)。我们认为神经源、解剖差异和非神经伪影是 EEG 曲线差异的来源。使用来自 1140 名患者的 EEG,我们发现心电伪影会泄露到基于所谓的大脑活动的分类器中。尽管如此,在拒绝与心脏相关的和其他伪影后,性别仍然可以被检测到。在清理数据中,脑电图拓扑结构对于检测性别至关重要,但波形和频率并非如此。没有一个传统的频带对性别检测特别重要。即使从时间点被打乱且完全破坏了波形的 EEG 中,我们也能够确定性别。研究人员应该将神经和非神经源视为其数据中性别差异的潜在来源,他们应该保持最佳的伪影拒绝实践,即使数据集很大,并且他们应该测试他们的分类器是否存在性别偏差。