在估计基于深度学习的计算机辅助精神障碍诊断性能的系统中,存在数据泄露的风险。
Risk of data leakage in estimating the diagnostic performance of a deep-learning-based computer-aided system for psychiatric disorders.
机构信息
Department of Electronics and Information Engineering, Korea University, Sejong, Republic of Korea.
Interdisciplinary Graduate Program for Artificial Intelligence Smart Convergence Technology, Korea University, Sejong, Republic of Korea.
出版信息
Sci Rep. 2023 Oct 3;13(1):16633. doi: 10.1038/s41598-023-43542-8.
Deep-learning approaches with data augmentation have been widely used when developing neuroimaging-based computer-aided diagnosis (CAD) systems. To prevent the inflated diagnostic performance caused by data leakage, a correct cross-validation (CV) method should be employed, but this has been still overlooked in recent deep-learning-based CAD studies. The goal of this study was to investigate the impact of correct and incorrect CV methods on the diagnostic performance of deep-learning-based CAD systems after data augmentation. To this end, resting-state electroencephalogram (EEG) data recorded from post-traumatic stress disorder patients and healthy controls were augmented using a cropping method with different window sizes, respectively. Four different CV approaches were used to estimate the diagnostic performance of the CAD system, i.e., subject-wise CV (sCV), overlapped sCV (oSCV), trial-wise CV (tCV), and overlapped tCV (otCV). Diagnostic performances were evaluated using two deep-learning models based on convolutional neural network. Data augmentation can increase the performance with all CVs, but inflated diagnostic performances were observed when using incorrect CVs (tCV and otCV) due to data leakage. Therefore, the correct CV (sCV and osCV) should be used to develop a deep-learning-based CAD system. We expect that our investigation can provide deep-insight for researchers who plan to develop neuroimaging-based CAD systems for psychiatric disorders using deep-learning algorithms with data augmentation.
深度学习方法结合数据增强已被广泛应用于开发基于神经影像学的计算机辅助诊断 (CAD) 系统。为了防止数据泄露导致诊断性能膨胀,应采用正确的交叉验证 (CV) 方法,但在最近基于深度学习的 CAD 研究中,这一点仍被忽视。本研究旨在探讨数据增强后,正确和不正确的 CV 方法对基于深度学习的 CAD 系统诊断性能的影响。为此,分别使用裁剪方法对创伤后应激障碍患者和健康对照组的静息态脑电图 (EEG) 数据进行了不同窗口大小的扩充。使用四种不同的 CV 方法来评估 CAD 系统的诊断性能,即个体内 CV (sCV)、重叠 sCV (oSCV)、试验内 CV (tCV) 和重叠 tCV (otCV)。使用两种基于卷积神经网络的深度学习模型评估了诊断性能。数据扩充可以提高所有 CV 的性能,但由于数据泄露,使用不正确的 CV (tCV 和 otCV) 会导致诊断性能膨胀。因此,应使用正确的 CV (sCV 和 osCV) 来开发基于深度学习的 CAD 系统。我们希望我们的研究可以为计划使用数据增强的深度学习算法为精神障碍开发基于神经影像学的 CAD 系统的研究人员提供深入的见解。