Department of Translational Biomedical Sciences, Dong-A University, Busan, Korea.
Institute of Convergence Bio-Health, Dong-A University, Busan, Korea.
PLoS One. 2021 Oct 20;16(10):e0258214. doi: 10.1371/journal.pone.0258214. eCollection 2021.
High accuracy has been reported in deep learning classification for amyloid brain scans, an important factor in Alzheimer's disease diagnosis. However, the possibility of overfitting should be considered, as this model is fitted with sample data. Therefore, we created and evaluated an [18F]Florbetaben amyloid brain positron emission tomography (PET) scan classification model with a Dong-A University Hospital (DAUH) dataset based on a convolutional neural network (CNN), and performed external validation with the Alzheimer's Disease Neuroimaging Initiative dataset. Spatial normalization, count normalization, and skull stripping preprocessing were performed on the DAUH and external datasets. However, smoothing was only performed on the external dataset. Three types of models were used, depending on their structure: Inception3D, ResNet3D, and VGG3D. After training with 80% of the DAUH dataset, an appropriate model was selected, and the rest of the DAUH dataset was used for model evaluation. The generalization potential of the selected model was then validated using the external dataset. The accuracy of the model evaluation for Inception3D, ResNet3D, and VGG3D was 95.4%, 92.0%, and 97.7%, and the accuracy of the external validation was 76.7%, 67.1%, and 85.3%, respectively. Inception3D and ResNet3D were retrained with the external dataset; then, the area under the curve was compared to determine the binary classification performance with a significance level of less than 0.05. When external validation was performed again after fine tuning, the performance improved to 15.3%p for Inception3D and 16.9%p for ResNet3D. In [18F]Florbetaben amyloid brain PET scan classification using CNN, the generalization potential can be seen through external validation. When there is a significant difference between the model classification performance and the external validation, changing the model structure or fine tuning the model can help improve the classification performance, and the optimal model can also be found by collaborating through a web-based open platform.
深度学习分类在淀粉样脑扫描中具有较高的准确性,这是阿尔茨海默病诊断的一个重要因素。然而,应该考虑到过拟合的可能性,因为该模型是根据样本数据进行拟合的。因此,我们基于卷积神经网络(CNN)创建并评估了一个东国大学医院(DAUH)数据集的[18F]Florbetaben 淀粉样脑正电子发射断层扫描(PET)扫描分类模型,并使用阿尔茨海默病神经影像学倡议(ADNI)数据集进行外部验证。对 DAUH 和外部数据集进行了空间标准化、计数标准化和颅骨剥离预处理。然而,仅对外部数据集进行了平滑处理。根据其结构使用了三种类型的模型:Inception3D、ResNet3D 和 VGG3D。在使用 DAUH 数据集的 80%进行训练后,选择了一个合适的模型,并使用剩余的 DAUH 数据集进行模型评估。然后使用外部数据集验证所选模型的泛化潜力。Inception3D、ResNet3D 和 VGG3D 的模型评估准确性分别为 95.4%、92.0%和 97.7%,外部验证的准确性分别为 76.7%、67.1%和 85.3%。用外部数据集重新训练了 Inception3D 和 ResNet3D;然后,比较了曲线下面积,以确定具有统计学意义的二进制分类性能(p 值小于 0.05)。在微调后再次进行外部验证时,性能提高到 Inception3D 为 15.3%p,ResNet3D 为 16.9%p。在使用 CNN 的[18F]Florbetaben 淀粉样脑 PET 扫描分类中,可以通过外部验证看到泛化潜力。当模型分类性能与外部验证存在显著差异时,可以通过改变模型结构或微调模型来提高分类性能,还可以通过基于网络的开放平台协作找到最佳模型。