Institute of Computing, Fluminense Federal University, Niteroi, RJ 24.310-346, Brazil.
Sensors (Basel). 2021 Mar 20;21(6):2174. doi: 10.3390/s21062174.
Convolutional Neural Networks (CNNs) have been successfully applied in the medical diagnosis of different types of diseases. However, selecting the architecture and the best set of hyperparameters among the possible combinations can be a significant challenge. The purpose of this work is to investigate the use of the Hyperband optimization algorithm in the process of optimizing a CNN applied to the diagnosis of SARS-Cov2 disease (COVID-19). The test was performed with the Optuna framework, and the optimization process aimed to optimize four hyperparameters: (1) backbone architecture, (2) the number of inception modules, (3) the number of neurons in the fully connected layers, and (4) the learning rate. CNNs were trained on 2175 computed tomography (CT) images. The CNN that was proposed by the optimization process was a VGG16 with five inception modules, 128 neurons in the two fully connected layers, and a learning rate of 0.0027. The proposed method achieved a sensitivity, precision, and accuracy of 97%, 82%, and 88%, outperforming the sensitivity of the Real-Time Polymerase Chain Reaction (RT-PCR) tests (53-88%) and the accuracy of the diagnosis performed by human experts (72%).
卷积神经网络 (CNN) 已成功应用于不同类型疾病的医学诊断。然而,在可能的组合中选择架构和最佳超参数集可能是一个重大挑战。本工作旨在研究使用 Hyperband 优化算法来优化应用于 SARS-CoV2 疾病 (COVID-19) 诊断的 CNN。该测试是在 Optuna 框架中进行的,优化过程旨在优化四个超参数:(1)骨干架构,(2)inception 模块的数量,(3)全连接层中的神经元数量,和 (4)学习率。CNN 在 2175 张计算机断层扫描 (CT) 图像上进行训练。优化过程提出的 CNN 是一个具有五个 inception 模块、两个全连接层中的 128 个神经元和学习率为 0.0027 的 VGG16。所提出的方法实现了 97%的敏感性、82%的精度和 88%的准确率,优于实时聚合酶链反应 (RT-PCR) 测试的敏感性 (53-88%)和人类专家诊断的准确性 (72%)。