Mostafa Sakib, Mondal Debajyoti, Beck Michael A, Bidinosti Christopher P, Henry Christopher J, Stavness Ian
Department of Computer Science, University of Saskatchewan, Saskatoon, SK, Canada.
Department of Physics, University of Winnipeg, Winnipeg, MB, Canada.
Front Artif Intell. 2022 May 11;5:871162. doi: 10.3389/frai.2022.871162. eCollection 2022.
The development of state-of-the-art convolutional neural networks (CNN) has allowed researchers to perform plant classification tasks previously thought impossible and rely on human judgment. Researchers often develop complex CNN models to achieve better performances, introducing over-parameterization and forcing the model to overfit on a training dataset. The most popular process for evaluating overfitting in a deep learning model is using accuracy and loss curves. Train and loss curves may help understand the performance of a model but do not provide guidance on how the model could be modified to attain better performance. In this article, we analyzed the relation between the features learned by a model and its capacity and showed that a model with higher representational capacity might learn many subtle features that may negatively affect its performance. Next, we showed that the shallow layers of a deep learning model learn more diverse features than the ones learned by the deeper layers. Finally, we propose SSIM cut curve, a new way to select the depth of a CNN model by using the pairwise similarity matrix between the visualization of the features learned at different depths by using Guided Backpropagation. We showed that our proposed method could potentially pave a new way to select a better CNN model.
先进的卷积神经网络(CNN)的发展使研究人员能够执行以前被认为不可能的植物分类任务,而不再依赖人工判断。研究人员经常开发复杂的CNN模型以实现更好的性能,这会引入过度参数化问题,并迫使模型在训练数据集上过度拟合。评估深度学习模型中过拟合的最常用方法是使用准确率和损失曲线。训练曲线和损失曲线可能有助于理解模型的性能,但无法为如何修改模型以获得更好的性能提供指导。在本文中,我们分析了模型学习到的特征与其能力之间的关系,并表明具有更高表示能力的模型可能会学习到许多可能对其性能产生负面影响的细微特征。接下来,我们表明深度学习模型的浅层比深层学习到更多样化的特征。最后,我们提出了SSIM切割曲线,这是一种通过使用引导反向传播在不同深度学习到的特征可视化之间的成对相似性矩阵来选择CNN模型深度的新方法。我们表明,我们提出的方法可能为选择更好的CNN模型开辟一条新途径。