Figueroa-Mata Geovanni, Mata-Montero Erick
School of Mathematics, Costa Rica Institute of Technology, calle 15, avenida 14, Cartago 30101, Costa Rica.
School of Computing, Costa Rica Institute of Technology, calle 15, avenida 14, Cartago 30101, Costa Rica.
Biomimetics (Basel). 2020 Mar 1;5(1):8. doi: 10.3390/biomimetics5010008.
The application of deep learning techniques may prove difficult when datasets are small. Recently, techniques such as one-shot learning, few-shot learning, and Siamese networks have been proposed to address this problem. In this paper, we propose the use a convolutional Siamese network (CSN) that learns a similarity metric that discriminates between plant species based on images of leaves. Once the CSN has learned the similarity function, its discriminatory power is generalized to classify not just new pictures of the species used during training but also entirely new species for which only a few images are available. This is achieved by exposing the network to pairs of similar and dissimilar observations and minimizing the Euclidean distance between similar pairs while simultaneously maximizing it between dissimilar pairs. We conducted experiments to study two different scenarios. In the first one, the CSN was trained and validated with datasets that comprise 5, 10, 15, 20, 25, and 30 pictures per species, extracted from the well-known FLAVIAmathsizesmall dataset. Then, the trained model was tested with another dataset composed of 320 images (10 images per species) also from FLAVIAmathsizesmall. The obtained accuracy was compared with the results of feeding the same training, validation, and testing datasets to a convolutional neural network (CNN) in order to determine if there is a threshold value for dataset size that defines the intervals for which either the CSN or the CNN has better accuracy. In the second studied scenario, the accuracy of both the CSN and the CNN-both trained and validated with the same datasets extracted from FLAVIAmathsizesmall-were compared when tested on a set of images of leaves of 20 Costa Rican tree species that are not represented in FLAVIAmathsizesmall.
当数据集较小时,深度学习技术的应用可能会遇到困难。最近,诸如一次性学习、少样本学习和连体网络等技术已被提出以解决这一问题。在本文中,我们建议使用卷积连体网络(CSN),它学习一种相似性度量,该度量基于叶片图像来区分植物物种。一旦CSN学习到相似性函数,其判别能力不仅可以推广到对训练期间使用的物种的新图片进行分类,还可以对仅有少量图像的全新物种进行分类。这是通过让网络接触相似和不相似的观测对,并最小化相似对之间的欧几里得距离,同时最大化不相似对之间的欧几里得距离来实现的。我们进行了实验以研究两种不同的情况。在第一种情况中,CSN使用从著名的FLAVIA数据集提取的每个物种包含5、10、15、20、25和30张图片的数据集进行训练和验证。然后,使用同样来自FLAVIA的另一个由320张图像(每个物种10张图像)组成的数据集对训练好的模型进行测试。将获得的准确率与将相同的训练、验证和测试数据集输入到卷积神经网络(CNN)的结果进行比较,以确定是否存在数据集大小的阈值,该阈值定义了CSN或CNN具有更高准确率的区间。在第二种研究的情况中,当在一组未在FLAVIA中出现的20种哥斯达黎加树种的叶片图像上进行测试时,比较了CSN和CNN(两者均使用从FLAVIA提取的相同数据集进行训练和验证)的准确率。