Rostami Masoud A, Kydd LeMaur, Balmaki Behnaz, Dyer Lee A, Allen Julie M
Data Science Division, University of Texas at Arlington, Arlington, TX, United States.
Department of Biology, University of Texas at Arlington, Arlington, TX, United States.
Front Big Data. 2025 Feb 14;8:1507036. doi: 10.3389/fdata.2025.1507036. eCollection 2025.
Accurate identification of pollen grains from (fir), (spruce), and (pine) is an important method for reconstructing historical environments, past landscapes and understanding human-environment interactions. However, distinguishing between pollen grains of conifer genera poses challenges in palynology due to their morphological similarities. To address this identification challenge, this study leverages advanced deep learning techniques, specifically transfer learning models, which are effective in identifying similarities among detailed features. We evaluated nine different transfer learning architectures: DenseNet201, EfficientNetV2S, InceptionV3, MobileNetV2, ResNet101, ResNet50, VGG16, VGG19, and Xception. Each model was trained and validated on a dataset of images of pollen grains collected from museum specimens, mounted and imaged for training purposes. The models were assessed on various performance metrics, including accuracy, precision, recall, and F1-score across training, validation, and testing phases. Our results indicate that ResNet101 relatively outperformed other models, achieving a test accuracy of 99%, with equally high precision, recall, and F1-score. This study underscores the efficacy of transfer learning to produce models that can aid in identifications of difficult species. These models may aid conifer species classification and enhance pollen grain analysis, critical for ecological research and monitoring environmental changes.
准确识别冷杉、云杉和松树的花粉粒是重建历史环境、过去景观以及理解人类与环境相互作用的重要方法。然而,由于针叶树属花粉粒在形态上的相似性,在孢粉学中区分它们具有挑战性。为应对这一识别挑战,本研究利用先进的深度学习技术,特别是迁移学习模型,这些模型在识别详细特征之间的相似性方面很有效。我们评估了九种不同的迁移学习架构:DenseNet201、EfficientNetV2S、InceptionV3、MobileNetV2、ResNet101、ResNet50、VGG16、VGG19和Xception。每个模型都在从博物馆标本收集的花粉粒图像数据集上进行训练和验证,这些标本经过装片和成像用于训练目的。在训练、验证和测试阶段,根据各种性能指标对模型进行评估,包括准确率、精确率、召回率和F1分数。我们的结果表明,ResNet101相对优于其他模型,测试准确率达到99%,精确率、召回率和F1分数同样很高。这项研究强调了迁移学习在生成有助于识别困难物种的模型方面的有效性。这些模型可能有助于针叶树物种分类并加强花粉粒分析,这对生态研究和监测环境变化至关重要。