Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Science, Xi'an 710119, China; College of Photoelectricity, University of Chinese Academy of Science, Beijing 100049, China.
Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Science, Xi'an 710119, China.
Spectrochim Acta A Mol Biomol Spectrosc. 2024 Apr 15;311:123938. doi: 10.1016/j.saa.2024.123938. Epub 2024 Jan 22.
Accurate identification of algal populations plays a pivotal role in monitoring seawater quality. Fluorescence-based techniques are effective tools for quickly identifying different algae. However, multiple coexisting algae and their similar photosynthetic pigments can constrain the efficacy of fluorescence methods. This study introduces a multi-label classification model that combines a specific Excitation-Emission matric convolutional neural network (EEM-CNN) with three-dimensional (3D) fluorescence spectroscopy to detect single and mixed algal samples. Spectral data can be input directly into the model without transforming into images. Rectangular convolutional kernels and double convolutional layers are applied to enhance the extraction of balanced and comprehensive spectral features for accurate classification. A dataset comprising 3D fluorescence spectra from eight distinct algae species representing six different algal classes was obtained, preprocessed, and augmented to create input data for the classification model. The classification model was trained and validated using 4448 sets of test samples and 60 sets of test samples, resulting in an accuracy of 0.883 and an F1 score of 0.925. This model exhibited the highest recognition accuracy in both single and mixed algae samples, outperforming comparative methods such as ML-kNN and N-PLS-DA. Furthermore, the classification results were extended to three different algae species and mixed samples of skeletonema costatum to assess the impact of spectral similarity on multi-label classification performance. The developed classification models demonstrated robust performance across samples with varying concentrations and growth stages, highlighting CNN's potential as a promising tool for the precise identification of marine algae.
准确识别藻类种群在监测海水水质方面起着关键作用。基于荧光的技术是快速识别不同藻类的有效工具。然而,多种共存的藻类及其相似的光合色素会限制荧光方法的效果。本研究引入了一种多标签分类模型,该模型将特定的激发-发射矩阵卷积神经网络(EEM-CNN)与三维(3D)荧光光谱相结合,用于检测单一和混合藻类样本。光谱数据可以直接输入模型,而无需转换为图像。应用矩形卷积核和双卷积层来增强对平衡和全面光谱特征的提取,以实现准确的分类。获得了一个由 8 种不同藻类物种的 3D 荧光光谱组成的数据集,对其进行预处理和扩充,以创建分类模型的输入数据。使用 4448 组测试样本和 60 组测试样本对分类模型进行训练和验证,得到了 0.883 的准确率和 0.925 的 F1 分数。该模型在单一和混合藻类样本中表现出最高的识别精度,优于 ML-kNN 和 N-PLS-DA 等比较方法。此外,还将分类结果扩展到三种不同的藻类物种和混合的 skeletonema costatum 样本,以评估光谱相似性对多标签分类性能的影响。所开发的分类模型在具有不同浓度和生长阶段的样本中表现出稳健的性能,突出了 CNN 作为一种有前途的海洋藻类精确识别工具的潜力。