Gansu Key Laboratory for Environmental Pollution Prediction and Control, College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, Gansu 730000, China.
College of Chemistry and Chemical Engineering, Lanzhou University, Lanzhou, Gansu 730000, China.
Environ Sci Technol. 2023 Apr 25;57(16):6656-6663. doi: 10.1021/acs.est.2c08952. Epub 2023 Apr 13.
Microplastics (MPs) are currently recognized as emerging pollutants; their identification and classification are therefore essential during their monitoring and management. In contrast to most studies based on small datasets and library searches, this study developed and compared four machine learning-based classifiers and two large-scale blended plastic datasets, where a 1D convolutional neural network (CNN), decision tree, and random forest (RF) were fed with raw spectral data from Fourier transform infrared spectroscopy, while a 2D CNN used the corresponding spectral images as the input. With an overall accuracy of 96.43% on a small dataset and 97.44% on a large dataset, the 1D CNN outperformed other models. The 1D CNN was the best at predicting environment samples, while the RF was the most robust with less spectral data. Overall, RF and 2D CNNs might be evaluated for plastic identification with fewer spectral data; however, 1D CNNs were thought to be the most effective with sufficient spectral data. Accordingly, an open-source MP spectroscopic analysis tool was developed to facilitate a quick and accurate analysis of existing MP samples.
微塑料(MPs)目前被认为是新兴污染物;因此,在对其进行监测和管理时,识别和分类至关重要。与大多数基于小数据集和库搜索的研究不同,本研究开发并比较了四种基于机器学习的分类器和两个大型混合塑料数据集,其中一维卷积神经网络(CNN)、决策树和随机森林(RF)使用傅里叶变换红外光谱的原始光谱数据进行输入,而二维 CNN 使用相应的光谱图像作为输入。在小数据集上的整体准确率为 96.43%,在大数据集上的准确率为 97.44%,一维 CNN 的表现优于其他模型。一维 CNN 最擅长预测环境样本,而 RF 在光谱数据较少的情况下最为稳健。总体而言,RF 和 2D CNN 可以在较少的光谱数据下评估塑料识别;然而,在有足够光谱数据的情况下,一维 CNN 被认为是最有效的。因此,开发了一个开源的 MP 光谱分析工具,以方便快速准确地分析现有的 MP 样本。