Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha 410013, Hunan, China.
Key Laboratory of Carcinogenesis and Cancer Invasion of the Chinese Ministry of Education, Cancer Research Institute and School of Basic Medicine Sciences, Central South University, Changsha, Hunan, China.
Spectrochim Acta A Mol Biomol Spectrosc. 2024 Mar 5;308:123713. doi: 10.1016/j.saa.2023.123713. Epub 2023 Dec 2.
Accurate identification of insect species holds paramount significance in diverse fields as it facilitates a comprehensive understanding of their ecological habits, distribution range, and impact on both the environment and humans. While morphological characteristics have traditionally been employed for species identification, the utilization of empty pupariums for this purpose remains relatively limited. In this study, ATR-FTIR was employed to acquire spectral information from empty pupariums of five fly species, subjecting the data to spectral pre-processing to obtain average spectra for preliminary analysis. Subsequently, PCA and OPLS-DA were utilized for clustering and classification. Notably, two wavebands (3000-2800 cm and 1800-1300 cm) were found to be significant in distinguishing A. grahami. Further, we established three machine learning models, including SVM, KNN, and RF, to analyze spectra from different waveband groups. The biological fingerprint region (1800-1300 cm) demonstrated a substantial advantage in identifying empty puparium species. Remarkably, the SVM model exhibited an impressive accuracy of 100 % in identifying all five fly species. This study represents the first instance of employing infrared spectroscopy and machine learning methods for identifying insect species using empty pupariums, providing a robust research foundation for future investigations in this area.
准确识别昆虫物种在多个领域都具有至关重要的意义,因为这有助于全面了解它们的生态习性、分布范围以及对环境和人类的影响。虽然传统上采用形态特征来进行物种鉴定,但利用空蛹进行这一目的的应用仍然相对有限。在这项研究中,我们使用衰减全反射傅里叶变换红外光谱(ATR-FTIR)从五种蝇类的空蛹中获取光谱信息,对数据进行光谱预处理以获得平均光谱进行初步分析。随后,我们使用主成分分析(PCA)和正交偏最小二乘判别分析(OPLS-DA)进行聚类和分类。值得注意的是,两个波段(3000-2800 cm 和 1800-1300 cm)被发现可用于区分 A. grahami。此外,我们建立了三个机器学习模型,包括支持向量机(SVM)、K 近邻(KNN)和随机森林(RF),以分析来自不同波段组的光谱。生物指纹区(1800-1300 cm)在识别空蛹物种方面具有显著优势。令人瞩目的是,SVM 模型在识别所有五种蝇类时表现出了 100%的出色准确率。本研究首次采用红外光谱和机器学习方法,利用空蛹识别昆虫物种,为该领域的未来研究提供了坚实的研究基础。