Li Guangyu, Wang Zijian, Wu Chieh, Wang Dongqi, Han Il, Lee Jangho, Kaeli David R, Dy Jennifer G, Weinberger Kilian Q, Gu April Z
School of Civil and Environmental Engineering, Cornell University, Ithaca, NY 14850, United States.
Department of Biological and Environmental Engineering, College of Agriculture and Life Sciences, Cornell University, NY 14850, United States.
ISME Commun. 2025 Mar 9;5(1):ycaf015. doi: 10.1093/ismeco/ycaf015. eCollection 2025 Jan.
Single-cell Raman Spectroscopy (SCRS) emerges as a promising tool for single-cell phenotyping in environmental ecological studies, offering non-intrusive, high-resolution, and high-throughput capabilities. In this study, we obtained a large and the first comprehensive SCRS dataset that captured phenotypic variations with cell growth status for 36 microbial strains, and we compared and optimized analysis techniques and classifiers for SCRS-based taxonomy identification. First, we benchmarked five dimensionality reduction (DR) methods, 10 classifiers, and the impact of cell growth variances using a SCRS dataset with both taxonomy and cellular growth stage labels. Unsupervised DR methods and non-neural network classifiers are recommended for at a balance between accuracy and time efficiency, achieved up to 96.1% taxonomy classification accuracy. Second, accuracy variances caused by cellular growth variance (<2.9% difference) was found less than the influence from model selection (up to 41.4% difference). Remarkably, simultaneous high accuracy in growth stage classification (93.3%) and taxonomy classification (94%) were achievable using an innovative two-step classifier model. Third, this study is the first to successfully apply models trained on pure culture SCRS data to achieve taxonomic identification of microbes in environmental samples at an accuracy of 79%, and with validation via Raman-FISH (fluorescence hybridization). This study paves the groundwork for standardizing SCRS-based biotechnologies in single-cell phenotyping and taxonomic classification beyond laboratory pure culture to real environmental microorganisms and promises advances in SCRS applications for elucidating organismal functions, ecological adaptability, and environmental interactions.
单细胞拉曼光谱(SCRS)成为环境生态研究中单细胞表型分析的一种有前景的工具,具有非侵入性、高分辨率和高通量的能力。在本研究中,我们获得了首个大型且全面的SCRS数据集,该数据集捕捉了36种微生物菌株随细胞生长状态的表型变化,并且我们比较并优化了基于SCRS的分类鉴定的分析技术和分类器。首先,我们使用一个带有分类学和细胞生长阶段标签的SCRS数据集,对五种降维(DR)方法、十种分类器以及细胞生长差异的影响进行了基准测试。为了在准确性和时间效率之间取得平衡,推荐使用无监督DR方法和非神经网络分类器,分类准确率高达96.1%。其次,发现由细胞生长差异引起的准确性差异(差异<2.9%)小于模型选择的影响(差异高达41.4%)。值得注意的是,使用创新的两步分类器模型可同时实现较高的生长阶段分类准确率(93.3%)和分类学分类准确率(94%)。第三,本研究首次成功将基于纯培养SCRS数据训练的模型应用于环境样品中微生物的分类鉴定,准确率达到79%,并通过拉曼 - 荧光原位杂交(Raman - FISH)进行了验证。本研究为将基于SCRS的生物技术从实验室纯培养标准化到实际环境微生物的单细胞表型分析和分类学分类奠定了基础,并有望推动SCRS在阐明生物体功能、生态适应性和环境相互作用方面的应用取得进展。