基于信号聚合表示的拉曼光谱深度学习用于增强细胞表型和特征识别。

Raman spectroscopic deep learning with signal aggregated representations for enhanced cell phenotype and signature identification.

作者信息

Lu Songlin, Huang Yuanfang, Shen Wan Xiang, Cao Yu Lin, Cai Mengna, Chen Yan, Tan Ying, Jiang Yu Yang, Chen Yu Zong

机构信息

The State Key Laboratory of Chemical Oncogenomics, Key Laboratory of Chemical Biology, Tsinghua Shenzhen International Graduate School, Tsinghua University, 2279 Lishui Road, Nanshan District, Shenzhen 518055, Guangdong, P. R. China.

Institute of Biomedical Health Technology and Engineering, Shenzhen Bay Laboratory, 9 Kexue Avenue, Guangming District, Shenzhen 518132, Guangdong, P. R. China.

出版信息

PNAS Nexus. 2024 Jul 3;3(8):pgae268. doi: 10.1093/pnasnexus/pgae268. eCollection 2024 Aug.

DOI:10.1093/pnasnexus/pgae268

PMID:39192845

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11348106/

Abstract

Feature representation is critical for data learning, particularly in learning spectroscopic data. Machine learning (ML) and deep learning (DL) models learn Raman spectra for rapid, nondestructive, and label-free cell phenotype identification, which facilitate diagnostic, therapeutic, forensic, and microbiological applications. But these are challenged by high-dimensional, unordered, and low-sample spectroscopic data. Here, we introduced novel 2D image-like dual signal and component aggregated representations by restructuring Raman spectra and principal components, which enables spectroscopic DL for enhanced cell phenotype and signature identification. New ConvNet models DSCARNets significantly outperformed the state-of-the-art (SOTA) ML and DL models on six benchmark datasets, mostly with >2% improvement over the SOTA performance of 85-97% accuracies. DSCARNets also performed well on four additional datasets against SOTA models of extremely high performances (>98%) and two datasets without a published supervised phenotype classification model. Explainable DSCARNets identified Raman signatures consistent with experimental indications.

摘要

特征表示对于数据学习至关重要，特别是在学习光谱数据时。机器学习（ML）和深度学习（DL）模型学习拉曼光谱以进行快速、无损且无标记的细胞表型识别，这有助于诊断、治疗、法医和微生物学应用。但这些模型受到高维、无序和低样本光谱数据的挑战。在此，我们通过重组拉曼光谱和主成分引入了新颖的二维图像状双信号和成分聚合表示，这使得光谱深度学习能够增强细胞表型和特征识别。新的卷积神经网络模型DSCARNets在六个基准数据集上显著优于当前最先进的（SOTA）ML和DL模型，大多数情况下比准确率为85 - 97%的SOTA性能提高了2%以上。DSCARNets在另外四个数据集上与极高性能（>98%）的SOTA模型以及两个没有已发表的监督表型分类模型的数据集相比也表现良好。可解释的DSCARNets识别出与实验指征一致的拉曼特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae15/11348106/f6b737547e08/pgae268f1.jpg

相似文献

Raman spectroscopic deep learning with signal aggregated representations for enhanced cell phenotype and signature identification.基于信号聚合表示的拉曼光谱深度学习用于增强细胞表型和特征识别。

PNAS Nexus. 2024 Jul 3;3(8):pgae268. doi: 10.1093/pnasnexus/pgae268. eCollection 2024 Aug.

Deep learning of 2D-Restructured gene expression representations for improved low-sample therapeutic response prediction.二维重构基因表达图谱的深度学习，以提高低样本治疗反应预测。

Comput Biol Med. 2023 Sep;164:107245. doi: 10.1016/j.compbiomed.2023.107245. Epub 2023 Jul 18.

AggMapNet: enhanced and explainable low-sample omics deep learning with feature-aggregated multi-channel networks.AggMapNet：基于特征聚合多通道网络的增强型和可解释的低样本组学深度学习。

Nucleic Acids Res. 2022 May 6;50(8):e45. doi: 10.1093/nar/gkac010.

Enhanced metagenomic deep learning for disease prediction and consistent signature recognition by restructured microbiome 2D representations.通过重组微生物组二维表示增强宏基因组深度学习用于疾病预测和一致特征识别

Patterns (N Y). 2022 Dec 15;4(1):100658. doi: 10.1016/j.patter.2022.100658. eCollection 2023 Jan 13.

Semi Supervised Learning with Deep Embedded Clustering for Image Classification and Segmentation.用于图像分类和分割的深度嵌入聚类半监督学习

IEEE Access. 2019;7:11093-11104. doi: 10.1109/ACCESS.2019.2891970. Epub 2019 Jan 9.

Deep metric learning framework combined with Gramian angular difference field image generation for Raman spectra classification based on a handheld Raman spectrometer.基于手持式拉曼光谱仪的深度度量学习框架与用于拉曼光谱分类的格拉姆角差分场图像生成相结合。

Spectrochim Acta A Mol Biomol Spectrosc. 2023 Dec 15;303:123085. doi: 10.1016/j.saa.2023.123085. Epub 2023 Jun 30.

Development of Crime Scene Intelligence Using a Hand-Held Raman Spectrometer and Transfer Learning.利用手持拉曼光谱仪和迁移学习开发犯罪现场情报。

Anal Chem. 2021 Jun 29;93(25):8889-8896. doi: 10.1021/acs.analchem.1c01099. Epub 2021 Jun 17.

Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。

Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.

Vec2image: an explainable artificial intelligence model for the feature representation and classification of high-dimensional biological data by vector-to-image conversion.Vec2image：一种通过向量到图像的转换对高维生物数据进行特征表示和分类的可解释人工智能模型。

Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab584.

Culture-Independent Raman Spectroscopic Identification of Bacterial Pathogens from Clinical Samples Using Deep Transfer Learning.使用深度迁移学习的临床样本中细菌病原体的无培养拉曼光谱鉴定。

Anal Chem. 2022 Oct 25;94(42):14745-14754. doi: 10.1021/acs.analchem.2c03391. Epub 2022 Oct 10.

本文引用的文献

An integrated computational pipeline for machine learning-driven diagnosis based on Raman spectra of saliva samples.基于唾液样本拉曼光谱的机器学习驱动诊断的集成计算管道。

Comput Biol Med. 2024 Mar;171:108028. doi: 10.1016/j.compbiomed.2024.108028. Epub 2024 Feb 1.

Micro-Raman Analysis of Sperm Cells on Glass Slide: Potential Label-Free Assessment of Sperm DNA toward Clinical Applications.载玻片上精子细胞的微拉曼分析：用于临床应用的精子 DNA 无标记评估的潜在方法。

Biosensors (Basel). 2022 Nov 21;12(11):1051. doi: 10.3390/bios12111051.

Database resources of the National Center for Biotechnology Information in 2023.2023 年国立生物技术信息中心的数据库资源。

Nucleic Acids Res. 2023 Jan 6;51(D1):D29-D38. doi: 10.1093/nar/gkac1032.

Highly Accurate Identification of Bacteria's Antibiotic Resistance Based on Raman Spectroscopy and U-Net Deep Learning Algorithms.基于拉曼光谱和U-Net深度学习算法的细菌抗生素抗性高精度鉴定

ACS Omega. 2022 Aug 12;7(33):29443-29451. doi: 10.1021/acsomega.2c03856. eCollection 2022 Aug 23.

Amino acid catabolism regulates hematopoietic stem cell proteostasis via a GCN2-eIF2α axis.氨基酸分解代谢通过 GCN2-eIF2α 轴调节造血干细胞的蛋白质稳态。

Cell Stem Cell. 2022 Jul 7;29(7):1119-1134.e7. doi: 10.1016/j.stem.2022.06.004.

Nucleic Acids Res. 2022 May 6;50(8):e45. doi: 10.1093/nar/gkac010.

Investigating the cellular responses of osteosarcoma to cisplatin by confocal Raman microspectroscopy.通过共聚焦拉曼显微光谱研究骨肉瘤对顺铂的细胞反应。

J Photochem Photobiol B. 2022 Jan;226:112366. doi: 10.1016/j.jphotobiol.2021.112366. Epub 2021 Nov 19.

Chemometric analysis in Raman spectroscopy from experimental design to machine learning-based modeling.拉曼光谱化学计量分析：从实验设计到基于机器学习的建模。

Nat Protoc. 2021 Dec;16(12):5426-5459. doi: 10.1038/s41596-021-00620-3. Epub 2021 Nov 5.

Determination of the structural changes by FT-IR, Raman, and CP/MAS C NMR spectroscopy on retrograded starch of maize tortillas.通过傅里叶变换红外光谱（FT-IR）、拉曼光谱和交叉极化/魔角旋转核磁共振光谱（CP/MAS C NMR）对玉米饼回生淀粉结构变化的测定。

Carbohydr Polym. 2012 Jan 4;87(1):61-68. doi: 10.1016/j.carbpol.2011.07.011. Epub 2011 Jul 19.

Application of Raman spectroscopy for characterization of the functional polarization of macrophages into M1 and M2 cells.拉曼光谱在表征巨噬细胞功能极化为 M1 和 M2 细胞中的应用。

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Jan 15;265:120328. doi: 10.1016/j.saa.2021.120328. Epub 2021 Aug 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于信号聚合表示的拉曼光谱深度学习用于增强细胞表型和特征识别。

Raman spectroscopic deep learning with signal aggregated representations for enhanced cell phenotype and signature identification.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献