深度学习从单细胞测序数据中解析分支聚糖的生物学特性。

Deep learning explains the biology of branched glycans from single-cell sequencing data.

作者信息

Qin Rui, Mahal Lara K, Bojar Daniel

机构信息

Department of Chemistry, University of Alberta, Edmonton, AB T6G 2G2, Canada.

Department of Chemistry and Molecular Biology, University of Gothenburg, 405 30 Gothenburg, Sweden.

出版信息

iScience. 2022 Sep 19;25(10):105163. doi: 10.1016/j.isci.2022.105163. eCollection 2022 Oct 21.

DOI:10.1016/j.isci.2022.105163

PMID:36217547

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9547197/

Abstract

Glycosylation is ubiquitous and often dysregulated in disease. However, the regulation and functional significance of various types of glycosylation at cellular levels is hard to unravel experimentally. Multi-omics, single-cell measurements such as SUGAR-seq, which quantifies transcriptomes and cell surface glycans, facilitate addressing this issue. Using SUGAR-seq data, we pioneered a deep learning model to predict the glycan phenotypes of cells (mouse T lymphocytes) from transcripts, with the example of predicting β1,6GlcNAc-branching across T cell subtypes (test set F1 score: 0.9351). Model interpretation via SHAP (SHapley Additive exPlanations) identified highly predictive genes, in part known to impact (i) branched glycan levels and (ii) the biology of branched glycans. These genes included physiologically relevant low-abundance genes that were not captured by conventional differential expression analysis. Our work shows that interpretable deep learning models are promising for uncovering novel functions and regulatory mechanisms of glycans from integrated transcriptomic and glycomic datasets.

摘要

糖基化普遍存在，且在疾病中常常失调。然而，在细胞水平上，各种类型糖基化的调控及其功能意义很难通过实验来阐明。多组学、单细胞测量方法，如SUGAR-seq（可对转录组和细胞表面聚糖进行定量），有助于解决这一问题。利用SUGAR-seq数据，我们率先开发了一种深度学习模型，以从转录本预测细胞（小鼠T淋巴细胞）的聚糖表型，例如预测跨T细胞亚型的β1,6GlcNAc分支（测试集F1分数：0.9351）。通过SHAP（Shapley加性解释）进行的模型解释确定了具有高度预测性的基因，其中部分基因已知会影响（i）分支聚糖水平和（ii）分支聚糖的生物学特性。这些基因包括传统差异表达分析未捕获的生理相关低丰度基因。我们的工作表明，可解释的深度学习模型有望从整合的转录组和糖组数据集中揭示聚糖的新功能和调控机制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ed4/9547197/b35b4e295a31/fx1.jpg

相似文献

Deep learning explains the biology of branched glycans from single-cell sequencing data.深度学习从单细胞测序数据中解析分支聚糖的生物学特性。

iScience. 2022 Sep 19;25(10):105163. doi: 10.1016/j.isci.2022.105163. eCollection 2022 Oct 21.

Quantitative analysis of β1,6GlcNAc-branched N-glycans on β4 integrin in cutaneous squamous cell carcinoma.定量分析皮肤鳞状细胞癌中β4 整合素上β1,6GlcNAc 分支的 N-聚糖。

Fukushima J Med Sci. 2020 Dec 10;66(3):119-123. doi: 10.5387/fms.2020-12. Epub 2020 Aug 7.

Control of T Cell-mediated autoimmunity by metabolite flux to N-glycan biosynthesis.通过代谢物流向N-聚糖生物合成来控制T细胞介导的自身免疫。

J Biol Chem. 2007 Jul 6;282(27):20027-35. doi: 10.1074/jbc.M701890200. Epub 2007 May 8.

β4-Integrin/PI3K Signaling Promotes Tumor Progression through the Galectin-3--Glycan Complex.β4-整合素/PI3K 信号通过半乳糖凝集素-3-聚糖复合物促进肿瘤进展。

Mol Cancer Res. 2018 Jun;16(6):1024-1034. doi: 10.1158/1541-7786.MCR-17-0365. Epub 2018 Mar 16.

Discovery of Pancreatic Ductal Adenocarcinoma-Related Aberrant Glycosylations: A Multilateral Approach of Lectin Microarray-Based Tissue Glycomic Profiling With Public Transcriptomic Datasets.胰腺导管腺癌相关异常糖基化的发现：基于凝集素微阵列的组织糖组学分析与公开转录组数据集的多边方法

Front Oncol. 2020 Mar 13;10:338. doi: 10.3389/fonc.2020.00338. eCollection 2020.

Pathway importance by graph convolutional network and Shapley additive explanations in gene expression phenotype of diffuse large B-cell lymphoma.基于图卷积网络和 Shapley 加法解释的弥漫性大 B 细胞淋巴瘤基因表达表型通路重要性分析。

PLoS One. 2022 Jun 24;17(6):e0269570. doi: 10.1371/journal.pone.0269570. eCollection 2022.

Verifying explainability of a deep learning tissue classifier trained on RNA-seq data.验证基于 RNA-seq 数据训练的深度学习组织分类器的可解释性。

Sci Rep. 2021 Jan 29;11(1):2641. doi: 10.1038/s41598-021-81773-9.

Shapley variable importance cloud for interpretable machine learning.用于可解释机器学习的Shapley变量重要性云图

Patterns (N Y). 2022 Feb 22;3(4):100452. doi: 10.1016/j.patter.2022.100452. eCollection 2022 Apr 8.

N-acetylglucosaminyltransferase V (Mgat5)-mediated N-glycosylation negatively regulates Th1 cytokine production by T cells.N-乙酰葡糖胺基转移酶V（Mgat5）介导的N-糖基化负向调节T细胞产生Th1细胞因子。

J Immunol. 2004 Dec 15;173(12):7200-8. doi: 10.4049/jimmunol.173.12.7200.

Development and application of GlycanDIA workflow for glycomic analysis.用于糖组学分析的GlycanDIA工作流程的开发与应用。

bioRxiv. 2024 Mar 13:2024.03.12.584702. doi: 10.1101/2024.03.12.584702.

引用本文的文献

Integration of RNAseq transcriptomics and -glycomics reveal biosynthetic pathways and predict structure-specific -glycan expression.RNA测序转录组学与糖组学的整合揭示了生物合成途径并预测了结构特异性聚糖表达。

Chem Sci. 2025 Apr 4;16(17):7155-7172. doi: 10.1039/d5sc00467e. eCollection 2025 Apr 30.

Cell- and tissue-specific glycosylation pathways informed by single-cell transcriptomics.由单细胞转录组学揭示的细胞和组织特异性糖基化途径。

NAR Genom Bioinform. 2024 Dec 18;6(4):lqae169. doi: 10.1093/nargab/lqae169. eCollection 2024 Dec.

Emerging technologies for single-cell glycomics.单细胞糖组学的新兴技术

BBA Adv. 2024 Nov 26;6:100125. doi: 10.1016/j.bbadva.2024.100125. eCollection 2024.

Leveraging explainable deep learning methodologies to elucidate the biological underpinnings of Huntington's disease using single-cell RNA sequencing data.利用可解释的深度学习方法，利用单细胞 RNA 测序数据阐明亨廷顿病的生物学基础。

BMC Genomics. 2024 Oct 4;25(1):930. doi: 10.1186/s12864-024-10855-5.

Designing interpretable deep learning applications for functional genomics: a quantitative analysis.设计可解释的深度学习应用于功能基因组学：一项定量分析。

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae449.

Interpretable feature extraction and dimensionality reduction in ESM2 for protein localization prediction.ESM2 中用于蛋白质定位预测的可解释特征提取和降维。

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbad534.

Glycan Shape, Motions, and Interactions Explored by NMR Spectroscopy.通过核磁共振光谱法探索聚糖的形状、运动及相互作用

JACS Au. 2024 Jan 3;4(1):20-39. doi: 10.1021/jacsau.3c00639. eCollection 2024 Jan 22.

本文引用的文献

Glycan profiling of the gut microbiota by Glycan-seq.通过聚糖测序对肠道微生物群进行聚糖分析。

ISME Commun. 2022 Jan 5;2(1):1. doi: 10.1038/s43705-021-00084-2.

A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities.一种有用的凝集素结合指南：57 种独特凝集素特异性的机器学习定向注释。

ACS Chem Biol. 2022 Nov 18;17(11):2993-3012. doi: 10.1021/acschembio.1c00689. Epub 2022 Jan 27.

Whole-Blood 3-Gene Signature as a Decision Aid for Rifapentine-based Tuberculosis Preventive Therapy.全血 3 基因标志物作为利福平为基础的结核病预防治疗决策辅助工具。

Clin Infect Dis. 2022 Sep 14;75(5):743-752. doi: 10.1093/cid/ciac003.

O-GlcNAcylation regulates β1,4-GlcNAc-branched N-glycan biosynthesis via the OGT/SLC35A3/GnT-IV axis.O-GlcNAc 修饰通过 OGT/SLC35A3/GnT-IV 轴调节 β1,4-GlcNAc 分支 N-聚糖生物合成。

FASEB J. 2022 Feb;36(2):e22149. doi: 10.1096/fj.202101520R.

Installation of O-glycan sulfation capacities in human HEK293 cells for display of sulfated mucins.在人源 HEK293 细胞中构建 O-聚糖硫酸化能力，用于展示硫酸化粘蛋白。

J Biol Chem. 2022 Feb;298(2):101382. doi: 10.1016/j.jbc.2021.101382. Epub 2021 Dec 24.

Multifaceted Roles of Chemokines and Chemokine Receptors in Tumor Immunity.趋化因子和趋化因子受体在肿瘤免疫中的多方面作用

Cancers (Basel). 2021 Dec 6;13(23):6132. doi: 10.3390/cancers13236132.

Targeting the CCL2/CCR2 Axis in Cancer Immunotherapy: One Stone, Three Birds?靶向 CCL2/CCR2 轴在癌症免疫治疗中的应用：一石三鸟？

Front Immunol. 2021 Nov 3;12:771210. doi: 10.3389/fimmu.2021.771210. eCollection 2021.

Integrated Systems Analysis of the Murine and Human Pancreatic Cancer Glycomes Reveals a Tumor-Promoting Role for ST6GAL1.整合系统分析鼠类和人类胰腺癌糖组图谱揭示 ST6GAL1 在肿瘤促进中的作用。

Mol Cell Proteomics. 2021;20:100160. doi: 10.1016/j.mcpro.2021.100160. Epub 2021 Oct 9.

CCL5/CCR5 axis in human diseases and related treatments.人类疾病及相关治疗中的CCL5/CCR5轴

Genes Dis. 2022 Jan;9(1):12-27. doi: 10.1016/j.gendis.2021.08.004. Epub 2021 Aug 26.

Combining functional metagenomics and glycoanalytics to identify enzymes that facilitate structural characterization of sulfated N-glycans.结合功能宏基因组学和糖组学分析鉴定促进硫酸化 N-聚糖结构特征分析的酶。

Microb Cell Fact. 2021 Aug 21;20(1):162. doi: 10.1186/s12934-021-01652-w.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

深度学习从单细胞测序数据中解析分支聚糖的生物学特性。

Deep learning explains the biology of branched glycans from single-cell sequencing data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献