• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于监督学习的单细胞数据基因程序可解释性发现

Supervised discovery of interpretable gene programs from single-cell data.

机构信息

Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA.

Department of Statistics, Columbia University, New York, NY, USA.

出版信息

Nat Biotechnol. 2024 Jul;42(7):1084-1095. doi: 10.1038/s41587-023-01940-3. Epub 2023 Sep 21.

DOI:10.1038/s41587-023-01940-3
PMID:37735262
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10958532/
Abstract

Factor analysis decomposes single-cell gene expression data into a minimal set of gene programs that correspond to processes executed by cells in a sample. However, matrix factorization methods are prone to technical artifacts and poor factor interpretability. We address these concerns with Spectra, an algorithm that combines user-provided gene programs with the detection of novel programs that together best explain expression covariation. Spectra incorporates existing gene sets and cell-type labels as prior biological information, explicitly models cell type and represents input gene sets as a gene-gene knowledge graph using a penalty function to guide factorization toward the input graph. We show that Spectra outperforms existing approaches in challenging tumor immune contexts, as it finds factors that change under immune checkpoint therapy, disentangles the highly correlated features of CD8 T cell tumor reactivity and exhaustion, finds a program that explains continuous macrophage state changes under therapy and identifies cell-type-specific immune metabolic programs.

摘要

因子分析将单细胞基因表达数据分解为一组最小的基因程序,这些程序对应于样本中细胞执行的过程。然而,矩阵分解方法容易受到技术伪影和较差的因子可解释性的影响。我们使用 Spectra 解决了这些问题,这是一种算法,它将用户提供的基因程序与新程序的检测相结合,这些程序共同最好地解释了表达的协变。Spectra 将现有的基因集和细胞类型标签作为先验生物学信息纳入其中,显式地对细胞类型进行建模,并使用惩罚函数将输入基因集表示为基因-基因知识图谱,以引导因子分解到输入图谱。我们表明,Spectra 在具有挑战性的肿瘤免疫环境中优于现有方法,因为它找到了在免疫检查点治疗下发生变化的因子,分离了 CD8 T 细胞肿瘤反应和衰竭的高度相关特征,找到了一个可以解释治疗下巨噬细胞状态连续变化的程序,并确定了细胞类型特异性免疫代谢程序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/906af6e71da5/41587_2023_1940_Fig14_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/59570fb468ad/41587_2023_1940_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/686d2b832303/41587_2023_1940_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/e186a72191be/41587_2023_1940_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/268426834c8a/41587_2023_1940_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/0adb81b1b0fb/41587_2023_1940_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/271be0b16b10/41587_2023_1940_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/91a5167b0912/41587_2023_1940_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/06778a9f2998/41587_2023_1940_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/1b902d01250f/41587_2023_1940_Fig9_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/a76613e33a64/41587_2023_1940_Fig10_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/b3678f21f6de/41587_2023_1940_Fig11_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/60b1031ce260/41587_2023_1940_Fig12_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/2a728ea136b3/41587_2023_1940_Fig13_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/906af6e71da5/41587_2023_1940_Fig14_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/59570fb468ad/41587_2023_1940_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/686d2b832303/41587_2023_1940_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/e186a72191be/41587_2023_1940_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/268426834c8a/41587_2023_1940_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/0adb81b1b0fb/41587_2023_1940_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/271be0b16b10/41587_2023_1940_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/91a5167b0912/41587_2023_1940_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/06778a9f2998/41587_2023_1940_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/1b902d01250f/41587_2023_1940_Fig9_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/a76613e33a64/41587_2023_1940_Fig10_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/b3678f21f6de/41587_2023_1940_Fig11_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/60b1031ce260/41587_2023_1940_Fig12_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/2a728ea136b3/41587_2023_1940_Fig13_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e54/11252012/906af6e71da5/41587_2023_1940_Fig14_ESM.jpg

相似文献

1
Supervised discovery of interpretable gene programs from single-cell data.基于监督学习的单细胞数据基因程序可解释性发现
Nat Biotechnol. 2024 Jul;42(7):1084-1095. doi: 10.1038/s41587-023-01940-3. Epub 2023 Sep 21.
2
Single-Cell Transcriptome Analysis Reveals RGS1 as a New Marker and Promoting Factor for T-Cell Exhaustion in Multiple Cancers.单细胞转录组分析揭示 RGS1 作为多种癌症中 T 细胞耗竭的新标志物和促进因子。
Front Immunol. 2021 Dec 8;12:767070. doi: 10.3389/fimmu.2021.767070. eCollection 2021.
3
Double Selection Based Semi-Supervised Clustering Ensemble for Tumor Clustering from Gene Expression Profiles.基于双重选择的半监督聚类集成用于从基因表达谱中进行肿瘤聚类
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):727-40. doi: 10.1109/TCBB.2014.2315996.
4
Knowledge based cluster ensemble for cancer discovery from biomolecular data.基于知识的聚类集成在生物分子数据中的癌症发现。
IEEE Trans Nanobioscience. 2011 Jun;10(2):76-85. doi: 10.1109/TNB.2011.2144997. Epub 2011 Jul 7.
5
Semi-supervised Nonnegative Matrix Factorization for gene expression deconvolution: a case study.半监督非负矩阵分解在基因表达解卷积中的应用:案例研究。
Infect Genet Evol. 2012 Jul;12(5):913-21. doi: 10.1016/j.meegid.2011.08.014. Epub 2011 Sep 10.
6
Deciphering the Prognostic Implications of the Components and Signatures in the Immune Microenvironment of Pancreatic Ductal Adenocarcinoma.解析胰腺导管腺癌免疫微环境中各成分和特征的预后意义。
Front Immunol. 2021 Mar 10;12:648917. doi: 10.3389/fimmu.2021.648917. eCollection 2021.
7
scBOL: a universal cell type identification framework for single-cell and spatial transcriptomics data.scBOL:单细胞和空间转录组学数据的通用细胞类型识别框架。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae188.
8
Cauchy hyper-graph Laplacian nonnegative matrix factorization for single-cell RNA-sequencing data analysis.基于柯西超图拉普拉斯的非负矩阵分解方法在单细胞 RNA-seq 数据分析中的应用。
BMC Bioinformatics. 2024 Apr 29;25(1):169. doi: 10.1186/s12859-024-05797-4.
9
A high OXPHOS CD8 T cell subset is predictive of immunotherapy resistance in melanoma patients.高 OXPHOS CD8 T 细胞亚群可预测黑色素瘤患者对免疫治疗的抵抗。
J Exp Med. 2022 Jan 3;219(1). doi: 10.1084/jem.20202084. Epub 2021 Nov 22.
10
Gene Selection Using Locality Sensitive Laplacian Score.使用局部敏感拉普拉斯分数的基因选择
IEEE/ACM Trans Comput Biol Bioinform. 2014 Nov-Dec;11(6):1146-56. doi: 10.1109/TCBB.2014.2328334.

引用本文的文献

1
Differential T cell clonal dynamics underlie outcomes to frontline chemoimmunotherapy in advanced gastric cancer.不同的T细胞克隆动力学是晚期胃癌一线化学免疫治疗疗效的基础。
medRxiv. 2025 Aug 29:2025.08.26.25334455. doi: 10.1101/2025.08.26.25334455.
2
Efficiently quantifying dependence in massive scientific datasets using InterDependence Scores.使用相互依赖分数对海量科学数据集中的依赖关系进行有效量化。
Proc Natl Acad Sci U S A. 2025 Aug 26;122(34):e2509860122. doi: 10.1073/pnas.2509860122. Epub 2025 Aug 20.
3
A message passing framework for precise cell state identification with scClassify2.

本文引用的文献

1
Biologically informed deep learning to query gene programs in single-cell atlases.基于生物学信息的深度学习方法,可用于在单细胞图谱中查询基因程序。
Nat Cell Biol. 2023 Feb;25(2):337-350. doi: 10.1038/s41556-022-01072-x. Epub 2023 Feb 2.
2
The ectonucleotidase CD39 identifies tumor-reactive CD8 T cells predictive of immune checkpoint blockade efficacy in human lung cancer.细胞外核苷酸酶 CD39 鉴定出预测人类肺癌免疫检查点阻断疗效的肿瘤反应性 CD8 T 细胞。
Immunity. 2023 Jan 10;56(1):93-106.e6. doi: 10.1016/j.immuni.2022.12.001. Epub 2022 Dec 26.
3
High-resolution single-cell atlas reveals diversity and plasticity of tissue-resident neutrophils in non-small cell lung cancer.
一种用于通过scClassify2进行精确细胞状态识别的消息传递框架。
Genome Biol. 2025 Aug 19;26(1):252. doi: 10.1186/s13059-025-03722-3.
4
LaGrACE: estimating gene program dysregulation with latent regulatory network.LaGrACE:利用潜在调控网络估计基因程序失调
Mol Syst Biol. 2025 Jun 30. doi: 10.1038/s44320-025-00115-3.
5
Reply to: False positives in study of memory-related gene expression.回复:记忆相关基因表达研究中的假阳性结果。
Nature. 2025 Jun;642(8066):E4-E6. doi: 10.1038/s41586-025-08989-x.
6
Autonomous learning of pathologists' cancer grading rules.病理学家癌症分级规则的自主学习
bioRxiv. 2025 Apr 7:2025.03.18.643999. doi: 10.1101/2025.03.18.643999.
7
Revealing a coherent cell state landscape across single cell datasets with CONCORD.利用CONCORD揭示单细胞数据集中连贯的细胞状态图谱。
bioRxiv. 2025 Apr 11:2025.03.13.643146. doi: 10.1101/2025.03.13.643146.
8
STModule: identifying tissue modules to uncover spatial components and characteristics of transcriptomic landscapes.STModule:识别组织模块以揭示转录组景观的空间组成部分和特征。
Genome Med. 2025 Mar 3;17(1):18. doi: 10.1186/s13073-025-01441-9.
9
Interpretable single-cell factor decomposition using sciRED.使用sciRED进行可解释的单细胞因子分解。
Nat Commun. 2025 Feb 22;16(1):1878. doi: 10.1038/s41467-025-57157-2.
10
MorPhiC Consortium: towards functional characterization of all human genes.形态学联盟:迈向所有人基因的功能特性描述
Nature. 2025 Feb;638(8050):351-359. doi: 10.1038/s41586-024-08243-w. Epub 2025 Feb 12.
高分辨率单细胞图谱揭示了非小细胞肺癌中组织驻留中性粒细胞的多样性和可塑性。
Cancer Cell. 2022 Dec 12;40(12):1503-1520.e8. doi: 10.1016/j.ccell.2022.10.008. Epub 2022 Nov 10.
4
Functional inference of gene regulation using single-cell multi-omics.利用单细胞多组学进行基因调控的功能推断
Cell Genom. 2022 Sep 14;2(9). doi: 10.1016/j.xgen.2022.100166. Epub 2022 Aug 4.
5
Single-cell meta-analyses reveal responses of tumor-reactive CXCL13 T cells to immune-checkpoint blockade.单细胞荟萃分析揭示肿瘤反应性 CXCL13 T 细胞对免疫检查点阻断的反应。
Nat Cancer. 2022 Sep;3(9):1123-1136. doi: 10.1038/s43018-022-00433-7. Epub 2022 Sep 22.
6
CD39 tissue-resident memory CD8 T cells with a clonal overlap across compartments mediate antitumor immunity in breast cancer.肿瘤部位归巢的 CD39 组织驻留记忆 CD8 T 细胞具有跨区室的克隆重叠,介导乳腺癌中的抗肿瘤免疫。
Sci Immunol. 2022 Aug 26;7(74):eabn8390. doi: 10.1126/sciimmunol.abn8390.
7
Cancer cell states recur across tumor types and form specific interactions with the tumor microenvironment.癌细胞状态在多种肿瘤类型中重现,并与肿瘤微环境形成特定的相互作用。
Nat Genet. 2022 Aug;54(8):1192-1201. doi: 10.1038/s41588-022-01141-9. Epub 2022 Aug 5.
8
Lipid-related FABP5 activation of tumor-associated monocytes fosters immune privilege via PD-L1 expression on Treg cells in hepatocellular carcinoma.脂质相关的FABP5激活肿瘤相关单核细胞,通过肝癌中调节性T细胞上的PD-L1表达促进免疫豁免。
Cancer Gene Ther. 2022 Dec;29(12):1951-1960. doi: 10.1038/s41417-022-00510-0. Epub 2022 Jul 28.
9
Lipid Droplet-Related PLIN2 in CD68 Tumor-Associated Macrophage of Oral Squamous Cell Carcinoma: Implications for Cancer Prognosis and Immunotherapy.口腔鳞状细胞癌CD68肿瘤相关巨噬细胞中与脂滴相关的PLIN2:对癌症预后和免疫治疗的意义
Front Oncol. 2022 Mar 15;12:824235. doi: 10.3389/fonc.2022.824235. eCollection 2022.
10
Reprogramming cholesterol metabolism in macrophages and its role in host defense against cholesterol-dependent cytolysins.重编程巨噬细胞中的胆固醇代谢及其在宿主防御胆固醇依赖性细胞毒素中的作用。
Cell Mol Immunol. 2022 Mar;19(3):327-336. doi: 10.1038/s41423-021-00827-0. Epub 2022 Jan 11.