• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞转录组学数据的半监督整合。

Semi-supervised integration of single-cell transcriptomics data.

机构信息

Department of Oncology, Lausanne Branch, Ludwig Institute for Cancer Research, CHUV and University of Lausanne, 1011, Lausanne, Switzerland.

AGORA Cancer Research Center, 1005, Lausanne, Switzerland.

出版信息

Nat Commun. 2024 Jan 29;15(1):872. doi: 10.1038/s41467-024-45240-z.

DOI:10.1038/s41467-024-45240-z
PMID:38287014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10825117/
Abstract

Batch effects in single-cell RNA-seq data pose a significant challenge for comparative analyses across samples, individuals, and conditions. Although batch effect correction methods are routinely applied, data integration often leads to overcorrection and can result in the loss of biological variability. In this work we present STACAS, a batch correction method for scRNA-seq that leverages prior knowledge on cell types to preserve biological variability upon integration. Through an open-source benchmark, we show that semi-supervised STACAS outperforms state-of-the-art unsupervised methods, as well as supervised methods such as scANVI and scGen. STACAS scales well to large datasets and is robust to incomplete and imprecise input cell type labels, which are commonly encountered in real-life integration tasks. We argue that the incorporation of prior cell type information should be a common practice in single-cell data integration, and we provide a flexible framework for semi-supervised batch effect correction.

摘要

单细胞 RNA-seq 数据中的批次效应对跨样本、个体和条件的比较分析构成了重大挑战。尽管经常应用批次效应校正方法,但数据集成通常会导致过度校正,并可能导致生物变异性的丧失。在这项工作中,我们提出了 STACAS,这是一种用于 scRNA-seq 的批量校正方法,它利用了对细胞类型的先验知识,在集成时保留生物变异性。通过一个开源基准,我们表明半监督 STACAS 优于最先进的无监督方法,以及 scANVI 和 scGen 等监督方法。STACAS 可很好地扩展到大型数据集,并且对常见于实际集成任务中的不完整和不精确的输入细胞类型标签具有鲁棒性。我们认为,在单细胞数据分析集成中,应将先验细胞类型信息的纳入作为一种常见做法,并且我们提供了一个用于半监督批量效应校正的灵活框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/54eec6d53094/41467_2024_45240_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/aae9a4162f0b/41467_2024_45240_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/b970a97b83bb/41467_2024_45240_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/5f6909d22f83/41467_2024_45240_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/2bba2c109fa4/41467_2024_45240_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/792803f91e13/41467_2024_45240_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/54eec6d53094/41467_2024_45240_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/aae9a4162f0b/41467_2024_45240_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/b970a97b83bb/41467_2024_45240_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/5f6909d22f83/41467_2024_45240_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/2bba2c109fa4/41467_2024_45240_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/792803f91e13/41467_2024_45240_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02c5/10825117/54eec6d53094/41467_2024_45240_Fig6_HTML.jpg

相似文献

1
Semi-supervised integration of single-cell transcriptomics data.单细胞转录组学数据的半监督整合。
Nat Commun. 2024 Jan 29;15(1):872. doi: 10.1038/s41467-024-45240-z.
2
iSMNN: batch effect correction for single-cell RNA-seq data via iterative supervised mutual nearest neighbor refinement.iSMNN:通过迭代监督的互近邻修正对单细胞 RNA-seq 数据进行批次效应校正。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab122.
3
STACAS: Sub-Type Anchor Correction for Alignment in Seurat to integrate single-cell RNA-seq data.STACAS:在 Seurat 中进行亚型锚定校正以整合单细胞 RNA-seq 数据。
Bioinformatics. 2021 May 5;37(6):882-884. doi: 10.1093/bioinformatics/btaa755.
4
Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.使用深度生成模型对单细胞转录组学数据进行概率协调和注释。
Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.
5
FIRM: Flexible integration of single-cell RNA-sequencing data for large-scale multi-tissue cell atlas datasets.FIRM:单细胞 RNA 测序数据的灵活整合,适用于大规模多组织细胞图谱数据集。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac167.
6
scInterpreter: a knowledge-regularized generative model for interpretably integrating scRNA-seq data.scInterpreter:一种知识正则化的生成模型,用于可解释地整合 scRNA-seq 数据。
BMC Bioinformatics. 2023 Dec 16;24(1):481. doi: 10.1186/s12859-023-05579-4.
7
A comprehensive comparison of supervised and unsupervised methods for cell type identification in single-cell RNA-seq.单细胞 RNA-seq 中细胞类型识别的有监督与无监督方法的全面比较。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab567.
8
Single-cell RNA-seq data semi-supervised clustering and annotation via structural regularized domain adaptation.基于结构正则化领域自适应的单细胞 RNA-seq 数据半监督聚类和注释。
Bioinformatics. 2021 May 5;37(6):775-784. doi: 10.1093/bioinformatics/btaa908.
9
BERMAD: batch effect removal for single-cell RNA-seq data using a multi-layer adaptation autoencoder with dual-channel framework.BERMAD:基于双通道框架的多层自适应自动编码器去除单细胞 RNA-seq 数据中的批次效应
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae127.
10
Propensity score matching enables batch-effect-corrected imputation in single-cell RNA-seq analysis.倾向得分匹配可实现单细胞 RNA-seq 分析中的批次效应校正填补。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac275.

引用本文的文献

1
A Benchmark of Semi-Supervised scRNA-seq Integration Methods in Real-World Scenarios.真实场景下半监督单细胞RNA测序整合方法的基准测试
bioRxiv. 2025 Aug 27:2025.08.23.671952. doi: 10.1101/2025.08.23.671952.
2
Identification of malignant cells in single-cell transcriptomics data.单细胞转录组学数据中恶性细胞的识别。
Commun Biol. 2025 Aug 22;8(1):1264. doi: 10.1038/s42003-025-08695-4.
3
BioLLM: A standardized framework for integrating and benchmarking single-cell foundation models.BioLLM:一个用于整合和基准测试单细胞基础模型的标准化框架。

本文引用的文献

1
An integrated cell atlas of the lung in health and disease.肺部健康与疾病的细胞整合图谱
Nat Med. 2023 Jun;29(6):1563-1577. doi: 10.1038/s41591-023-02327-2. Epub 2023 Jun 8.
2
Metacells untangle large and complex single-cell transcriptome networks.类细胞可梳理大型且复杂的单细胞转录组网络。
BMC Bioinformatics. 2022 Aug 13;23(1):336. doi: 10.1186/s12859-022-04861-1.
3
scSemiAE: a deep model with semi-supervised learning for single-cell transcriptomics.scSemiAE:一种用于单细胞转录组学的具有半监督学习的深度模型。
Patterns (N Y). 2025 Jul 30;6(8):101326. doi: 10.1016/j.patter.2025.101326. eCollection 2025 Aug 8.
4
Shortcomings of silhouette in single-cell integration benchmarking.单细胞整合基准测试中轮廓图的缺点。
Nat Biotechnol. 2025 Jul 30. doi: 10.1038/s41587-025-02743-4.
5
Sketching T cell atlases in the single-cell era: challenges and recommendations.单细胞时代绘制T细胞图谱:挑战与建议
Immunol Cell Biol. 2025 Aug;103(7):723-737. doi: 10.1111/imcb.70040. Epub 2025 Jun 29.
6
Trajectories of macrophage ontogeny and reprogramming in cancer.癌症中巨噬细胞个体发生与重编程的轨迹
iScience. 2025 Apr 22;28(5):112498. doi: 10.1016/j.isci.2025.112498. eCollection 2025 May 16.
7
Age-associated nicotinamide adenine dinucleotide decline drives CAR-T cell failure.与年龄相关的烟酰胺腺嘌呤二核苷酸水平下降导致嵌合抗原受体T细胞(CAR-T)功能衰竭。
Nat Cancer. 2025 May 20. doi: 10.1038/s43018-025-00982-7.
8
Impaired T cell and neoantigen retention in time-serial analysis of metastatic non-small cell lung cancer in patients unresponsive to TIL cell therapy.在对TIL细胞疗法无反应的转移性非小细胞肺癌患者的时间序列分析中,T细胞和新抗原保留受损。
Nat Cancer. 2025 May 8. doi: 10.1038/s43018-025-00946-x.
9
UniMap: Type-Level Integration Enhances Biological Preservation and Interpretability in Single-Cell Annotation.UniMap:类型级集成增强了单细胞注释中的生物学保留和可解释性。
Adv Sci (Weinh). 2025 Apr;12(16):e2410790. doi: 10.1002/advs.202410790. Epub 2025 Feb 27.
10
scCobra allows contrastive cell embedding learning with domain adaptation for single cell data integration and harmonization.scCobra支持通过域适应进行对比细胞嵌入学习,以实现单细胞数据整合与归一化。
Commun Biol. 2025 Feb 13;8(1):233. doi: 10.1038/s42003-025-07692-x.
BMC Bioinformatics. 2022 May 5;23(1):161. doi: 10.1186/s12859-022-04703-0.
4
Mechanistic convergence of the TIGIT and PD-1 inhibitory pathways necessitates co-blockade to optimize anti-tumor CD8 T cell responses.TIGIT 和 PD-1 抑制途径的机制趋同需要联合阻断以优化抗肿瘤 CD8 T 细胞反应。
Immunity. 2022 Mar 8;55(3):512-526.e9. doi: 10.1016/j.immuni.2022.02.005.
5
scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets.scGate:基于标记的异质单细胞 RNA-seq 数据集细胞类型的纯化。
Bioinformatics. 2022 Apr 28;38(9):2642-2644. doi: 10.1093/bioinformatics/btac141.
6
Temporal single-cell tracing reveals clonal revival and expansion of precursor exhausted T cells during anti-PD-1 therapy in lung cancer.时间分辨单细胞追踪揭示了抗 PD-1 治疗肺癌过程中耗竭前体细胞 T 细胞的克隆复兴和扩增。
Nat Cancer. 2022 Jan;3(1):108-121. doi: 10.1038/s43018-021-00292-8. Epub 2021 Dec 23.
7
scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning.scJoint 集成了图谱尺度单细胞 RNA-seq 和 ATAC-seq 数据,并结合了迁移学习。
Nat Biotechnol. 2022 May;40(5):703-710. doi: 10.1038/s41587-021-01161-6. Epub 2022 Jan 20.
8
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
9
SPICA: Swiss portal for immune cell analysis.SPICA:瑞士免疫细胞分析门户。
Nucleic Acids Res. 2022 Jan 7;50(D1):D1109-D1114. doi: 10.1093/nar/gkab1055.
10
Efficient and precise single-cell reference atlas mapping with Symphony.使用 Symphony 进行高效、精确的单细胞参考图谱映射。
Nat Commun. 2021 Oct 7;12(1):5890. doi: 10.1038/s41467-021-25957-x.