• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过CellANOVA在单细胞批量整合中丢失的生物信号的恢复。

Recovery of biological signals lost in single-cell batch integration with CellANOVA.

作者信息

Zhang Zhaojun, Mathew Divij, Lim Tristan L, Mason Kaishu, Martinez Clara Morral, Huang Sijia, Wherry E John, Susztak Katalin, Minn Andy J, Ma Zongming, Zhang Nancy R

机构信息

Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA.

Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

出版信息

Nat Biotechnol. 2024 Nov 26. doi: 10.1038/s41587-024-02463-1.

DOI:10.1038/s41587-024-02463-1
PMID:39592777
Abstract

Data integration to align cells across batches has become a cornerstone of single-cell data analysis, critically affecting downstream results. Currently, there are no guidelines for when the biological differences between samples are separable from batch effects. Here we show that current paradigms for single-cell data integration remove biologically meaningful variation and introduce distortion. We present a statistical model and computationally scalable algorithm, CellANOVA (cell state space analysis of variance), that harnesses experimental design to explicitly recover biological signals that are erased during single-cell data integration. CellANOVA uses a 'pool-of-controls' design concept, applicable across diverse settings, to separate unwanted variation from biological variation of interest and allow the recovery of subtle biological signals. We apply CellANOVA to diverse contexts and validate the recovered biological signals by orthogonal assays. In particular, we show that CellANOVA is effective in the challenging case of single-cell and single-nucleus data integration, where it recovers subtle biological signals that can be validated and replicated by external data.

摘要

跨批次对齐细胞的数据整合已成为单细胞数据分析的基石,严重影响下游结果。目前,对于样本之间的生物学差异何时可与批次效应区分开来,尚无指导原则。在这里,我们表明,当前的单细胞数据整合范式会消除生物学上有意义的变异并引入失真。我们提出了一种统计模型和计算上可扩展的算法,即CellANOVA(细胞状态空间方差分析),它利用实验设计来明确恢复在单细胞数据整合过程中被消除的生物学信号。CellANOVA使用一种“对照池”设计概念,适用于各种情况,以将不需要的变异与感兴趣的生物学变异分开,并允许恢复微妙的生物学信号。我们将CellANOVA应用于各种情况,并通过正交分析验证恢复的生物学信号。特别是,我们表明CellANOVA在单细胞和单细胞核数据整合的具有挑战性的情况下是有效的,在这种情况下,它可以恢复可以通过外部数据进行验证和复制的微妙生物学信号。

相似文献

1
Recovery of biological signals lost in single-cell batch integration with CellANOVA.通过CellANOVA在单细胞批量整合中丢失的生物信号的恢复。
Nat Biotechnol. 2024 Nov 26. doi: 10.1038/s41587-024-02463-1.
2
Signal recovery in single cell batch integration.单细胞批量整合中的信号恢复
bioRxiv. 2023 Sep 23:2023.05.05.539614. doi: 10.1101/2023.05.05.539614.
3
SCITUNA: single-cell data integration tool using network alignment.SCITUNA:使用网络比对的单细胞数据整合工具。
BMC Bioinformatics. 2025 Mar 27;26(1):92. doi: 10.1186/s12859-025-06087-3.
4
Integrating single-cell data with biological variables.将单细胞数据与生物学变量相结合。
Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2416516122. doi: 10.1073/pnas.2416516122. Epub 2025 Apr 28.
5
Beaconet: A Reference-Free Method for Integrating Multiple Batches of Single-Cell Transcriptomic Data in Original Molecular Space.Beaconet:一种在原始分子空间中整合多个批次单细胞转录组数据的无参方法。
Adv Sci (Weinh). 2024 Jul;11(26):e2306770. doi: 10.1002/advs.202306770. Epub 2024 May 6.
6
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
7
Batch effects correction in scRNA-seq based on biological-noise decoupling autoencoder and central-cross loss.基于生物噪声解耦自动编码器和中心交叉损失的单细胞RNA测序中的批次效应校正
Comput Biol Chem. 2024 Dec;113:108261. doi: 10.1016/j.compbiolchem.2024.108261. Epub 2024 Oct 19.
8
CBA: Cluster-Guided Batch Alignment for Single Cell RNA-seq.CBA:单细胞RNA测序的聚类引导批量比对
Front Genet. 2021 Apr 13;12:644211. doi: 10.3389/fgene.2021.644211. eCollection 2021.
9
Blind estimation and correction of microarray batch effect.盲估计和校正微阵列批次效应。
PLoS One. 2020 Apr 9;15(4):e0231446. doi: 10.1371/journal.pone.0231446. eCollection 2020.
10
scMaui: a widely applicable deep learning framework for single-cell multiomics integration in the presence of batch effects and missing data.scMaui:一种广泛适用的深度学习框架,可在存在批次效应和缺失数据的情况下进行单细胞多组学整合。
BMC Bioinformatics. 2024 Aug 6;25(1):257. doi: 10.1186/s12859-024-05880-w.

引用本文的文献

1
Integrative, high-resolution analysis of single-cell gene expression across experimental conditions with PARAFAC2-RISE.利用PARAFAC2-RISE对跨实验条件的单细胞基因表达进行综合、高分辨率分析。
Cell Syst. 2025 Jun 18;16(6):101294. doi: 10.1016/j.cels.2025.101294. Epub 2025 May 15.
2
Immunotherapy-related cognitive impairment after CAR T cell therapy in mice.小鼠接受嵌合抗原受体T细胞疗法后与免疫疗法相关的认知障碍
Cell. 2025 May 7. doi: 10.1016/j.cell.2025.03.041.
3
Revealing a coherent cell state landscape across single cell datasets with CONCORD.

本文引用的文献

1
An atlas of healthy and injured cell states and niches in the human kidney.人类肾脏健康和损伤细胞状态及生态位图谱
Nature. 2023 Jul;619(7970):585-594. doi: 10.1038/s41586-023-05769-3. Epub 2023 Jul 19.
2
Removing unwanted variation from large-scale RNA sequencing data with PRPS.使用 PRPS 去除大规模 RNA 测序数据中的非期望变异。
Nat Biotechnol. 2023 Jan;41(1):82-95. doi: 10.1038/s41587-022-01440-w. Epub 2022 Sep 15.
3
Sample-multiplexing approaches for single-cell sequencing.单细胞测序的样本多重处理方法。
利用CONCORD揭示单细胞数据集中连贯的细胞状态图谱。
bioRxiv. 2025 Apr 11:2025.03.13.643146. doi: 10.1101/2025.03.13.643146.
4
scLTNN: an innovative tool for automatically visualizing single-cell trajectories.scLTNN:一种用于自动可视化单细胞轨迹的创新工具。
Bioinform Adv. 2025 Feb 26;5(1):vbaf033. doi: 10.1093/bioadv/vbaf033. eCollection 2025.
5
A single-cell transcriptomic atlas of developing inhibitory neurons reveals expanding and contracting modes of diversification.发育中抑制性神经元的单细胞转录组图谱揭示了多样化的扩张和收缩模式。
bioRxiv. 2025 Feb 19:2025.02.19.636192. doi: 10.1101/2025.02.19.636192.
6
Semisynthetic simulation for microbiome data analysis.用于微生物组数据分析的半合成模拟
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf051.
Cell Mol Life Sci. 2022 Aug 5;79(8):466. doi: 10.1007/s00018-022-04482-0.
4
Single-cell multi-omics analysis of human pancreatic islets reveals novel cellular states in type 1 diabetes.单细胞多组学分析人类胰岛揭示 1 型糖尿病中的新型细胞状态。
Nat Metab. 2022 Feb;4(2):284-299. doi: 10.1038/s42255-022-00531-x. Epub 2022 Feb 28.
5
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
6
Efficient and precise single-cell reference atlas mapping with Symphony.使用 Symphony 进行高效、精确的单细胞参考图谱映射。
Nat Commun. 2021 Oct 7;12(1):5890. doi: 10.1038/s41467-021-25957-x.
7
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
8
Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.使用深度生成模型对单细胞转录组学数据进行概率协调和注释。
Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.
9
Jointly defining cell types from multiple single-cell datasets using LIGER.使用 LIGER 从多个单细胞数据集联合定义细胞类型。
Nat Protoc. 2020 Nov;15(11):3632-3662. doi: 10.1038/s41596-020-0391-8. Epub 2020 Oct 12.
10
Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction.允许批次效应校正的有效单细胞 RNA-seq 实验的灵活实验设计。
Nat Commun. 2020 Jul 1;11(1):3274. doi: 10.1038/s41467-020-16905-2.