• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞 RNA-Seq 数据集的整合:计算方法综述。

Integration of Single-Cell RNA-Seq Datasets: A Review of Computational Methods.

机构信息

School of Biological Sciences, Seoul National University, Seoul 08826, Korea.

These authors contributed equally to this work.

出版信息

Mol Cells. 2023 Feb 28;46(2):106-119. doi: 10.14348/molcells.2023.0009. Epub 2023 Feb 24.

DOI:10.14348/molcells.2023.0009
PMID:36859475
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9982060/
Abstract

With the increased number of single-cell RNA sequencing (scRNA-seq) datasets in public repositories, integrative analysis of multiple scRNA-seq datasets has become commonplace. Batch effects among different datasets are inevitable because of differences in cell isolation and handling protocols, library preparation technology, and sequencing platforms. To remove these batch effects for effective integration of multiple scRNA-seq datasets, a number of methodologies have been developed based on diverse concepts and approaches. These methods have proven useful for examining whether cellular features, such as cell subpopulations and marker genes, identified from a certain dataset, are consistently present, or whether their condition-dependent variations, such as increases in cell subpopulations in particular disease-related conditions, are consistently observed in different datasets generated under similar or distinct conditions. In this review, we summarize the concepts and approaches of the integration methods and their pros and cons as has been reported in previous literature.

摘要

随着公共存储库中单细胞 RNA 测序 (scRNA-seq) 数据集数量的增加,对多个 scRNA-seq 数据集进行整合分析已变得很常见。由于细胞分离和处理方案、文库制备技术和测序平台的不同,不同数据集之间不可避免地存在批次效应。为了消除这些批次效应,以便有效地整合多个 scRNA-seq 数据集,已经基于不同的概念和方法开发了许多方法。这些方法已被证明可用于检查从特定数据集识别出的细胞特征(例如细胞亚群和标记基因)是否始终存在,或者它们的条件依赖性变化(例如在特定疾病相关条件下细胞亚群的增加)是否始终在类似或不同条件下生成的不同数据集中观察到。在这篇综述中,我们总结了整合方法的概念和方法及其优缺点,这些内容在之前的文献中已有报道。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/147ba78ae22c/molce-46-2-106-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/00170839f46f/molce-46-2-106-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/63815f427879/molce-46-2-106-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/ebb0ea48accf/molce-46-2-106-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/451c6d9004c3/molce-46-2-106-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/221526f6a93b/molce-46-2-106-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/147ba78ae22c/molce-46-2-106-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/00170839f46f/molce-46-2-106-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/63815f427879/molce-46-2-106-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/ebb0ea48accf/molce-46-2-106-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/451c6d9004c3/molce-46-2-106-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/221526f6a93b/molce-46-2-106-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/180d/9982060/147ba78ae22c/molce-46-2-106-f6.jpg

相似文献

1
Integration of Single-Cell RNA-Seq Datasets: A Review of Computational Methods.单细胞 RNA-Seq 数据集的整合:计算方法综述。
Mol Cells. 2023 Feb 28;46(2):106-119. doi: 10.14348/molcells.2023.0009. Epub 2023 Feb 24.
2
Deep Batch Integration and Denoise of Single-Cell RNA-Seq Data.单细胞 RNA-Seq 数据的深度批量整合和去噪。
Adv Sci (Weinh). 2024 Aug;11(29):e2308934. doi: 10.1002/advs.202308934. Epub 2024 May 22.
3
scMRA: a robust deep learning method to annotate scRNA-seq data with multiple reference datasets.scMRA:一种用于用多个参考数据集注释单细胞RNA测序数据的强大深度学习方法。
Bioinformatics. 2022 Jan 12;38(3):738-745. doi: 10.1093/bioinformatics/btab700.
4
Independent component analysis based gene co-expression network inference (ICAnet) to decipher functional modules for better single-cell clustering and batch integration.基于独立成分分析的基因共表达网络推断 (ICAnet) 以破译功能模块,从而更好地进行单细胞聚类和批次整合。
Nucleic Acids Res. 2021 May 21;49(9):e54. doi: 10.1093/nar/gkab089.
5
BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching.蝙蝠侠:通过最小权重匹配实现单细胞RNA测序数据集的快速准确整合
iScience. 2020 Jun 26;23(6):101185. doi: 10.1016/j.isci.2020.101185. Epub 2020 May 20.
6
Integrating single-cell RNA-seq datasets with substantial batch effects.整合具有显著批次效应的单细胞RNA测序数据集。
bioRxiv. 2024 Feb 10:2023.11.03.565463. doi: 10.1101/2023.11.03.565463.
7
The effect of data transformation on low-dimensional integration of single-cell RNA-seq.数据转换对单细胞 RNA-seq 低维整合的影响。
BMC Bioinformatics. 2024 Apr 30;25(1):171. doi: 10.1186/s12859-024-05788-5.
8
FIRM: Flexible integration of single-cell RNA-sequencing data for large-scale multi-tissue cell atlas datasets.FIRM:单细胞 RNA 测序数据的灵活整合,适用于大规模多组织细胞图谱数据集。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac167.
9
Attention-based deep clustering method for scRNA-seq cell type identification.基于注意力机制的深度聚类方法在 scRNA-seq 细胞类型鉴定中的应用。
PLoS Comput Biol. 2023 Nov 10;19(11):e1011641. doi: 10.1371/journal.pcbi.1011641. eCollection 2023 Nov.
10
Domain adaptation for supervised integration of scRNA-seq data.监督整合 scRNA-seq 数据的领域自适应。
Commun Biol. 2023 Mar 16;6(1):274. doi: 10.1038/s42003-023-04668-7.

引用本文的文献

1
Establishing single cell RNA transcriptomics: a brief guide.建立单细胞RNA转录组学:简要指南。
Front Zool. 2025 Sep 2;22(1):25. doi: 10.1186/s12983-025-00579-x.
2
Harmonizing heterogeneous single-cell gene expression data with individual-level covariate information.将异质单细胞基因表达数据与个体水平协变量信息进行整合。
Bioinform Adv. 2025 Aug 9;5(1):vbaf189. doi: 10.1093/bioadv/vbaf189. eCollection 2025.
3
Deep learning-based cell type profiles reveal signatures of Alzheimer's disease resilience and resistance.基于深度学习的细胞类型图谱揭示了阿尔茨海默病恢复力和抵抗力的特征。

本文引用的文献

1
Pathogenic variants damage cell composition and single cell transcription in cardiomyopathies.致病变体损伤心肌病中的细胞组成和单细胞转录。
Science. 2022 Aug 5;377(6606):eabo1984. doi: 10.1126/science.abo1984.
2
Cell-type-specific cis-eQTLs in eight human brain cell types identify novel risk genes for psychiatric and neurological disorders.八种人类脑细胞类型中的细胞类型特异性顺式-eQTL 确定了精神和神经疾病的新风险基因。
Nat Neurosci. 2022 Aug;25(8):1104-1112. doi: 10.1038/s41593-022-01128-z. Epub 2022 Aug 1.
3
The Peripheral Immune Landscape in a Patient with Myocarditis after the Administration of BNT162b2 mRNA Vaccine.
Brain. 2025 Aug 5. doi: 10.1093/brain/awaf285.
4
Single-cell transcriptomics for immune profiling of cerebrospinal fluid in neurological diseases.用于神经系统疾病脑脊液免疫谱分析的单细胞转录组学
Front Immunol. 2025 May 29;16:1599303. doi: 10.3389/fimmu.2025.1599303. eCollection 2025.
5
Multi-Omics Integration in Nephrology: Advances, Challenges, and Future Directions.肾脏病学中的多组学整合:进展、挑战与未来方向
Semin Nephrol. 2024 Nov;44(6):151584. doi: 10.1016/j.semnephrol.2025.151584. Epub 2025 Apr 11.
6
A spatially resolved transcriptome landscape during thyroid cancer progression.甲状腺癌进展过程中的空间分辨转录组图谱。
Cell Rep Med. 2025 Apr 15;6(4):102043. doi: 10.1016/j.xcrm.2025.102043. Epub 2025 Mar 28.
7
SCITUNA: single-cell data integration tool using network alignment.SCITUNA:使用网络比对的单细胞数据整合工具。
BMC Bioinformatics. 2025 Mar 27;26(1):92. doi: 10.1186/s12859-025-06087-3.
8
Exploration of Conserved Human Adipose Subpopulations Using Targeted Single-Nuclei RNA Sequencing Data Sets.利用靶向单细胞核RNA测序数据集探索保守的人类脂肪亚群
J Am Heart Assoc. 2025 Mar 18;14(6):e038465. doi: 10.1161/JAHA.124.038465. Epub 2025 Mar 17.
9
Seurat function argument values in scRNA-seq data analysis: potential pitfalls and refinements for biological interpretation.单细胞RNA测序数据分析中Seurat函数参数值:生物学解释的潜在陷阱与改进
Front Bioinform. 2025 Feb 12;5:1519468. doi: 10.3389/fbinf.2025.1519468. eCollection 2025.
10
Advancements in single-cell RNA sequencing and spatial transcriptomics: transforming biomedical research.单细胞RNA测序和空间转录组学的进展:变革生物医学研究
Acta Biochim Pol. 2025 Feb 5;72:13922. doi: 10.3389/abp.2025.13922. eCollection 2025.
BNT162b2 mRNA 疫苗接种后心肌炎患者的外周免疫景观。
Mol Cells. 2022 Oct 31;45(10):738-748. doi: 10.14348/molcells.2022.0031. Epub 2022 Jul 28.
4
CHD8 haploinsufficiency links autism to transient alterations in excitatory and inhibitory trajectories.CHD8 杂合性不足将自闭症与兴奋性和抑制性轨迹的短暂改变联系起来。
Cell Rep. 2022 Apr 5;39(1):110615. doi: 10.1016/j.celrep.2022.110615.
5
UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization.UINMF 通过非负矩阵分解对单细胞多组学数据集进行镶嵌式整合。
Nat Commun. 2022 Feb 9;13(1):780. doi: 10.1038/s41467-022-28431-4.
6
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
7
Large-scale integration of single-cell transcriptomic data captures transitional progenitor states in mouse skeletal muscle regeneration.大规模整合单细胞转录组数据捕获了小鼠骨骼肌再生中的过渡祖细胞状态。
Commun Biol. 2021 Nov 12;4(1):1280. doi: 10.1038/s42003-021-02810-x.
8
Single-nucleus chromatin accessibility and transcriptomic characterization of Alzheimer's disease.阿尔茨海默病中单细胞核染色质可及性和转录组特征。
Nat Genet. 2021 Aug;53(8):1143-1155. doi: 10.1038/s41588-021-00894-z. Epub 2021 Jul 8.
9
Computational principles and challenges in single-cell data integration.单细胞数据整合的计算原理与挑战。
Nat Biotechnol. 2021 Oct;39(10):1202-1215. doi: 10.1038/s41587-021-00895-7. Epub 2021 May 3.
10
A pan-cancer single-cell transcriptional atlas of tumor infiltrating myeloid cells.泛癌种肿瘤浸润髓系细胞的单细胞转录组图谱。
Cell. 2021 Feb 4;184(3):792-809.e23. doi: 10.1016/j.cell.2021.01.010.