• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SCITUNA:使用网络比对的单细胞数据整合工具。

SCITUNA: single-cell data integration tool using network alignment.

作者信息

Houdjedj Aissa, Marouf Yacine, Myradov Mekan, Doğan Süleyman Onur, Erten Burak Onur, Tastan Oznur, Erten Cesim, Kazan Hilal

机构信息

Antalya Bilim University, 07190, Antalya, Turkey.

Akdeniz University, 07058, Antalya, Turkey.

出版信息

BMC Bioinformatics. 2025 Mar 27;26(1):92. doi: 10.1186/s12859-025-06087-3.

DOI:10.1186/s12859-025-06087-3
PMID:40148808
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11951583/
Abstract

BACKGROUND

As single-cell genomics experiments increase in complexity and scale, the need to integrate multiple datasets has grown. Such integration enhances cellular feature identification by leveraging larger data volumes. However, batch effects-technical variations arising from differences in labs, times, or protocols-pose a significant challenge. Despite numerous proposed batch correction methods, many still have limitations, such as outputting only dimension-reduced data, relying on computationally intensive models, or resulting in overcorrection for batches with diverse cell type composition.

RESULTS

We introduce a novel method for batch effect correction named SCITUNA, a Single-Cell data Integration Tool Using Network Alignment. We perform evaluations on 39 individual batches from four real datasets and a simulated dataset, which include both scRNA-seq and scATAC-seq datasets, spanning multiple organisms and tissues. A thorough comparison of existing batch correction methods using 13 metrics reveals that SCITUNA outperforms current approaches and is successful at preserving biological signals present in the original data. In particular, SCITUNA shows a better performance than the current methods in all the comparisons except for the multiple batch integration of the lung dataset where the difference is 0.004.

CONCLUSION

SCITUNA effectively removes batch effects while retaining the biological signals present in the data. Our extensive experiments reveal that SCITUNA will be a valuable tool for diverse integration tasks.

摘要

背景

随着单细胞基因组学实验的复杂性和规模不断增加,整合多个数据集的需求也日益增长。这种整合通过利用更大的数据量来增强细胞特征识别。然而,批次效应——由实验室、时间或实验方案的差异引起的技术变化——构成了重大挑战。尽管提出了许多批次校正方法,但许多方法仍然存在局限性,例如仅输出降维后的数据、依赖计算密集型模型,或者对具有不同细胞类型组成的批次进行过度校正。

结果

我们引入了一种名为SCITUNA的新型批次效应校正方法,即使用网络比对的单细胞数据整合工具。我们对来自四个真实数据集和一个模拟数据集的39个单独批次进行了评估,这些数据集包括scRNA-seq和scATAC-seq数据集,涵盖了多种生物和组织。使用13个指标对现有批次校正方法进行的全面比较表明,SCITUNA优于当前方法,并且成功地保留了原始数据中存在的生物信号。特别是,除了肺数据集的多批次整合(差异为0.004)外,SCITUNA在所有比较中均表现出比当前方法更好的性能。

结论

SCITUNA有效地消除了批次效应,同时保留了数据中存在的生物信号。我们的大量实验表明,SCITUNA将成为各种整合任务的有价值工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/3405133c886a/12859_2025_6087_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/7d544ffff574/12859_2025_6087_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/0a2ca45fb63b/12859_2025_6087_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/47550e8ec9aa/12859_2025_6087_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/ac795ad15fa7/12859_2025_6087_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/2eac2a60020b/12859_2025_6087_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/1a44a2a46ab7/12859_2025_6087_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/3405133c886a/12859_2025_6087_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/7d544ffff574/12859_2025_6087_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/0a2ca45fb63b/12859_2025_6087_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/47550e8ec9aa/12859_2025_6087_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/ac795ad15fa7/12859_2025_6087_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/2eac2a60020b/12859_2025_6087_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/1a44a2a46ab7/12859_2025_6087_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ceb/11951583/3405133c886a/12859_2025_6087_Fig7_HTML.jpg

相似文献

1
SCITUNA: single-cell data integration tool using network alignment.SCITUNA:使用网络比对的单细胞数据整合工具。
BMC Bioinformatics. 2025 Mar 27;26(1):92. doi: 10.1186/s12859-025-06087-3.
2
Partially characterized topology guides reliable anchor-free scRNA-integration.部分特征化的拓扑结构指导可靠的无锚单细胞RNA整合。
Commun Biol. 2025 Apr 4;8(1):561. doi: 10.1038/s42003-025-07988-y.
3
Beaconet: A Reference-Free Method for Integrating Multiple Batches of Single-Cell Transcriptomic Data in Original Molecular Space.Beaconet:一种在原始分子空间中整合多个批次单细胞转录组数据的无参方法。
Adv Sci (Weinh). 2024 Jul;11(26):e2306770. doi: 10.1002/advs.202306770. Epub 2024 May 6.
4
iSMNN: batch effect correction for single-cell RNA-seq data via iterative supervised mutual nearest neighbor refinement.iSMNN:通过迭代监督的互近邻修正对单细胞 RNA-seq 数据进行批次效应校正。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab122.
5
Integration of single cell data by disentangled representation learning.基于解缠表示学习的单细胞数据整合。
Nucleic Acids Res. 2022 Jan 25;50(2):e8. doi: 10.1093/nar/gkab978.
6
A benchmark of batch-effect correction methods for single-cell RNA sequencing data.单细胞 RNA 测序数据批次效应校正方法的基准测试。
Genome Biol. 2020 Jan 16;21(1):12. doi: 10.1186/s13059-019-1850-9.
7
Reference-informed evaluation of batch correction for single-cell omics data with overcorrection awareness.基于参考信息的单细胞组学数据批量校正评估及过校正识别
Commun Biol. 2025 Mar 30;8(1):521. doi: 10.1038/s42003-025-07947-7.
8
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
9
Independent component analysis based gene co-expression network inference (ICAnet) to decipher functional modules for better single-cell clustering and batch integration.基于独立成分分析的基因共表达网络推断 (ICAnet) 以破译功能模块,从而更好地进行单细胞聚类和批次整合。
Nucleic Acids Res. 2021 May 21;49(9):e54. doi: 10.1093/nar/gkab089.
10
NDMNN: A novel deep residual network based MNN method to remove batch effects from scRNA-seq data.NDMNN:一种基于深度残差网络的新型MNN方法,用于去除单细胞RNA测序数据中的批次效应。
J Bioinform Comput Biol. 2024 Jun;22(3):2450015. doi: 10.1142/S021972002450015X. Epub 2024 Jul 20.

本文引用的文献

1
Characterizing the impacts of dataset imbalance on single-cell data integration.表征数据集不平衡对单细胞数据整合的影响。
Nat Biotechnol. 2024 Dec;42(12):1899-1908. doi: 10.1038/s41587-023-02097-9. Epub 2024 Mar 1.
2
The scverse project provides a computational ecosystem for single-cell omics data analysis.scverse项目为单细胞组学数据分析提供了一个计算生态系统。
Nat Biotechnol. 2023 May;41(5):604-606. doi: 10.1038/s41587-023-01733-8.
3
Integration of Single-Cell RNA-Seq Datasets: A Review of Computational Methods.单细胞 RNA-Seq 数据集的整合:计算方法综述。
Mol Cells. 2023 Feb 28;46(2):106-119. doi: 10.14348/molcells.2023.0009. Epub 2023 Feb 24.
4
Batch alignment of single-cell transcriptomics data using deep metric learning.基于深度度量学习的单细胞转录组学数据批量对齐。
Nat Commun. 2023 Feb 21;14(1):960. doi: 10.1038/s41467-023-36635-5.
5
Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space.通过将异质数据集映射到共同的细胞嵌入空间来实现单细胞数据的在线整合。
Nat Commun. 2022 Oct 17;13(1):6118. doi: 10.1038/s41467-022-33758-z.
6
A Python library for probabilistic analysis of single-cell omics data.一个用于单细胞组学数据概率分析的Python库。
Nat Biotechnol. 2022 Feb;40(2):163-166. doi: 10.1038/s41587-021-01206-w.
7
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
8
Comprehensive analysis of single cell ATAC-seq data with SnapATAC.利用 SnapATAC 对单细胞 ATAC-seq 数据进行全面分析。
Nat Commun. 2021 Feb 26;12(1):1337. doi: 10.1038/s41467-021-21583-9.
9
Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.使用深度生成模型对单细胞转录组学数据进行概率协调和注释。
Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.
10
A multicenter study benchmarking single-cell RNA sequencing technologies using reference samples.一项使用参考试样对单细胞 RNA 测序技术进行基准测试的多中心研究。
Nat Biotechnol. 2021 Sep;39(9):1103-1114. doi: 10.1038/s41587-020-00748-9. Epub 2020 Dec 21.