• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于RNA测序计数数据的高效批次效应校正方法。

Highly effective batch effect correction method for RNA-seq count data.

作者信息

Zhang Xiaoyu

机构信息

Department of Computer Science and Information Science, California State University San Marcos, 333 S. Twin Oaks Valley Rd, San Marcos, CA 92096, USA.

出版信息

Comput Struct Biotechnol J. 2024 Dec 16;27:58-64. doi: 10.1016/j.csbj.2024.12.010. eCollection 2025.

DOI:10.1016/j.csbj.2024.12.010
PMID:39802213
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11718288/
Abstract

RNA sequencing (RNA-seq) has become a cornerstone of transcriptomics, providing detailed insights into gene expression across diverse biological conditions and sample types. However, RNA-seq data are often confounded by batch effects, systematic non-biological variations that compromise data reliability and obscure true biological differences. To address these challenges, we introduce ComBat-ref, a refined batch effect correction method designed to enhance the statistical power and reliability of differential expression analysis in RNA-seq data. Building on the principles of ComBat-seq, ComBat-ref employs a negative binomial model for count data adjustment but innovates by selecting a reference batch with the smallest dispersion, preserving count data for the reference batch, and adjusting other batches towards the reference batch. Our method demonstrated superior performance in both simulated environments and real-world datasets, including the growth factor receptor network (GFRN) data and NASA GeneLab transcriptomic datasets, significantly improving sensitivity and specificity compared to existing methods. By effectively mitigating batch effects while maintaining high detection power, ComBat-ref provides a robust solution for improving the accuracy and interpretability of RNA-seq data analyses.

摘要

RNA测序(RNA-seq)已成为转录组学的基石,它能深入洞察不同生物学条件和样本类型下的基因表达情况。然而,RNA-seq数据常常受到批次效应的干扰,这种系统性的非生物学变异会影响数据的可靠性,并掩盖真正的生物学差异。为应对这些挑战,我们引入了ComBat-ref,这是一种经过改进的批次效应校正方法,旨在提高RNA-seq数据中差异表达分析的统计功效和可靠性。基于ComBat-seq的原理,ComBat-ref采用负二项式模型进行计数数据调整,但它的创新之处在于选择离散度最小的参考批次,保留参考批次的计数数据,并将其他批次的数据调整至与参考批次一致。我们的方法在模拟环境和真实数据集(包括生长因子受体网络(GFRN)数据和美国国家航空航天局基因实验室转录组数据集)中均表现出卓越的性能,与现有方法相比,显著提高了灵敏度和特异性。通过在保持高检测能力的同时有效减轻批次效应,ComBat-ref为提高RNA-seq数据分析的准确性和可解释性提供了一个可靠的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd7/11718288/727e95978533/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd7/11718288/4f57db2f54af/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd7/11718288/ad6b3e64cbf6/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd7/11718288/727e95978533/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd7/11718288/4f57db2f54af/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd7/11718288/ad6b3e64cbf6/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd7/11718288/727e95978533/gr2.jpg

相似文献

1
Highly effective batch effect correction method for RNA-seq count data.用于RNA测序计数数据的高效批次效应校正方法。
Comput Struct Biotechnol J. 2024 Dec 16;27:58-64. doi: 10.1016/j.csbj.2024.12.010. eCollection 2025.
2
Highly Effective Batch Effect Correction Method for RNA-seq Count Data.用于RNA测序计数数据的高效批次效应校正方法
bioRxiv. 2024 May 5:2024.05.02.592266. doi: 10.1101/2024.05.02.592266.
3
: batch effect adjustment for RNA-seq count data.RNA测序计数数据的批次效应调整
NAR Genom Bioinform. 2020 Sep;2(3):lqaa078. doi: 10.1093/nargab/lqaa078. Epub 2020 Sep 21.
4
Batch effects correction in scRNA-seq based on biological-noise decoupling autoencoder and central-cross loss.基于生物噪声解耦自动编码器和中心交叉损失的单细胞RNA测序中的批次效应校正
Comput Biol Chem. 2024 Dec;113:108261. doi: 10.1016/j.compbiolchem.2024.108261. Epub 2024 Oct 19.
5
Batch effect detection and correction in RNA-seq data using machine-learning-based automated assessment of quality.基于机器学习的 RNA-seq 数据质量自动评估进行批次效应检测和校正。
BMC Bioinformatics. 2022 Jul 14;23(Suppl 6):279. doi: 10.1186/s12859-022-04775-y.
6
Deep Batch Integration and Denoise of Single-Cell RNA-Seq Data.单细胞 RNA-Seq 数据的深度批量整合和去噪。
Adv Sci (Weinh). 2024 Aug;11(29):e2308934. doi: 10.1002/advs.202308934. Epub 2024 May 22.
7
SMNN: batch effect correction for single-cell RNA-seq data via supervised mutual nearest neighbor detection.SMNN:通过有监督的互最近邻检测对单细胞 RNA-seq 数据进行批次效应校正。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa097.
8
A benchmark of batch-effect correction methods for single-cell RNA sequencing data.单细胞 RNA 测序数据批次效应校正方法的基准测试。
Genome Biol. 2020 Jan 16;21(1):12. doi: 10.1186/s13059-019-1850-9.
9
Propensity score matching enables batch-effect-corrected imputation in single-cell RNA-seq analysis.倾向得分匹配可实现单细胞 RNA-seq 分析中的批次效应校正填补。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac275.
10
pyComBat, a Python tool for batch effects correction in high-throughput molecular data using empirical Bayes methods.pyComBat,一个使用经验贝叶斯方法进行高通量分子数据批次效应校正的 Python 工具。
BMC Bioinformatics. 2023 Dec 7;24(1):459. doi: 10.1186/s12859-023-05578-5.

引用本文的文献

1
Transcriptomic Analysis of Larval Response to Infection.幼虫对感染反应的转录组分析
Animals (Basel). 2025 Aug 30;15(17):2556. doi: 10.3390/ani15172556.
2
Current Bioinformatics Tools in Precision Oncology.精准肿瘤学中的当前生物信息学工具
MedComm (2020). 2025 Jul 9;6(7):e70243. doi: 10.1002/mco2.70243. eCollection 2025 Jul.
3
Autism spectrum disorder risk genes have convergent effects on transcription and neuronal firing patterns in primary neurons.自闭症谱系障碍风险基因对原代神经元的转录和神经元放电模式具有趋同效应。

本文引用的文献

1
Procrustes is a machine-learning approach that removes cross-platform batch effects from clinical RNA sequencing data.Procrustes 是一种机器学习方法,可消除临床 RNA 测序数据中的跨平台批次效应。
Commun Biol. 2024 Mar 30;7(1):392. doi: 10.1038/s42003-024-06020-z.
2
Batch effect detection and correction in RNA-seq data using machine-learning-based automated assessment of quality.基于机器学习的 RNA-seq 数据质量自动评估进行批次效应检测和校正。
BMC Bioinformatics. 2022 Jul 14;23(Suppl 6):279. doi: 10.1186/s12859-022-04775-y.
3
Statistics or biology: the zero-inflation controversy about scRNA-seq data.
bioRxiv. 2025 Mar 26:2025.03.25.645337. doi: 10.1101/2025.03.25.645337.
统计学还是生物学:关于 scRNA-seq 数据的零膨胀争议。
Genome Biol. 2022 Jan 21;23(1):31. doi: 10.1186/s13059-022-02601-5.
4
Exploration of alcohol use disorder-associated brain miRNA-mRNA regulatory networks.酒精使用障碍相关脑 miRNA-mRNA 调控网络的探索。
Transl Psychiatry. 2021 Oct 2;11(1):504. doi: 10.1038/s41398-021-01635-w.
5
Differential DNA methylation and mRNA transcription in gingival tissues in periodontal health and disease.牙周健康和疾病中牙龈组织的差异 DNA 甲基化和 mRNA 转录。
J Clin Periodontol. 2021 Sep;48(9):1152-1164. doi: 10.1111/jcpe.13504. Epub 2021 Jul 11.
6
: batch effect adjustment for RNA-seq count data.RNA测序计数数据的批次效应调整
NAR Genom Bioinform. 2020 Sep;2(3):lqaa078. doi: 10.1093/nargab/lqaa078. Epub 2020 Sep 21.
7
Pathway activity profiling of growth factor receptor network and stemness pathways differentiates metaplastic breast cancer histological subtypes.生长因子受体网络和干性通路的途径活性分析可区分癌肉瘤样乳腺癌的组织学亚型。
BMC Cancer. 2019 Sep 5;19(1):881. doi: 10.1186/s12885-019-6052-z.
8
Alternative empirical Bayes models for adjusting for batch effects in genomic studies.用于调整基因组研究中批次效应的替代经验贝叶斯模型。
BMC Bioinformatics. 2018 Jul 13;19(1):262. doi: 10.1186/s12859-018-2263-6.
9
Removal of batch effects using distribution-matching residual networks.使用分布匹配残差网络消除批次效应。
Bioinformatics. 2017 Aug 15;33(16):2539-2546. doi: 10.1093/bioinformatics/btx196.
10
Polyester: simulating RNA-seq datasets with differential transcript expression.聚酯:模拟具有差异转录本表达的RNA测序数据集。
Bioinformatics. 2015 Sep 1;31(17):2778-84. doi: 10.1093/bioinformatics/btv272. Epub 2015 Apr 28.