• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用分布匹配残差网络消除批次效应。

Removal of batch effects using distribution-matching residual networks.

作者信息

Shaham Uri, Stanton Kelly P, Zhao Jun, Li Huamin, Raddassi Khadir, Montgomery Ruth, Kluger Yuval

机构信息

Department of Statistics, Yale University, New Haven, CT 06511, USA.

Department of Pathology, Yale School of Medicine, New Haven, CT 06510, USA.

出版信息

Bioinformatics. 2017 Aug 15;33(16):2539-2546. doi: 10.1093/bioinformatics/btx196.

DOI:10.1093/bioinformatics/btx196
PMID:28419223
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5870543/
Abstract

MOTIVATION

Sources of variability in experimentally derived data include measurement error in addition to the physical phenomena of interest. This measurement error is a combination of systematic components, originating from the measuring instrument and random measurement errors. Several novel biological technologies, such as mass cytometry and single-cell RNA-seq (scRNA-seq), are plagued with systematic errors that may severely affect statistical analysis if the data are not properly calibrated.

RESULTS

We propose a novel deep learning approach for removing systematic batch effects. Our method is based on a residual neural network, trained to minimize the Maximum Mean Discrepancy between the multivariate distributions of two replicates, measured in different batches. We apply our method to mass cytometry and scRNA-seq datasets, and demonstrate that it effectively attenuates batch effects.

AVAILABILITY AND IMPLEMENTATION

our codes and data are publicly available at https://github.com/ushaham/BatchEffectRemoval.git.

CONTACT

yuval.kluger@yale.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

实验得出的数据中的变异性来源,除了感兴趣的物理现象外,还包括测量误差。这种测量误差是由测量仪器产生的系统成分和随机测量误差的组合。几种新型生物技术,如质谱流式细胞术和单细胞RNA测序(scRNA-seq),都存在系统误差,如果数据没有得到适当校准,这些误差可能会严重影响统计分析。

结果

我们提出了一种用于消除系统批次效应的新型深度学习方法。我们的方法基于残差神经网络,经过训练以最小化在不同批次中测量的两个重复样本的多变量分布之间的最大均值差异。我们将我们的方法应用于质谱流式细胞术和scRNA-seq数据集,并证明它有效地减弱了批次效应。

可用性和实现方式

我们的代码和数据可在https://github.com/ushaham/BatchEffectRemoval.git上公开获取。

联系方式

yuval.kluger@yale.edu。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Removal of batch effects using distribution-matching residual networks.使用分布匹配残差网络消除批次效应。
Bioinformatics. 2017 Aug 15;33(16):2539-2546. doi: 10.1093/bioinformatics/btx196.
2
Gating mass cytometry data by deep learning.通过深度学习对门控质谱流式细胞术数据进行分类。
Bioinformatics. 2017 Nov 1;33(21):3423-3430. doi: 10.1093/bioinformatics/btx448.
3
HDMC: a novel deep learning-based framework for removing batch effects in single-cell RNA-seq data.HDMC:一种用于去除单细胞 RNA-seq 数据中批次效应的新型深度学习框架。
Bioinformatics. 2022 Feb 7;38(5):1295-1303. doi: 10.1093/bioinformatics/btab821.
4
ResPAN: a powerful batch correction model for scRNA-seq data through residual adversarial networks.ResPAN:通过残差对抗网络对 scRNA-seq 数据进行强大的批量校正模型。
Bioinformatics. 2022 Aug 10;38(16):3942-3949. doi: 10.1093/bioinformatics/btac427.
5
NDMNN: A novel deep residual network based MNN method to remove batch effects from scRNA-seq data.NDMNN:一种基于深度残差网络的新型MNN方法,用于去除单细胞RNA测序数据中的批次效应。
J Bioinform Comput Biol. 2024 Jun;22(3):2450015. doi: 10.1142/S021972002450015X. Epub 2024 Jul 20.
6
BERMAD: batch effect removal for single-cell RNA-seq data using a multi-layer adaptation autoencoder with dual-channel framework.BERMAD:基于双通道框架的多层自适应自动编码器去除单细胞 RNA-seq 数据中的批次效应
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae127.
7
CLAIRE: contrastive learning-based batch correction framework for better balance between batch mixing and preservation of cellular heterogeneity.CLAIRE:基于对比学习的批次校正框架,更好地平衡批次混合和保留细胞异质性。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad099.
8
Falco: a quick and flexible single-cell RNA-seq processing framework on the cloud.Falco:一个在云端快速且灵活的单细胞RNA测序处理框架。
Bioinformatics. 2017 Mar 1;33(5):767-769. doi: 10.1093/bioinformatics/btw732.
9
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.机器学习和统计方法在单细胞 RNA 测序数据分析中的应用。
Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063.
10
Automated cell type discovery and classification through knowledge transfer.通过知识转移实现自动化细胞类型发现与分类
Bioinformatics. 2017 Jun 1;33(11):1689-1695. doi: 10.1093/bioinformatics/btx054.

引用本文的文献

1
FedscGen: privacy-preserving federated batch effect correction of single-cell RNA sequencing data.FedscGen:单细胞RNA测序数据的隐私保护联邦批次效应校正
Genome Biol. 2025 Jul 22;26(1):216. doi: 10.1186/s13059-025-03684-6.
2
An order-preserving batch-effect correction method based on a monotonic deep learning framework.一种基于单调深度学习框架的保序批效应校正方法。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf247.
3
A Comprehensive Drift-Adaptive Framework for Sustaining Model Performance in COVID-19 Detection From Dynamic Cough Audio Data: Model Development and Validation.一种用于在动态咳嗽音频数据的COVID-19检测中维持模型性能的综合漂移自适应框架:模型开发与验证
J Med Internet Res. 2025 Jun 3;27:e66919. doi: 10.2196/66919.
4
Navigating single-cell RNA-sequencing: protocols, tools, databases, and applications.探索单细胞RNA测序:方案、工具、数据库及应用
Genomics Inform. 2025 May 17;23(1):13. doi: 10.1186/s44342-025-00044-5.
5
Highly effective batch effect correction method for RNA-seq count data.用于RNA测序计数数据的高效批次效应校正方法。
Comput Struct Biotechnol J. 2024 Dec 16;27:58-64. doi: 10.1016/j.csbj.2024.12.010. eCollection 2025.
6
Wise Roles and Future Visionary Endeavors of Current Emperor: Advancing Dynamic Methods for Longitudinal Microbiome Meta-Omics Data in Personalized and Precision Medicine.当代帝王的明智角色与未来前瞻性努力:推进个性化与精准医学中纵向微生物组元组学数据的动态方法
Adv Sci (Weinh). 2024 Dec;11(47):e2400458. doi: 10.1002/advs.202400458. Epub 2024 Nov 13.
7
Single-cell mosaic integration and cell state transfer with auto-scaling self-attention mechanism.单细胞马赛克整合和细胞状态转移与自动缩放自注意机制。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae540.
8
GRouNdGAN: GRN-guided simulation of single-cell RNA-seq data using causal generative adversarial networks.GRouNdGAN:使用因果生成对抗网络对单细胞 RNA-seq 数据进行 GRN 指导模拟。
Nat Commun. 2024 May 14;15(1):4055. doi: 10.1038/s41467-024-48516-6.
9
Integration of scRNA-seq data by disentangled representation learning with condition domain adaptation.基于条件域自适应的解缠表示学习整合 scRNA-seq 数据。
BMC Bioinformatics. 2024 Mar 16;25(1):116. doi: 10.1186/s12859-024-05706-9.
10
A novel batch-effect correction method for scRNA-seq data based on Adversarial Information Factorization.基于对抗信息分解的 scRNA-seq 数据新型批量效应校正方法。
PLoS Comput Biol. 2024 Feb 22;20(2):e1011880. doi: 10.1371/journal.pcbi.1011880. eCollection 2024 Feb.

本文引用的文献

1
Standardization and quality control for high-dimensional mass cytometry studies of human samples.人类样本高维质谱流式细胞术研究的标准化与质量控制
Cytometry A. 2016 Oct;89(10):903-913. doi: 10.1002/cyto.a.22935. Epub 2016 Aug 30.
2
Comprehensive Classification of Retinal Bipolar Neurons by Single-Cell Transcriptomics.通过单细胞转录组学对视网膜双极神经元进行综合分类
Cell. 2016 Aug 25;166(5):1308-1323.e30. doi: 10.1016/j.cell.2016.07.054.
3
Mass Cytometry: Single Cells, Many Features.质谱流式细胞术:单细胞,多特征。
Cell. 2016 May 5;165(4):780-91. doi: 10.1016/j.cell.2016.04.019.
4
Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses.在保留组间差异的同时消除批次效应的方法可能会导致对下游分析的信心过度膨胀。
Biostatistics. 2016 Jan;17(1):29-39. doi: 10.1093/biostatistics/kxv027. Epub 2015 Aug 13.
5
Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets.利用纳升液滴对单个细胞进行高度并行的全基因组表达谱分析。
Cell. 2015 May 21;161(5):1202-1214. doi: 10.1016/j.cell.2015.05.002.
6
Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types.大规模并行单细胞 RNA-seq 用于无标记组织细胞类型分解。
Science. 2014 Feb 14;343(6172):776-9. doi: 10.1126/science.1247651.
7
Smart-seq2 for sensitive full-length transcriptome profiling in single cells.Smart-seq2 可用于单细胞中灵敏的全长转录组谱分析。
Nat Methods. 2013 Nov;10(11):1096-8. doi: 10.1038/nmeth.2639. Epub 2013 Sep 22.
8
Normalization of mass cytometry data with bead standards.用标准微球对质谱流式细胞术数据进行标准化处理。
Cytometry A. 2013 May;83(5):483-94. doi: 10.1002/cyto.a.22271. Epub 2013 Mar 19.
9
The sva package for removing batch effects and other unwanted variation in high-throughput experiments.sva 包用于去除高通量实验中的批次效应和其他不需要的变异。
Bioinformatics. 2012 Mar 15;28(6):882-3. doi: 10.1093/bioinformatics/bts034. Epub 2012 Jan 17.
10
Tackling the widespread and critical impact of batch effects in high-throughput data.解决高通量数据中广泛存在且极具影响力的批次效应问题。
Nat Rev Genet. 2010 Oct;11(10):733-9. doi: 10.1038/nrg2825. Epub 2010 Sep 14.