demuxmix：使用回归混合模型对带有 barcodes 的寡核苷酸标记的单细胞 RNA 测序数据进行解复用。

demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.

机构信息

Center for Translational and Computational Neuroimmunology, Department of Neurology, Columbia University Irving Medical Center, New York, NY 10032, United States.

Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Irving Medical Center, New York, NY 10032, United States.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad481.

DOI:10.1093/bioinformatics/btad481

PMID:37527018

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10412409/

Abstract

MOTIVATION

Droplet-based single-cell RNA sequencing (scRNA-seq) is widely used in biomedical research for interrogating the transcriptomes of single cells on a large scale. Pooling and processing cells from different samples together can reduce costs and batch effects. To pool cells, they are often first labeled with hashtag oligonucleotides (HTOs). These HTOs are sequenced alongside the cells' RNA in the droplets and subsequently used to computationally assign each droplet to its sample of origin, a process referred to as demultiplexing. Accurate demultiplexing is crucial but can be challenging due to background HTOs, low-quality cells/cell debris, and multiplets.

RESULTS

A new demultiplexing method based on negative binomial regression mixture models is introduced. The method, called demuxmix, implements two significant improvements. First, demuxmix's probabilistic classification framework provides error probabilities for droplet assignments that can be used to discard uncertain droplets and inform about the quality of the HTO data and the success of the demultiplexing process. Second, demuxmix utilizes the positive association between detected genes in the RNA library and HTO counts to explain parts of the variance in the HTO data resulting in improved droplet assignments. The improved performance of demuxmix compared with existing demultiplexing methods is assessed using real and simulated data. Finally, the feasibility of accurately demultiplexing experimental designs where non-labeled cells are pooled with labeled cells is demonstrated.

AVAILABILITY AND IMPLEMENTATION

R/Bioconductor package demuxmix (https://doi.org/doi:10.18129/B9.bioc.demuxmix).

摘要

动机

基于液滴的单细胞 RNA 测序 (scRNA-seq) 在生物医学研究中被广泛用于大规模检测单细胞的转录组。将来自不同样本的细胞混合并处理可以降低成本和批次效应。为了混合细胞，它们通常首先用标签寡核苷酸 (HTO) 进行标记。这些 HTO 与细胞的 RNA 一起在液滴中测序，随后用于计算将每个液滴分配到其原始样本，这个过程称为多路分解。准确的多路分解至关重要，但由于背景 HTO、低质量的细胞/细胞碎片和多联体，可能具有挑战性。

结果

引入了一种基于负二项回归混合模型的新多路分解方法。该方法称为 demuxmix，实现了两个重要改进。首先，demuxmix 的概率分类框架为液滴分配提供错误概率，可以用于丢弃不确定的液滴，并提供关于 HTO 数据质量和多路分解过程成功的信息。其次，demuxmix 利用 RNA 文库中检测到的基因与 HTO 计数之间的正相关关系来解释 HTO 数据中部分方差，从而提高液滴分配的准确性。使用真实和模拟数据评估了 demuxmix 与现有多路分解方法相比的性能改进。最后，证明了在与标记细胞混合的非标记细胞中准确多路分解实验设计的可行性。

可用性和实现

R/Bioconductor 包 demuxmix（https://doi.org/doi:10.18129/B9.bioc.demuxmix）。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0802/10412409/854bedd7daeb/btad481f1.jpg

相似文献

demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.demuxmix：使用回归混合模型对带有 barcodes 的寡核苷酸标记的单细胞 RNA 测序数据进行解复用。

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad481.

demuxmix: Demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.Demuxmix：使用回归混合模型对寡核苷酸条形码单细胞RNA测序数据进行解复用

bioRxiv. 2023 Jan 29:2023.01.27.525961. doi: 10.1101/2023.01.27.525961.

Benchmarking single-cell hashtag oligo demultiplexing methods.单细胞哈希寡核苷酸解复用方法的基准测试

NAR Genom Bioinform. 2023 Oct 11;5(4):lqad086. doi: 10.1093/nargab/lqad086. eCollection 2023 Dec.

BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data.BFF 和 cellhashR：用于准确分析细胞哈希数据的分析工具。

Bioinformatics. 2022 May 13;38(10):2791-2801. doi: 10.1093/bioinformatics/btac213.

scDemultiplex: An iterative beta-binomial model-based method for accurate demultiplexing with hashtag oligos.scDemultiplex：一种基于迭代贝塔二项式模型的方法，用于使用标签寡核苷酸进行精确解复用。

Comput Struct Biotechnol J. 2023 Aug 19;21:4044-4055. doi: 10.1016/j.csbj.2023.08.013. eCollection 2023.

scruff: an R/Bioconductor package for preprocessing single-cell RNA-sequencing data.scruff：一个用于预处理单细胞 RNA-seq 数据的 R/Bioconductor 包。

BMC Bioinformatics. 2019 May 2;20(1):222. doi: 10.1186/s12859-019-2797-2.

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.scPipe：用于单细胞 RNA 测序数据的灵活 R/Bioconductor 预处理流水线。

PLoS Comput Biol. 2018 Aug 10;14(8):e1006361. doi: 10.1371/journal.pcbi.1006361. eCollection 2018 Aug.

SiftCell: A robust framework to detect and isolate cell-containing droplets from single-cell RNA sequence reads.SiftCell：一种从单细胞 RNA 序列读取中检测和分离含细胞液滴的稳健框架。

Cell Syst. 2023 Jul 19;14(7):620-628.e3. doi: 10.1016/j.cels.2023.06.002.

DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.DIMM-SC：一种基于 Dirichlet 混合模型的用于聚类基于液滴的单细胞转录组学数据的方法。

Bioinformatics. 2018 Jan 1;34(1):139-146. doi: 10.1093/bioinformatics/btx490.

Genotype-free demultiplexing of pooled single-cell RNA-seq.无基因型信息的Pooled 单细胞 RNA-seq 数据拆分。

Genome Biol. 2019 Dec 19;20(1):290. doi: 10.1186/s13059-019-1852-7.

引用本文的文献

Size-dependent temporal decoupling of morphogenesis and transcriptional programs in pseudoembryos.拟胚胎中形态发生与转录程序的大小依赖性时间解耦

Sci Adv. 2025 Aug 22;11(34):eadv7790. doi: 10.1126/sciadv.adv7790.

Probability of stealth multiplets in sample-multiplexing for droplet-based single-cell analysis.基于液滴的单细胞分析中样本多路复用的隐匿多重峰概率。

BMC Genomics. 2025 Jul 23;26(1):686. doi: 10.1186/s12864-025-11835-z.

A multi-omics resource of B cell activation reveals genetic mechanisms for immune-mediated diseases.一个B细胞激活的多组学资源揭示了免疫介导疾病的遗传机制。

medRxiv. 2025 Jun 14:2025.05.22.25328104. doi: 10.1101/2025.05.22.25328104.

CellBouncer, A Unified Toolkit for Single-Cell Demultiplexing and Ambient RNA Analysis, Reveals Hominid Mitochondrial Incompatibilities.CellBouncer，一种用于单细胞解复用和环境RNA分析的统一工具包，揭示了人类线粒体不相容性。

bioRxiv. 2025 Mar 23:2025.03.23.644821. doi: 10.1101/2025.03.23.644821.

Benchmarking spatial transcriptomics technologies with the multi-sample SpatialBenchVisium dataset.使用多样本空间基准数据集SpatialBenchVisium对空间转录组学技术进行基准测试。

Genome Biol. 2025 Mar 28;26(1):77. doi: 10.1186/s13059-025-03543-4.

A cross-disease resource of living human microglia identifies disease-enriched subsets and tool compounds recapitulating microglial states.一份关于活体人类小胶质细胞的跨疾病资源鉴定出疾病富集亚群以及重现小胶质细胞状态的工具化合物。

Nat Neurosci. 2024 Dec;27(12):2521-2537. doi: 10.1038/s41593-024-01764-7. Epub 2024 Oct 15.

Systematic benchmark of single-cell hashtag demultiplexing approaches reveals robust performance of a clustering-based method.单细胞标签解复用方法的系统基准测试揭示了基于聚类方法的强大性能。

Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae039.

A pharmacological toolkit for human microglia identifies Topoisomerase I inhibitors as immunomodulators for Alzheimer's disease.用于人类小胶质细胞的药理学工具包确定拓扑异构酶I抑制剂为阿尔茨海默病的免疫调节剂。

bioRxiv. 2024 Feb 6:2024.02.06.579103. doi: 10.1101/2024.02.06.579103.

deMULTIplex2: robust sample demultiplexing for scRNA-seq.deMULTIplex2：用于 scRNA-seq 的稳健样本拆分。

Genome Biol. 2024 Jan 30;25(1):37. doi: 10.1186/s13059-024-03177-y.

Comput Struct Biotechnol J. 2023 Aug 19;21:4044-4055. doi: 10.1016/j.csbj.2023.08.013. eCollection 2023.

本文引用的文献

Benchmarking single-cell hashtag oligo demultiplexing methods.单细胞哈希寡核苷酸解复用方法的基准测试

NAR Genom Bioinform. 2023 Oct 11;5(4):lqad086. doi: 10.1093/nargab/lqad086. eCollection 2023 Dec.

Comparative analysis of antibody- and lipid-based multiplexing methods for single-cell RNA-seq.基于抗体和脂质的单细胞 RNA-seq 多重分析方法的比较分析。

Genome Biol. 2022 Feb 16;23(1):55. doi: 10.1186/s13059-022-02628-8.

Multiplexing Methods for Simultaneous Large-Scale Transcriptomic Profiling of Samples at Single-Cell Resolution.多重化方法可用于单细胞分辨率下同时大规模进行转录组样本分析。

Adv Sci (Weinh). 2021 Sep;8(17):e2101229. doi: 10.1002/advs.202101229. Epub 2021 Jul 8.

GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing.GMM-Demux：单细胞测序中的样品分拆、多重检测、实验规划和新型细胞类型验证。

Genome Biol. 2020 Jul 30;21(1):188. doi: 10.1186/s13059-020-02084-2.

Highly multiplexed single-cell RNA-seq by DNA oligonucleotide tagging of cellular proteins.通过细胞蛋白的 DNA 寡核苷酸标记进行高度多重化的单细胞 RNA-seq。

Nat Biotechnol. 2020 Jan;38(1):35-38. doi: 10.1038/s41587-019-0372-z. Epub 2019 Dec 23.

Vireo: Bayesian demultiplexing of pooled single-cell RNA-seq data without genotype reference.Vireo：无基因型参考的混合单细胞 RNA-seq 数据的贝叶斯解复用。

Genome Biol. 2019 Dec 13;20(1):273. doi: 10.1186/s13059-019-1865-2.

Orchestrating single-cell analysis with Bioconductor.使用 Bioconductor 进行单细胞分析的协调。

Nat Methods. 2020 Feb;17(2):137-145. doi: 10.1038/s41592-019-0654-x. Epub 2019 Dec 2.

Nuclei multiplexing with barcoded antibodies for single-nucleus genomics.利用带有条形码抗体的核复用进行单细胞基因组学研究。

Nat Commun. 2019 Jul 2;10(1):2907. doi: 10.1038/s41467-019-10756-2.

MULTI-seq: sample multiplexing for single-cell RNA sequencing using lipid-tagged indices.Multi-seq：使用脂质标记索引进行单细胞 RNA 测序的样本多重化。

Nat Methods. 2019 Jul;16(7):619-626. doi: 10.1038/s41592-019-0433-8. Epub 2019 Jun 17.

EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data.EmptyDrops：用于区分基于液滴的单细胞 RNA 测序数据中的细胞和空液滴。

Genome Biol. 2019 Mar 22;20(1):63. doi: 10.1186/s13059-019-1662-y.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

demuxmix：使用回归混合模型对带有 barcodes 的寡核苷酸标记的单细胞 RNA 测序数据进行解复用。

demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

动机

结果

可用性和实现

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献