Wellcome Trust Sanger Institute, Cellular Genetics, Wellcome Genome Campus, Hinxton, CB10 1SA, UK.
Cambridge University Hospitals NHS Foundation Trust, Hills Road, Cambridge, CB2 0QQ, UK.
Gigascience. 2020 Dec 26;9(12). doi: 10.1093/gigascience/giaa151.
BACKGROUND: Droplet-based single-cell RNA sequence analyses assume that all acquired RNAs are endogenous to cells. However, any cell-free RNAs contained within the input solution are also captured by these assays. This sequencing of cell-free RNA constitutes a background contamination that confounds the biological interpretation of single-cell transcriptomic data. RESULTS: We demonstrate that contamination from this "soup" of cell-free RNAs is ubiquitous, with experiment-specific variations in composition and magnitude. We present a method, SoupX, for quantifying the extent of the contamination and estimating "background-corrected" cell expression profiles that seamlessly integrate with existing downstream analysis tools. Applying this method to several datasets using multiple droplet sequencing technologies, we demonstrate that its application improves biological interpretation of otherwise misleading data, as well as improving quality control metrics. CONCLUSIONS: We present SoupX, a tool for removing ambient RNA contamination from droplet-based single-cell RNA sequencing experiments. This tool has broad applicability, and its application can improve the biological utility of existing and future datasets.
背景:基于液滴的单细胞 RNA 序列分析假定所有获得的 RNA 都是内源性细胞的。然而,输入溶液中包含的任何无细胞 RNA 也会被这些检测方法捕获。这些无细胞 RNA 的测序构成了背景污染,从而混淆了单细胞转录组数据的生物学解释。
结果:我们证明了这种“汤”中无细胞 RNA 的污染是普遍存在的,其组成和程度具有实验特异性的变化。我们提出了一种方法 SoupX,用于量化污染的程度,并估计“背景校正”的细胞表达谱,该方法与现有的下游分析工具无缝集成。我们使用多种液滴测序技术对几个数据集应用该方法,证明其应用可以改善原本具有误导性数据的生物学解释,同时提高质量控制指标。
结论:我们提出了 SoupX,这是一种从基于液滴的单细胞 RNA 测序实验中去除环境 RNA 污染的工具。该工具具有广泛的适用性,其应用可以提高现有和未来数据集的生物学实用性。
Gigascience. 2020-12-26
Bioinformatics. 2018-1-1
Genome Biol. 2020-3-5
BMC Genomics. 2018-12-24
Methods Mol Biol. 2018
Signal Transduct Target Ther. 2025-8-25
Nat Cardiovasc Res. 2025-8-21
Acta Neuropathol Commun. 2025-8-12
Methods Mol Biol. 2025
Genome Biol. 2020-3-5
Nature. 2019-10-9
Sci Data. 2018-2-13