Arceneaux Deronisha, Chen Zhengyi, Simmons Alan J, Heiser Cody N, Southard-Smith Austin N, Brenan Michael J, Yang Yilin, Chen Bob, Xu Yanwen, Choi Eunyoung, Campbell Joshua D, Liu Qi, Lau Ken S
Epithelial Biology Center, Vanderbilt University Medical Center, Nashville, TN, USA.
Department of Cell and Developmental Biology, Vanderbilt University School of Medicine, Nashville, TN, USA.
iScience. 2023 Jun 29;26(7):107242. doi: 10.1016/j.isci.2023.107242. eCollection 2023 Jul 21.
Droplet-based single-cell RNA-seq (scRNA-seq) data are plagued by ambient contaminations caused by nucleic acid material released by dead and dying cells. This material is mixed into the buffer and is co-encapsulated with cells, leading to a lower signal-to-noise ratio. Although there exist computational methods to remove ambient contaminations post-hoc, the reliability of algorithms in generating high-quality data from low-quality sources remains uncertain. Here, we assess data quality before data filtering by a set of quantitative, contamination-based metrics that assess data quality more effectively than standard metrics. Through a series of controlled experiments, we report improvements that can minimize ambient contamination outside of tissue dissociation, via cell fixation, improved cell loading, microfluidic dilution, and nuclei versus cell preparation; many of these parameters are inaccessible on commercial platforms. We provide end-users with insights on factors that can guide their decision-making regarding optimizations that minimize ambient contamination, and metrics to assess data quality.
基于微滴的单细胞RNA测序(scRNA-seq)数据受到死亡和濒死细胞释放的核酸物质所导致的环境污染物的困扰。这种物质混入缓冲液中,并与细胞一起被共同封装,导致信噪比降低。尽管存在事后去除环境污染物的计算方法,但从低质量数据源生成高质量数据的算法的可靠性仍不确定。在这里,我们通过一组基于污染物的定量指标在数据过滤之前评估数据质量,这些指标比标准指标更有效地评估数据质量。通过一系列对照实验,我们报告了可以通过细胞固定、改进细胞加载、微流体稀释以及细胞核与细胞制备等方法,在组织解离之外最大限度地减少环境污染物的改进措施;其中许多参数在商业平台上无法获取。我们为最终用户提供有关因素的见解,这些因素可以指导他们在最小化环境污染物的优化方面做出决策,以及评估数据质量的指标。