Charrout Mohammed, Reinders Marcel J T, Mahfouz Ahmed
Delft Bioinformatics Lab, Delft University of Technology, Delft 2628 XE, The Netherlands.
NAR Genom Bioinform. 2020 Jul 22;2(3):lqaa053. doi: 10.1093/nargab/lqaa053. eCollection 2020 Sep.
Advances in single-cell RNA sequencing over the past decade has shifted the discussion of cell identity toward the transcriptional state of the cell. While the incredible resolution provided by single-cell RNA sequencing has led to great advances in unraveling tissue heterogeneity and inferring cell differentiation dynamics, it raises the question of which sources of variation are important for determining cellular identity. Here we show that confounding biological sources of variation, most notably the cell cycle, can distort the inference of differentiation trajectories. We show that by factorizing single cell data into distinct sources of variation, we can select a relevant set of factors that constitute the core regulators for trajectory inference, while filtering out confounding sources of variation (e.g. cell cycle) which can perturb the inferred trajectory. Script are available publicly on https://github.com/mochar/cell_variation.
在过去十年中,单细胞RNA测序技术的进步已将关于细胞身份的讨论转向细胞的转录状态。虽然单细胞RNA测序提供的惊人分辨率在揭示组织异质性和推断细胞分化动力学方面取得了巨大进展,但它也引发了一个问题,即哪些变异来源对于确定细胞身份很重要。在这里,我们表明,令人困惑的生物学变异来源,最显著的是细胞周期,会扭曲分化轨迹的推断。我们表明,通过将单细胞数据分解为不同的变异来源,我们可以选择一组相关因素,这些因素构成轨迹推断的核心调节因子,同时过滤掉可能干扰推断轨迹的混杂变异来源(例如细胞周期)。脚本可在https://github.com/mochar/cell_variation上公开获取。