Ji Cong, Miao Zong, He Xionglei
State Key Laboratory of Biocontrol, College of Ecology and Evolution, Sun Yat-sen University, Guangzhou, 510275, China.
PLoS One. 2015 Apr 13;10(4):e0123789. doi: 10.1371/journal.pone.0123789. eCollection 2015.
Due to the growth of interest in single-cell genomics, computational methods for distinguishing true variants from artifacts are highly desirable. While special attention has been paid to false positives in variant or mutation calling from single-cell sequencing data, an equally important but often neglected issue is that of false negatives derived from allele dropout during the amplification of single cell genomes. In this paper, we propose a simple strategy to reduce the false negatives in single-cell sequencing data analysis. Simulation results show that this method is highly reliable, with an error rate of 4.94×10-5, which is orders of magnitude lower than the expected false negative rate (~34%) estimated from a single-cell exome dataset, though the method is limited by the low SNP density in the human genome. We applied this method to analyze the exome data of a few dozen single tumor cells generated in previous studies, and extracted cell specific mutation information for a small set of sites. Interestingly, we found that there are difficulties in using the classical clonal model of tumor cell growth to explain the mutation patterns observed in some tumor cells.
由于对单细胞基因组学的兴趣日益增长,非常需要用于从伪影中区分真正变异的计算方法。虽然在从单细胞测序数据进行变异或突变检测时,已经特别关注了假阳性,但一个同样重要但经常被忽视的问题是,在单细胞基因组扩增过程中由于等位基因缺失导致的假阴性问题。在本文中,我们提出了一种简单的策略来减少单细胞测序数据分析中的假阴性。模拟结果表明,该方法高度可靠,错误率为4.94×10-5,比从单细胞外显子组数据集中估计的预期假阴性率(约34%)低几个数量级,尽管该方法受到人类基因组中低SNP密度的限制。我们应用此方法分析了先前研究中生成的几十个单肿瘤细胞的外显子组数据,并提取了一小部分位点的细胞特异性突变信息。有趣的是,我们发现使用经典的肿瘤细胞生长克隆模型来解释在一些肿瘤细胞中观察到的突变模式存在困难。