Department of Computer Science, Brown University, Providence, RI 02912.
Department of Computer Science, Princeton University, Princeton, NJ 08540.
Cell Syst. 2020 Apr 22;10(4):323-332.e8. doi: 10.1016/j.cels.2020.04.001.
A small number of somatic mutations drive the development of cancer, but all somatic mutations are markers of the evolutionary history of a tumor. Prominent methods to construct phylogenies from single-cell sequencing data use single-nucleotide variants (SNVs) as markers but fail to adequately account for copy-number aberrations (CNAs), which can overlap SNVs and result in SNV losses. Here, we introduce SCARLET, an algorithm that infers tumor phylogenies from single-cell DNA sequencing data while accounting for both CNA-driven loss of SNVs and sequencing errors. SCARLET outperforms existing methods on simulated data, with more accurate inference of the order in which mutations were acquired and the mutations present in individual cells. Using a single-cell dataset from a patient with colorectal cancer, SCARLET constructs a tumor phylogeny that is consistent with the observed CNAs and suggests an alternate origin for the patient's metastases. SCARLET is available at: github.com/raphael-group/scarlet.
少数体细胞突变驱动癌症的发展,但所有体细胞突变都是肿瘤进化史的标志物。从单细胞测序数据构建系统发生树的突出方法使用单核苷酸变体 (SNV) 作为标记,但不能充分考虑到拷贝数异常 (CNA),CNA 可与 SNV 重叠并导致 SNV 丢失。在这里,我们引入了 SCARLET,这是一种从单细胞 DNA 测序数据推断肿瘤系统发生树的算法,同时考虑了 CNA 驱动的 SNV 丢失和测序错误。SCARLET 在模拟数据上的表现优于现有方法,更准确地推断了突变获得的顺序和单个细胞中存在的突变。使用来自结直肠癌患者的单细胞数据集,SCARLET 构建了与观察到的 CNA 一致的肿瘤系统发生树,并提示了患者转移的另一种起源。SCARLET 可在:github.com/raphael-group/scarlet。