Institute of Zoology, Chinese Academy of Sciences, Beichen West Road, 100101, Beijing, Country.
Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Zhongguancun East Road, 100190, Beijing, China.
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac092.
The rapid development of single-cell DNA sequencing (scDNA-seq) technology has greatly enhanced the resolution of tumor cell profiling, providing an unprecedented perspective in characterizing intra-tumoral heterogeneity and understanding tumor progression and metastasis. However, prominent algorithms for constructing tumor phylogeny based on scDNA-seq data usually only take single nucleotide variations (SNVs) as markers, failing to consider the effect caused by copy number alterations (CNAs). Here, we propose BiTSC$^2$, Bayesian inference of Tumor clonal Tree by joint analysis of Single-Cell SNV and CNA data. BiTSC$^2$ takes raw reads from scDNA-seq as input, accounts for the overlapping of CNA and SNV, models allelic dropout rate, sequencing errors and missing rate, as well as assigns single cells into subclones. By applying Markov Chain Monte Carlo sampling, BiTSC$^2$ can simultaneously estimate the subclonal scCNA and scSNV genotype matrices, subclonal assignments and tumor subclonal evolutionary tree. In comparison with existing methods on synthetic and real tumor data, BiTSC$^2$ shows high accuracy in genotype recovery, subclonal assignment and tree reconstruction. BiTSC$^2$ also performs robustly in dealing with scDNA-seq data with low sequencing depth and variant missing rate. BiTSC$^2$ software is available at https://github.com/ucasdp/BiTSC2.
单细胞 DNA 测序(scDNA-seq)技术的快速发展极大地提高了肿瘤细胞剖析的分辨率,为刻画肿瘤内异质性和理解肿瘤进展和转移提供了前所未有的视角。然而,基于 scDNA-seq 数据构建肿瘤系统发生树的主要算法通常仅将单核苷酸变异(SNVs)作为标记,而忽略了拷贝数改变(CNAs)所造成的影响。在这里,我们提出了 BiTSC$^2$,即通过单细胞 SNV 和 CNA 数据的联合分析对肿瘤克隆树进行贝叶斯推断的方法。BiTSC$^2$ 以 scDNA-seq 的原始读数作为输入,考虑了 CNA 和 SNV 的重叠,对等位基因缺失率、测序错误率和缺失率进行建模,并将单细胞分配到亚克隆中。通过应用马尔可夫链蒙特卡罗采样,BiTSC$^2$ 可以同时估计亚克隆 scCNA 和 scSNV 基因型矩阵、亚克隆分配和肿瘤亚克隆进化树。与在合成和真实肿瘤数据上的现有方法相比,BiTSC$^2$ 在基因型恢复、亚克隆分配和树重建方面表现出了较高的准确性。BiTSC$^2$ 在处理低测序深度和变异缺失率的 scDNA-seq 数据时也表现稳健。BiTSC$^2$ 软件可在 https://github.com/ucasdp/BiTSC2 上获取。