Electrical and Information College of Jinan University, Zhuhai, China.
Department of Computer Science, Hong Kong Baptist University, Hong Kong SAR, China.
Sci Rep. 2017 Aug 29;7(1):9666. doi: 10.1038/s41598-017-09139-8.
Variations in DNA copy number carry important information on genome evolution and on the regulation of DNA replication in cancer cells. The rapid development of single-cell sequencing technology enables exploration of gene-expression heterogeneity among single cells, providing important information on cell evolution. Evolutionary relationships in accumulated sequence data can be visualized by positioning similar cells adjacently, so that similar copy-number profiles appear as block patterns. However, single-cell DNA sequencing usually starts from a small amount of genomic material, which requires an extra amplification step to accumulate sufficient samples; this step introduces noise and makes regular pattern-finding challenging. In this paper, we propose to recover the hidden blocks within single-cell DNA-sequencing data through continuous sample permutations, such that similar samples are positioned adjacently. The permutation is guided by the total variational norm of the recovered copy-number profiles and continues until that norm is minimized, which occurs when similar samples are stacked together to reveal block patterns. An efficient numerical scheme for finding this permutation is designed, tailored from the alternating direction method of multipliers. Application of this method to both simulated and real data demonstrates its ability to recover the hidden structures of single-cell DNA sequences.
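The idea that reordering samples lowers a total-variation measure can be illustrated with a toy example. The sketch below is only an illustration under stated assumptions, not the scheme described in the abstract: it uses a simple greedy nearest-neighbour ordering in place of the ADMM-based permutation search, it measures total variation as the sum of absolute differences between adjacent rows, and the two "clones", the bin count, and the noise level are all hypothetical values chosen for the demonstration.

```python
import numpy as np


def row_total_variation(X):
    """Sum of absolute differences between vertically adjacent samples (rows)."""
    return float(np.abs(np.diff(X, axis=0)).sum())


def greedy_order(X, start=0):
    """Greedy nearest-neighbour ordering of rows.

    Assumption: this is a simple stand-in for illustration only,
    NOT the ADMM-based scheme mentioned in the abstract.
    """
    n = X.shape[0]
    order = [start]
    remaining = set(range(n)) - {start}
    while remaining:
        last = X[order[-1]]
        # Pick the unused row closest (L1 distance) to the last placed row;
        # ties are broken by row index to keep the result deterministic.
        nxt = min(remaining, key=lambda j: (np.abs(X[j] - last).sum(), j))
        order.append(nxt)
        remaining.remove(nxt)
    return np.array(order)


rng = np.random.default_rng(0)

# Two hypothetical clones with distinct copy-number profiles over 50 bins.
clone_a = np.full(50, 2.0)
clone_a[10:20] = 4.0  # hypothetical amplification
clone_b = np.full(50, 2.0)
clone_b[30:45] = 1.0  # hypothetical deletion

# 10 cells per clone, with additive noise standing in for amplification noise.
clean = np.vstack([clone_a] * 10 + [clone_b] * 10)
noisy = clean + rng.normal(scale=0.1, size=clean.shape)

# Shuffle the cells so the block structure is hidden, then try to recover it.
shuffled = noisy[rng.permutation(noisy.shape[0])]
order = greedy_order(shuffled)
reordered = shuffled[order]

tv_before = row_total_variation(shuffled)
tv_after = row_total_variation(reordered)
print(f"TV across samples before reordering: {tv_before:.2f}")
print(f"TV across samples after reordering:  {tv_after:.2f}")
```

In this toy setting, rows from the same clone differ only by noise, so placing them next to each other removes most of the large jumps between adjacent rows. A greedy ordering gives no optimality guarantee, which is one reason a principled optimization scheme is preferable on real data.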