Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA.
Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA.
Genome Res. 2018 Dec;28(12):1901-1918. doi: 10.1101/gr.238543.118. Epub 2018 Nov 20.
Mutation data reveal the dynamic equilibrium between DNA damage and repair processes in cells and are indispensable to the understanding of age-related diseases, tumor evolution, and the acquisition of drug resistance. However, available genome-wide methods have a limited ability to resolve rare somatic variants and the relationships between these variants. Here, we present lineage sequencing, a new genome sequencing approach that enables somatic event reconstruction by providing quality somatic mutation call sets with resolution as high as the single-cell level in subject lineages. Lineage sequencing entails sampling single cells from a population and sequencing subclonal sample sets derived from these cells such that knowledge of relationships among the cells can be used to jointly call variants across the sample set. This approach integrates data from multiple sequence libraries to support each variant and precisely assigns mutations to lineage segments. We applied lineage sequencing to a human colon cancer cell line with a DNA polymerase epsilon () proofreading deficiency (HT115) and a human retinal epithelial cell line immortalized by constitutive telomerase expression (RPE1). Cells were cultured under continuous observation to link observed single-cell phenotypes with single-cell mutation data. The high sensitivity, specificity, and resolution of the data provide a unique opportunity for quantitative analysis of variation in mutation rate, spectrum, and correlations among variants. Our data show that mutations arrive with nonuniform probability across sublineages and that DNA lesion dynamics may cause strong correlations between certain mutations.
突变数据揭示了细胞内 DNA 损伤和修复过程的动态平衡,对于理解与年龄相关的疾病、肿瘤进化以及获得耐药性至关重要。然而,现有的全基因组方法在解析罕见的体细胞变异和这些变异之间的关系方面能力有限。在这里,我们提出了谱系测序,这是一种新的基因组测序方法,通过提供质量高的体细胞突变调用集,分辨率高达单细胞水平,从而能够重建体细胞事件。谱系测序需要从群体中采样单个细胞,并对来自这些细胞的亚克隆样本集进行测序,以便可以利用细胞之间的关系来共同调用样本集中的变体。该方法整合了来自多个序列文库的数据来支持每个变体,并精确地将突变分配到谱系片段中。我们将谱系测序应用于一个具有 DNA 聚合酶 ε()校对缺陷的人类结肠癌细胞系(HT115)和一个由组成型端粒酶表达永生化的人视网膜上皮细胞系(RPE1)。对细胞进行连续观察,将观察到的单细胞表型与单细胞突变数据联系起来。数据的高灵敏度、特异性和分辨率为定量分析突变率、突变谱以及变体之间的相关性提供了独特的机会。我们的数据表明,突变以非均匀的概率出现在亚谱系中,并且 DNA 损伤动态可能导致某些突变之间存在强烈的相关性。