New England Biolabs Inc., 240 County Road, Ipswich, MA 01938-2723, USA.
Science. 2017 Feb 17;355(6326):752-756. doi: 10.1126/science.aai8690.
Mutations in somatic cells generate a heterogeneous genomic population and may result in serious medical conditions. Although cancer is typically associated with somatic variations, advances in DNA sequencing indicate that cell-specific variants affect a number of phenotypes and pathologies. Here, we show that mutagenic damage accounts for the majority of the erroneous identification of variants with low to moderate (1 to 5%) frequency. More important, we found signatures of damage in most sequencing data sets in widely used resources, including the 1000 Genomes Project and The Cancer Genome Atlas, establishing damage as a pervasive cause of sequencing errors. The extent of this damage directly confounds the determination of somatic variants in these data sets.
体细胞突变会产生异质的基因组群体,并可能导致严重的医疗状况。尽管癌症通常与体细胞变异有关,但 DNA 测序技术的进步表明,细胞特异性变异会影响许多表型和病理。在这里,我们表明,诱变损伤解释了大多数错误识别低至中度(1 至 5%)频率变体的原因。更重要的是,我们在广泛使用的资源中发现了大多数测序数据集都存在损伤特征,包括 1000 基因组计划和癌症基因组图谱,从而确定损伤是测序错误的普遍原因。这种损伤的程度直接混淆了这些数据集体细胞变异的确定。