Bioinformatics and Genomics Graduate Program, Penn State University, University Park, Pennsylvania 16802, USA.
Department of Statistics, Penn State University, University Park, Pennsylvania 16802, USA.
Genome Res. 2018 Dec;28(12):1767-1778. doi: 10.1101/gr.241257.118. Epub 2018 Nov 6.
DNA conformation may deviate from the classical B-form in ∼13% of the human genome. Non-B DNA regulates many cellular processes; however, its effects on DNA polymerization speed and accuracy have not been investigated genome-wide. Such an inquiry is critical for understanding neurological diseases and cancer genome instability. Here, we present the first simultaneous examination of DNA polymerization kinetics and errors in the human genome sequenced with Single-Molecule Real-Time (SMRT) technology. We show that polymerization speed differs between non-B and B-DNA: It decelerates at G-quadruplexes and fluctuates periodically at disease-causing tandem repeats. Analyzing polymerization kinetics profiles, we predict and validate experimentally non-B DNA formation for a novel motif. We demonstrate that several non-B motifs affect sequencing errors (e.g., G-quadruplexes increase error rates), and that sequencing errors are positively associated with polymerase slowdown. Finally, we show that highly divergent G4 motifs have pronounced polymerization slowdown and high sequencing error rates, suggesting similar mechanisms for sequencing errors and germline mutations.
DNA 构象在人类基因组中约有 13%偏离经典的 B 构象。非 B DNA 调节许多细胞过程;然而,其对 DNA 聚合速度和准确性的影响尚未在全基因组范围内进行研究。这种探究对于理解神经疾病和癌症基因组不稳定性至关重要。在这里,我们首次同时检查了使用单分子实时 (SMRT) 技术测序的人类基因组中的 DNA 聚合动力学和错误。我们表明,聚合速度在非 B-DNA 和 B-DNA 之间存在差异:它在 G-四联体处减速,并在致病串联重复处周期性波动。通过分析聚合动力学谱,我们预测并实验验证了一个新基序的非 B-DNA 形成。我们证明了几个非 B 基序会影响测序错误(例如,G-四联体会增加错误率),并且测序错误与聚合酶减速呈正相关。最后,我们表明高度变异的 G4 基序具有明显的聚合减速和高测序错误率,这表明测序错误和种系突变具有相似的机制。