Traverse Charles C, Ochman Howard
Department of Integrative Biology, University of Texas, Austin, Texas, USA.
Department of Integrative Biology, University of Texas, Austin, Texas, USA
mBio. 2017 Aug 29;8(4):e01230-17. doi: 10.1128/mBio.01230-17.
Advances in sequencing technologies have enabled direct quantification of genome-wide errors that occur during RNA transcription. These errors occur at rates that are orders of magnitude higher than rates during DNA replication, but due to technical difficulties such measurements have been limited to single-base substitutions and have not yet quantified the scope of transcription insertions and deletions. Previous reporter gene assay findings suggested that transcription indels are produced exclusively by elongation complex slippage at homopolymeric runs, so we enumerated indels across the protein-coding transcriptomes of and , which differ widely in their genomic base compositions and incidence of repeat regions. As anticipated from prior assays, transcription insertions prevailed in homopolymeric runs of A and T; however, transcription deletions arose in much more complex sequences and were rarely associated with homopolymeric runs. By reconstructing the relocated positions of the elongation complex as inferred from the sequences inserted or deleted during transcription, we show that continuation of transcription after slippage hinges on the degree of nucleotide complementarity within the RNA:DNA hybrid at the new DNA template location. The high level of mistakes generated during transcription can result in the accumulation of malfunctioning and misfolded proteins which can alter global gene regulation and in the expenditure of energy to degrade these nonfunctional proteins. The transcriptome-wide occurrence of base substitutions has been elucidated in bacteria, but information on transcription insertions and deletions-errors that potentially have more dire effects on protein function-is limited to reporter gene constructs. Here, we capture the transcriptome-wide spectrum of insertions and deletions in and and show that they occur at rates approaching those of base substitutions. Knowledge of the full extent of sequences subject to transcription indels supports a new model of bacterial transcription slippage, one that relies on the number of complementary bases between the transcript and the DNA template to which it slipped.
测序技术的进步使得直接定量RNA转录过程中发生的全基因组错误成为可能。这些错误的发生率比DNA复制过程中的错误率高出几个数量级,但由于技术困难,此类测量仅限于单碱基替换,尚未对转录插入和缺失的范围进行定量。先前的报告基因检测结果表明,转录插入缺失仅由延伸复合物在同聚物序列处的滑动产生,因此我们在[具体物种1]和[具体物种2]的蛋白质编码转录组中枚举了插入缺失,这两个物种在基因组碱基组成和重复区域发生率上有很大差异。正如先前检测所预期的那样,转录插入在A和T的同聚物序列中占主导;然而,转录缺失出现在更为复杂的序列中,并且很少与同聚物序列相关。通过重建从转录过程中插入或缺失的序列推断出的延伸复合物的重新定位位置,我们表明,滑动后转录的继续取决于新DNA模板位置处RNA:DNA杂交体内核苷酸互补程度。转录过程中产生的高水平错误会导致功能失调和错误折叠的蛋白质积累,这可能会改变全局基因调控,并消耗能量来降解这些无功能的蛋白质。细菌中碱基替换在转录组范围内的发生情况已得到阐明,但关于转录插入和缺失(可能对蛋白质功能产生更严重影响的错误)的信息仅限于报告基因构建体。在这里,我们捕捉了[具体物种1]和[具体物种2]转录组范围内的插入和缺失谱,并表明它们的发生率接近碱基替换的发生率。了解受转录插入缺失影响的完整序列范围支持了一种新的细菌转录滑动模型,该模型依赖于转录本与其滑动到的DNA模板之间互补碱基的数量。