Kuksa Pavel P, Leung Yuk Yee, Vandivier Lee E, Anderson Zachary, Gregory Brian D, Wang Li-San
Department of Pathology and Laboratory Medicine, University of Pennsylvania, D102 Richards Medical Research Bldg., 3700 Hamilton Walk, Philadelphia, PA, 19104, USA.
Penn Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, 19104, USA.
Methods Mol Biol. 2017;1562:211-229. doi: 10.1007/978-1-4939-6807-7_14.
RNA molecules are often altered post-transcriptionally by the covalent modification of their nucleotides. These modifications are known to modulate the structure, function, and activity of RNAs. When reverse transcribed into cDNA during RNA sequencing library preparation, atypical (modified) ribonucleotides that affect Watson-Crick base pairing will interfere with reverse transcriptase (RT), resulting in cDNA products with mis-incorporated bases or prematurely terminated RNA products. These interactions with RT can therefore be inferred from mismatch patterns in the sequencing reads, and are distinguishable from simple base-calling errors, single-nucleotide polymorphisms (SNPs), or RNA editing sites. Here, we describe a computational protocol for the in silico identification of modified ribonucleotides from RT-based RNA-seq read-out using the High-throughput Analysis of Modified Ribonucleotides (HAMR) software. HAMR can identify these modifications transcriptome-wide with single nucleotide resolution, and also differentiate between different types of modifications to predict modification identity. Researchers can use HAMR to identify and characterize RNA modifications using RNA-seq data from a variety of common RT-based sequencing protocols such as Poly(A), total RNA-seq, and small RNA-seq.
RNA分子通常在转录后通过其核苷酸的共价修饰而发生改变。已知这些修饰可调节RNA的结构、功能和活性。在RNA测序文库制备过程中,当逆转录成cDNA时,影响沃森-克里克碱基配对的非典型(修饰)核糖核苷酸会干扰逆转录酶(RT),导致cDNA产物出现碱基错配或RNA产物提前终止。因此,这些与RT的相互作用可以从测序读数中的错配模式推断出来,并且与简单的碱基识别错误、单核苷酸多态性(SNP)或RNA编辑位点区分开来。在这里,我们描述了一种计算方法,用于使用基于高通量分析修饰核糖核苷酸(HAMR)软件,从基于RT的RNA-seq读数中在计算机上识别修饰的核糖核苷酸。HAMR可以以单核苷酸分辨率在全转录组范围内识别这些修饰,还可以区分不同类型的修饰以预测修饰的同一性。研究人员可以使用HAMR,通过来自各种常见的基于RT的测序方案(如Poly(A)、全RNA-seq和小RNA-seq)的RNA-seq数据,来识别和表征RNA修饰。