Savva Yiannis A, Laurent Georges St, Reenan Robert A
Department of Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI, 02912, USA.
Methods Mol Biol. 2016;1358:255-68. doi: 10.1007/978-1-4939-3067-8_15.
Adenosine (A)-to-inosine (I) RNA editing is a fundamental posttranscriptional modification that ensures the deamination of A-to-I in double-stranded (ds) RNA molecules. Intriguingly, the A-to-I RNA editing system is particularly active in the nervous system of higher eukaryotes, altering a plethora of noncoding and coding sequences. Abnormal RNA editing is highly associated with many neurological phenotypes and neurodevelopmental disorders. However, the molecular mechanisms underlying RNA editing-mediated pathogenesis still remain enigmatic and have attracted increasing attention from researchers. Over the last decade, methods available to perform genome-wide transcriptome analysis, have evolved rapidly. Within the RNA editing field researchers have adopted next-generation sequencing technologies to identify RNA-editing sites within genomes and to elucidate the underlying process. However, technical challenges associated with editing site discovery have hindered efforts to uncover comprehensive editing site datasets, resulting in the general perception that the collections of annotated editing sites represent only a small minority of the total number of sites in a given organism, tissue, or cell type of interest. Additionally to doubts about sensitivity, existing RNA-editing site lists often contain high percentages of false positives, leading to uncertainty about their validity and usefulness in downstream studies. An accurate investigation of A-to-I editing requires properly validated datasets of editing sites with demonstrated and transparent levels of sensitivity and specificity. Here, we describe a high signal-to-noise method for RNA-editing site detection using single-molecule sequencing (SMS). With this method, authentic RNA-editing sites may be differentiated from artifacts. Machine learning approaches provide a procedure to improve upon and experimentally validate sequencing outcomes through use of computationally predicted, iterative feedback loops. Subsequent use of extensive Sanger sequencing validations can generate accurate editing site lists. This approach has broad application and accurate genome-wide editing analysis of various tissues from clinical specimens or various experimental organisms is now a possibility.
腺苷(A)到肌苷(I)的RNA编辑是一种基本的转录后修饰,可确保双链(ds)RNA分子中的A脱氨基变成I。有趣的是,A到I的RNA编辑系统在高等真核生物的神经系统中特别活跃,可改变大量非编码和编码序列。异常的RNA编辑与许多神经表型和神经发育障碍高度相关。然而,RNA编辑介导的发病机制的分子机制仍然不明,并且已经引起了研究人员越来越多的关注。在过去十年中,可用于进行全基因组转录组分析的方法迅速发展。在RNA编辑领域,研究人员采用了下一代测序技术来识别基因组中的RNA编辑位点并阐明其潜在过程。然而,与编辑位点发现相关的技术挑战阻碍了揭示全面编辑位点数据集的努力,导致人们普遍认为注释的编辑位点集合仅占给定生物体、组织或感兴趣细胞类型中位点总数的一小部分。除了对灵敏度的怀疑之外,现有的RNA编辑位点列表通常包含高比例的假阳性,导致其在下游研究中的有效性和实用性存在不确定性。对A到I编辑的准确研究需要经过适当验证的编辑位点数据集,其灵敏度和特异性水平已得到证明且透明。在这里,我们描述了一种使用单分子测序(SMS)检测RNA编辑位点的高信噪比方法。通过这种方法,可以将真实的RNA编辑位点与伪影区分开来。机器学习方法提供了一种通过使用计算预测的迭代反馈回路来改进和实验验证测序结果的程序。随后使用广泛的桑格测序验证可以生成准确的编辑位点列表。这种方法具有广泛的应用,现在有可能对临床标本或各种实验生物体的各种组织进行准确的全基因组编辑分析。