Zhang Ning, Shi Shundi, Yoo Barney, Yuan Xiaohong, Li Wenjia, Zhang Shenglong
Department of Biological and Chemical Sciences, New York Institute of Technology; Department of Chemical Engineering, Columbia University.
Department of Chemical Engineering, Columbia University.
J Vis Exp. 2020 Jul 10(161). doi: 10.3791/61281.
Mass spectrometry (MS)-based sequencing approaches have been shown to be useful in direct sequencing RNA without the need for a complementary DNA (cDNA) intermediate. However, such approaches are rarely applied as a de novo RNA sequencing method, but used mainly as a tool that can assist in quality assurance for confirming known sequences of purified single-stranded RNA samples. Recently, we developed a direct RNA sequencing method by integrating a 2-dimensional mass-retention time hydrophobic end-labeling strategy into MS-based sequencing (2D-HELS MS Seq). This method is capable of accurately sequencing single RNA sequences as well as mixtures containing up to 12 distinct RNA sequences. In addition to the four canonical ribonucleotides (A, C, G, and U), the method has the capacity to sequence RNA oligonucleotides containing modified nucleotides. This is possible because the modified nucleobase either has an intrinsically unique mass that can help in its identification and its location in the RNA sequence, or can be converted into a product with a unique mass. In this study, we have used RNA, incorporating two representative modified nucleotides (pseudouridine (Ψ) and 5-methylcytosine (mC)), to illustrate the application of the method for the de novo sequencing of a single RNA oligonucleotide as well as a mixture of RNA oligonucleotides, each with a different sequence and/or modified nucleotides. The procedures and protocols described here to sequence these model RNAs will be applicable to other short RNA samples (<35 nt) when using a standard high-resolution LC-MS system, and can also be used for sequence verification of modified therapeutic RNA oligonucleotides. In the future, with the development of more robust algorithms and with better instruments, this method could allow sequencing of more complex biological samples.
基于质谱(MS)的测序方法已被证明可用于直接对RNA进行测序,而无需互补DNA(cDNA)中间体。然而,此类方法很少被用作从头RNA测序方法,主要用作一种有助于质量保证的工具,用于确认纯化的单链RNA样品的已知序列。最近,我们通过将二维质量保留时间疏水末端标记策略整合到基于MS的测序中(二维HELS MS Seq),开发了一种直接RNA测序方法。该方法能够准确地对单个RNA序列以及包含多达12种不同RNA序列的混合物进行测序。除了四种标准核糖核苷酸(A、C、G和U)外,该方法还能够对含有修饰核苷酸的RNA寡核苷酸进行测序。这是可能的,因为修饰的核碱基要么具有本质上独特的质量,有助于其识别及其在RNA序列中的位置,要么可以转化为具有独特质量的产物。在本研究中,我们使用了包含两种代表性修饰核苷酸(假尿苷(Ψ)和5-甲基胞嘧啶(mC))的RNA,来说明该方法在单个RNA寡核苷酸以及RNA寡核苷酸混合物的从头测序中的应用,每个混合物具有不同的序列和/或修饰核苷酸。此处描述的对这些模型RNA进行测序的程序和方案,在使用标准高分辨率LC-MS系统时将适用于其他短RNA样品(<35 nt),也可用于修饰的治疗性RNA寡核苷酸的序列验证。未来,随着更强大算法的开发和更好仪器的出现,该方法可能允许对更复杂的生物样品进行测序。