Hefei National Laboratory for Physical Sciences at Microscale, School of Life Sciences, University of Science and Technology of China, Hefei, China.
Department of Microbiology and Plant Pathology, University of California, Riverside, CA, USA.
Methods Mol Biol. 2022;2316:251-274. doi: 10.1007/978-1-0716-1464-8_22.
Viroids are single-stranded circular RNA molecules that cause diseases in plants and do not encode any protein. Classical approaches for the identification of new viroids are challenging for many plant pathology laboratories as viroid cDNA synthesis and sequencing require purification and enrichment of the naked viroid RNA by two-dimensional gel electrophoresis. Conventional metagenomic approaches are not effective for viroid discovery because the total number of known viroids is small, and distinct viroids share limited nucleotide sequence similarity. In this chapter, we describe a homology-independent approach for the identification of both known and new viroids in disease samples. It is known that viroid infection of plants triggers production of overlapping viroid-derived small interfering RNAs (siRNAs) targeting the entire genome with high densities and that replication of viroids occurs via a rolling-circle mechanism to yield head-to-tail multiple-repeat replicative intermediates. Our approach involves deep sequencing of either long or small RNAs in a disease sample followed by viroid identification with a unique computational algorithm, progressive filtering of overlapping small RNAs (PFOR). Among the sequenced total small RNAs, PFOR retains viroid-derived siRNAs for viroid genome assembly by progressively eliminating nonoverlapping small RNAs and those that overlap but cannot be assembled into a direct repeat RNA, a unique feature of viroid RNA replication. In contrast, long RNAs sequenced after depletion of ribosomal RNAs are cut into 21-nucleotide virtual overlapping small RNAs with the algorithm SLS (splitting longer read into shorter fragments) before PFOR. We show that new viroids or viroids from the two known families are readily identified and their full-length sequences recovered by PFOR from long or small RNAs sequenced directly from infected plants. We propose that our approach can be used for viroid discovery in both plants and potentially animals since PFOR identifies viroids by searching for circular RNAs or a unique replication intermediate of the viroid genome in a sequence homology-independent manner.
类病毒是单链环状 RNA 分子,它们会导致植物患病,且不编码任何蛋白质。对于许多植物病理学实验室来说,鉴定新类病毒的经典方法具有挑战性,因为类病毒 cDNA 的合成和测序需要通过二维凝胶电泳对裸露的类病毒 RNA 进行纯化和富集。传统的宏基因组方法对于类病毒的发现并不有效,因为已知类病毒的总数较少,而且不同的类病毒共享的核苷酸序列相似性有限。在本章中,我们描述了一种用于鉴定疾病样本中已知和新类病毒的不依赖同源性的方法。已知植物类病毒感染会触发针对整个基因组的重叠类病毒衍生小干扰 RNA(siRNA)的产生,并且类病毒的复制通过滚环机制进行,以产生头尾相连的多重复制中间体。我们的方法涉及对疾病样本中的长链或小 RNA 进行深度测序,然后使用独特的计算算法对类病毒进行鉴定,即重叠小 RNA 的渐进式过滤(PFOR)。在测序的总小 RNA 中,PFOR 通过渐进式消除非重叠小 RNA 和那些可以组装成直接重复 RNA 的但不能重叠的小 RNA,保留类病毒衍生的 siRNA 用于类病毒基因组组装,这是类病毒 RNA 复制的独特特征。相比之下,在用算法 SLS(将较长的 RNA 切成较短的片段)对核糖体 RNA 耗尽后的长 RNA 进行测序后,会将其切成 21 个核苷酸的虚拟重叠小 RNA,然后再进行 PFOR。我们表明,通过 PFOR 可以从直接从感染植物中测序的长或小 RNA 中,快速鉴定出新的类病毒或两种已知家族的类病毒,并获得它们的全长序列。我们提出,我们的方法可用于植物和潜在动物中的类病毒发现,因为 PFOR 通过以序列同源性独立的方式搜索圆形 RNA 或类病毒基因组的独特复制中间体来鉴定类病毒。