National Institute of Plant Genome Research in New Delhi, India.
National Institute of Plant Genome Research in New Delhi.
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa322.
Plant transcriptome encompasses numerous endogenous, regulatory non-coding RNAs (ncRNAs) that play a major biological role in regulating key physiological mechanisms. While studies have shown that ncRNAs are extremely diverse and ubiquitous, the functions of the vast majority of ncRNAs are still unknown. With ever-increasing ncRNAs under study, it is essential to identify, categorize and annotate these ncRNAs on a genome-wide scale. The use of high-throughput RNA sequencing (RNA-seq) technologies provides a broader picture of the non-coding component of transcriptome, enabling the comprehensive identification and annotation of all major ncRNAs across samples. However, the detection of known and emerging class of ncRNAs from RNA-seq data demands complex computational methods owing to their unique as well as similar characteristics. Here, we discuss major plant endogenous, regulatory ncRNAs in an RNA sample followed by computational strategies applied to discover each class of ncRNAs using RNA-seq. We also provide a collection of relevant software packages and databases to present a comprehensive bioinformatics toolbox for plant ncRNA researchers. We assume that the discussions in this review will provide a rationale for the discovery of all major categories of plant ncRNAs.
植物转录组包含大量内源性、调节性非编码 RNA(ncRNA),它们在调节关键生理机制方面发挥着重要的生物学作用。虽然研究表明 ncRNA 极其多样化且普遍存在,但绝大多数 ncRNA 的功能仍不清楚。随着越来越多的 ncRNA 受到研究,在全基因组范围内识别、分类和注释这些 ncRNA 至关重要。使用高通量 RNA 测序(RNA-seq)技术可以更全面地了解转录组的非编码成分,从而全面识别和注释所有样本中的主要 ncRNA。然而,由于其独特和相似的特征,从 RNA-seq 数据中检测已知和新兴的 ncRNA 类别需要复杂的计算方法。在这里,我们讨论了 RNA 样本中主要的植物内源性、调节性 ncRNA,并讨论了应用于使用 RNA-seq 发现每一类 ncRNA 的计算策略。我们还提供了相关软件包和数据库的集合,为植物 ncRNA 研究人员提供了一个全面的生物信息学工具包。我们假设,本综述中的讨论将为发现所有主要类别的植物 ncRNA 提供依据。