Center for Bioinformatics and Computational Biology, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19716, USA.
Donald Danforth Plant Science Center, Saint Louis, MO 63132, USA.
Plant Physiol. 2021 Apr 23;185(4):1764-1782. doi: 10.1093/plphys/kiab001.
In monocots other than maize (Zea mays) and rice (Oryza sativa), the repertoire and diversity of microRNAs (miRNAs) and the populations of phased, secondary, small interfering RNAs (phasiRNAs) are poorly characterized. To remedy this, we sequenced small RNAs (sRNA) from vegetative and dissected inflorescence tissue in 28 phylogenetically diverse monocots and from several early-diverging angiosperm lineages, as well as publicly available data from 10 additional monocot species. We annotated miRNAs, small interfering RNAs (siRNAs) and phasiRNAs across the monocot phylogeny, identifying miRNAs apparently lost or gained in the grasses relative to other monocot families, as well as a number of transfer RNA fragments misannotated as miRNAs. Using our miRNA database cleaned of these misannotations, we identified conservation at the 8th, 9th, 19th, and 3'-end positions that we hypothesize are signatures of selection for processing, targeting, or Argonaute sorting. We show that 21-nucleotide (nt) reproductive phasiRNAs are far more numerous in grass genomes than other monocots. Based on sequenced monocot genomes and transcriptomes, DICER-LIKE5, important to 24-nt phasiRNA biogenesis, likely originated via gene duplication before the diversification of the grasses. This curated database of phylogenetically diverse monocot miRNAs, siRNAs, and phasiRNAs represents a large collection of data that should facilitate continued exploration of sRNA diversification in flowering plants.
除了玉米(Zea mays)和水稻(Oryza sativa)之外,单子叶植物中的 microRNAs(miRNAs)和相错、次级、小干扰 RNA(phasiRNAs)的 repertoire 和多样性特征描述甚少。为了弥补这一不足,我们对 28 种系统发育多样化的单子叶植物和几个早期分化的被子植物谱系的营养和解剖花序组织中的小 RNA(sRNA)进行了测序,还对来自另外 10 种单子叶植物的公共可用数据进行了测序。我们在单子叶植物系统发育范围内注释了 miRNAs、小干扰 RNA(siRNAs)和 phasiRNAs,鉴定了相对于其他单子叶植物科,在禾本科中显然丢失或获得的 miRNAs,以及一些被错误注释为 miRNAs 的 tRNA 片段。我们使用经过这些错误注释清理后的 miRNA 数据库,鉴定了 8 位、9 位、19 位和 3'-末端位置的保守性,我们假设这些位置是加工、靶向或 Argonaute 分选的选择特征。我们表明,21 个核苷酸(nt)的生殖性 phasiRNAs 在禾本科基因组中的数量远远多于其他单子叶植物。基于测序的单子叶植物基因组和转录组,对 24-nt phasiRNA 生物发生至关重要的 DICER-LIKE5 很可能在禾本科多样化之前通过基因复制起源。这个经过系统发育多样化的单子叶植物 miRNAs、siRNAs 和 phasiRNAs 的精心整理的数据库代表了大量数据,应该有助于进一步探索开花植物中 sRNA 的多样化。