Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Dr. Bohr-Gasse 3, Vienna 1030, Austria.
Plant Cell. 2023 Dec 21;36(1):85-111. doi: 10.1093/plcell/koad233.
Long noncoding RNAs (lncRNAs) are understudied and underannotated in plants. In mammals, lncRNA loci are nearly as ubiquitous as protein-coding genes, and their expression is highly variable between individuals of the same species. Using Arabidopsis thaliana as a model, we aimed to elucidate the true scope of lncRNA transcription across plants from different regions and study its natural variation. We used transcriptome deep sequencing data sets spanning hundreds of natural accessions and several developmental stages to create a population-wide annotation of lncRNAs, revealing thousands of previously unannotated lncRNA loci. While lncRNA transcription is ubiquitous in the genome, most loci appear to be actively silenced and their expression is extremely variable between natural accessions. This high expression variability is largely caused by the high variability of repressive chromatin levels at lncRNA loci. High variability was particularly common for intergenic lncRNAs (lincRNAs), where pieces of transposable elements (TEs) present in 50% of these lincRNA loci are associated with increased silencing and variation, and such lncRNAs tend to be targeted by the TE silencing machinery. We created a population-wide lncRNA annotation in Arabidopsis and improved our understanding of plant lncRNA genome biology, raising fundamental questions about what causes transcription and silencing across the genome.