Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China.
Genomics. 2012 May;99(5):292-8. doi: 10.1016/j.ygeno.2012.02.003. Epub 2012 Feb 20.
Vertebrate genomes encode thousands of non-coding RNAs, including short non-coding RNAs (such as microRNAs) and long non-coding RNAs (lncRNAs). Chicken (Gallus gallus) is an important model organism for developmental biology, and the recently assembled chicken genome sequence will facilitate understanding of the functional roles of non-coding RNA genes during development. This study reports the first systematic identification of lncRNAs by RNA-Seq sampling of the transcriptome during chicken muscle development. A computational approach was used to identify 281 new intergenic lncRNAs in the chicken genome. In general, the novel lncRNAs are less conserved than protein-coding genes but slightly more conserved than random non-coding sequences. The study provides an initial chicken lncRNA catalog and greatly increases the number of chicken ncRNAs in the non-protein coding RNA database. Furthermore, the computational pipeline presented here will be useful for characterizing lncRNAs obtained from deep sequencing data.