Nielsen Henrik
Department of Cellular and Molecular Medicine, The Panum Institute, University of Copenhagen, Copenhagen, Denmark.
Methods Mol Biol. 2011;703:3-14. doi: 10.1007/978-1-59745-248-9_1.
The application of new and less biased methods to study the transcriptional output from genomes, such as tiling arrays and deep sequencing, has revealed that most of the genome is transcribed and that there is substantial overlap of transcripts derived from the two strands of DNA. In protein coding regions, the map of transcripts is very complex due to small transcripts from the flanking ends of the transcription unit, the use of multiple start and stop sites for the main transcript, production of multiple functional RNA molecules from the same primary transcript, and RNA molecules made by independent transcription from within the unit. In genomic regions separating those that encode proteins or highly abundant RNA molecules with known function, transcripts are generally of low abundance and short-lived. In most of these cases, it is unclear to what extent a function is related to transcription per se or to the RNA products.
应用新的且偏差较小的方法来研究基因组的转录输出,如平铺阵列和深度测序,已揭示出基因组的大部分区域都被转录,并且来自DNA两条链的转录本存在大量重叠。在蛋白质编码区域,转录本图谱非常复杂,这是由于转录单元侧翼末端的小转录本、主要转录本使用多个起始和终止位点、同一初级转录本产生多个功能性RNA分子,以及转录单元内部独立转录产生的RNA分子。在分隔那些编码蛋白质或具有已知功能的高丰度RNA分子的基因组区域中,转录本通常丰度较低且寿命较短。在大多数这些情况下,尚不清楚功能与转录本身或RNA产物的相关程度。