Wilhelm Brian T, Marguerat Samuel, Watt Stephen, Schubert Falk, Wood Valerie, Goodhead Ian, Penkett Christopher J, Rogers Jane, Bähler Jürg
Cancer Research UK Fission Yeast Functional Genomics Group, Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1HH, UK.
Nature. 2008 Jun 26;453(7199):1239-43. doi: 10.1038/nature07002. Epub 2008 May 18.
Recent data from several organisms indicate that the transcribed portions of genomes are larger and more complex than expected, and that many functional properties of transcripts are based not on coding sequences but on regulatory sequences in untranslated regions or non-coding RNAs. Alternative start and polyadenylation sites and regulation of intron splicing add additional dimensions to the rich transcriptional output. This transcriptional complexity has been sampled mainly using hybridization-based methods under one or few experimental conditions. Here we applied direct high-throughput sequencing of complementary DNAs (RNA-Seq), supplemented with data from high-density tiling arrays, to globally sample transcripts of the fission yeast Schizosaccharomyces pombe, independently from available gene annotations. We interrogated transcriptomes under multiple conditions, including rapid proliferation, meiotic differentiation and environmental stress, as well as in RNA processing mutants to reveal the dynamic plasticity of the transcriptional landscape as a function of environmental, developmental and genetic factors. High-throughput sequencing proved to be a powerful and quantitative method to sample transcriptomes deeply at maximal resolution. In contrast to hybridization, sequencing showed little, if any, background noise and was sensitive enough to detect widespread transcription in >90% of the genome, including traces of RNAs that were not robustly transcribed or rapidly degraded. The combined sequencing and strand-specific array data provide rich condition-specific information on novel, mostly non-coding transcripts, untranslated regions and gene structures, thus improving the existing genome annotation. Sequence reads spanning exon-exon or exon-intron junctions give unique insight into a surprising variability in splicing efficiency across introns, genes and conditions. Splicing efficiency was largely coordinated with transcript levels, and increased transcription led to increased splicing in test genes. Hundreds of introns showed such regulated splicing during cellular proliferation or differentiation.
来自多种生物的最新数据表明,基因组的转录部分比预期的更大且更复杂,而且转录本的许多功能特性并非基于编码序列,而是基于非翻译区或非编码RNA中的调控序列。可变的起始和聚腺苷酸化位点以及内含子剪接的调控为丰富的转录输出增添了更多维度。这种转录复杂性主要是在一种或几种实验条件下使用基于杂交的方法进行采样的。在这里,我们应用互补DNA的直接高通量测序(RNA测序),并辅以高密度平铺阵列的数据,以独立于现有基因注释的方式全面采样裂殖酵母粟酒裂殖酵母的转录本。我们在多种条件下研究转录组,包括快速增殖、减数分裂分化和环境应激,以及在RNA加工突变体中,以揭示转录图谱作为环境、发育和遗传因素的函数的动态可塑性。高通量测序被证明是一种强大的定量方法,能够以最大分辨率深度采样转录组。与杂交不同,测序几乎没有背景噪声,并且灵敏度足以检测基因组中90%以上的广泛转录,包括未被稳定转录或快速降解的RNA痕迹。测序和链特异性阵列数据相结合,提供了关于新型(大多为非编码)转录本、非翻译区和基因结构的丰富的条件特异性信息,从而改进了现有的基因组注释。跨越外显子-外显子或外显子-内含子连接的序列读数为跨内含子、基因和条件的剪接效率的惊人变异性提供了独特的见解。剪接效率在很大程度上与转录本水平相协调,并且转录增加导致测试基因中的剪接增加。数百个内含子在细胞增殖或分化过程中表现出这种受调控的剪接。