Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA.
Genome Res. 2011 Feb;21(2):325-41. doi: 10.1101/gr.114595.110. Epub 2010 Dec 22.
The C. elegans genome has been completely sequenced, and the developmental anatomy of this model organism is described at single-cell resolution. Here we utilize strategies that exploit this precisely defined architecture to link gene expression to cell type. We obtained RNAs from specific cells and from each developmental stage using tissue-specific promoters to mark cells for isolation by FACS or for mRNA extraction by the mRNA-tagging method. We then generated gene expression profiles of more than 30 different cells and developmental stages using tiling arrays. Machine-learning-based analysis detected transcripts corresponding to established gene models and revealed novel transcriptionally active regions (TARs) in noncoding domains that comprise at least 10% of the total C. elegans genome. Our results show that about 75% of transcripts with detectable expression are differentially expressed among developmental stages and across cell types. Examination of known tissue- and cell-specific transcripts validates these data sets and suggests that newly identified TARs may exercise cell-specific functions. Additionally, we used self-organizing maps to define groups of coregulated transcripts and applied regulatory element analysis to identify known transcription factor- and miRNA-binding sites, as well as novel motifs that likely function to control subsets of these genes. By using cell-specific, whole-genome profiling strategies, we have detected a large number of novel transcripts and produced high-resolution gene expression maps that provide a basis for establishing the roles of individual genes in cellular differentiation.
秀丽隐杆线虫的基因组已经被完全测序,并且该模型生物的发育解剖学已经在单细胞分辨率下进行了描述。在这里,我们利用利用这种精确定义的结构的策略将基因表达与细胞类型联系起来。我们使用组织特异性启动子从特定细胞和每个发育阶段获得 RNA,以通过 FACS 标记细胞进行分离,或通过 mRNA 标记方法提取 mRNA。然后,我们使用 tiling arrays 生成了 30 多种不同细胞和发育阶段的基因表达谱。基于机器学习的分析检测到与已建立的基因模型相对应的转录本,并在非编码区域中揭示了新的转录活性区域 (TARs),这些区域至少占秀丽隐杆线虫基因组的 10%。我们的结果表明,在发育阶段和细胞类型之间,大约 75%的可检测表达转录本存在差异表达。对已知的组织和细胞特异性转录本的检查验证了这些数据集,并表明新鉴定的 TARs 可能发挥细胞特异性功能。此外,我们使用自组织映射来定义共调控转录本组,并应用调控元件分析来识别已知的转录因子和 miRNA 结合位点,以及可能用于控制这些基因子集的新基序。通过使用细胞特异性、全基因组分析策略,我们已经检测到大量新的转录本,并生成了高分辨率的基因表达图谱,为确定单个基因在细胞分化中的作用提供了基础。