Oka Rurika, Zicola Johan, Weber Blaise, Anderson Sarah N, Hodgman Charlie, Gent Jonathan I, Wesselink Jan-Jaap, Springer Nathan M, Hoefsloot Huub C J, Turck Franziska, Stam Maike
Swammerdam Institute for Life Sciences, Universiteit van Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands.
Department Plant Developmental Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Köln, Germany.
Genome Biol. 2017 Jul 21;18(1):137. doi: 10.1186/s13059-017-1273-4.
While most cells in multicellular organisms carry the same genetic information, in each cell type only a subset of genes is being transcribed. Such differentiation in gene expression depends, for a large part, on the activation and repression of regulatory sequences, including transcriptional enhancers. Transcriptional enhancers can be located tens of kilobases from their target genes, but display characteristic chromatin and DNA features, allowing their identification by genome-wide profiling. Here we show that integration of chromatin characteristics can be applied to predict distal enhancer candidates in Zea mays, thereby providing a basis for a better understanding of gene regulation in this important crop plant.
To predict transcriptional enhancers in the crop plant maize (Zea mays L. ssp. mays), we integrated available genome-wide DNA methylation data with newly generated maps for chromatin accessibility and histone 3 lysine 9 acetylation (H3K9ac) enrichment in young seedling and husk tissue. Approximately 1500 intergenic regions, displaying low DNA methylation, high chromatin accessibility and H3K9ac enrichment, were classified as enhancer candidates. Based on their chromatin profiles, candidate sequences can be classified into four subcategories. Tissue-specificity of enhancer candidates is defined based on the tissues in which they are identified and putative target genes are assigned based on tissue-specific expression patterns of flanking genes.
Our method identifies three previously identified distal enhancers in maize, validating the new set of enhancer candidates and enlarging the toolbox for the functional characterization of gene regulation in the highly repetitive maize genome.
虽然多细胞生物中的大多数细胞都携带相同的遗传信息,但在每种细胞类型中只有一部分基因会被转录。这种基因表达的差异在很大程度上取决于调控序列(包括转录增强子)的激活和抑制。转录增强子可以位于距其靶基因数十千碱基处,但具有特征性的染色质和DNA特征,这使得它们能够通过全基因组分析来识别。在这里,我们表明染色质特征的整合可用于预测玉米中的远端增强子候选物,从而为更好地理解这种重要农作物中的基因调控提供基础。
为了预测农作物玉米(Zea mays L. ssp. mays)中的转录增强子,我们将可用的全基因组DNA甲基化数据与新生成的幼苗和果壳组织中染色质可及性和组蛋白3赖氨酸9乙酰化(H3K9ac)富集图谱整合在一起。大约1500个基因间区域,显示出低DNA甲基化、高染色质可及性和H3K9ac富集,被归类为增强子候选物。基于它们的染色质图谱,候选序列可分为四个亚类。增强子候选物的组织特异性是根据它们被鉴定的组织来定义的,推定靶基因是根据侧翼基因的组织特异性表达模式来指定的。
我们的方法在玉米中鉴定出三个先前已鉴定的远端增强子,验证了新的增强子候选物集,并扩大了用于高度重复的玉米基因组中基因调控功能表征的工具箱。