Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy.
Istituto di Genomica Applicata, Udine, I-33100, Italy.
Plant J. 2021 Sep;107(6):1631-1647. doi: 10.1111/tpj.15404. Epub 2021 Aug 6.
Vitis vinifera is an economically important crop and a useful model in which to study chromatin dynamics. In contrast to the small and relatively simple genome of Arabidopsis thaliana, grapevine contains a complex genome of 487 Mb that exhibits extensive colonization by transposable elements. We used Hi-C, ChIP-seq and ATAC-seq to measure how chromatin features correlate to the expression of 31 845 grapevine genes. ATAC-seq revealed the presence of more than 16 000 open chromatin regions, of which we characterize nearly 5000 as possible distal enhancer candidates that occur in intergenic space > 2 kb from the nearest transcription start site (TSS). A motif search identified more than 480 transcription factor (TF) binding sites in these regions, with those for TCP family proteins in greatest abundance. These open chromatin regions are typically within 15 kb from their nearest promoter, and a gene ontology analysis indicated that their nearest genes are significantly enriched for TF activity. The presence of a candidate cis-regulatory element (cCRE) > 2 kb upstream of the TSS, location in the active nuclear compartment as determined by Hi-C, and the enrichment of H3K4me3, H3K4me1 and H3K27ac at the gene are correlated with gene expression. Taken together, these results suggest that regions of intergenic open chromatin identified by ATAC-seq can be considered potential candidates for cis-regulatory regions in V. vinifera. Our findings enhance the characterization of a valuable agricultural crop, and help to clarify the understanding of unique plant biology.
葡萄是一种经济上重要的作物,也是研究染色质动态的有用模式生物。与拟南芥这种基因组小且相对简单的物种相比,葡萄拥有 487Mb 的复杂基因组,其中广泛存在转座元件的定殖。我们使用 Hi-C、ChIP-seq 和 ATAC-seq 来测量 31845 个葡萄基因的表达与染色质特征的相关性。ATAC-seq 揭示了超过 16000 个开放染色质区域的存在,其中我们将近 5000 个特征化为可能的远端增强子候选者,这些候选者位于距离最近的转录起始位点(TSS)>2kb 的基因间区。 motif 搜索在这些区域中发现了超过 480 个转录因子(TF)结合位点,其中 TCP 家族蛋白的数量最多。这些开放染色质区域通常距离其最近的启动子 15kb 以内,基因本体论分析表明,其最近的基因显著富集了 TF 活性。在 TSS 上游>2kb 的位置存在候选顺式调控元件(cCRE)、在 Hi-C 中确定的活跃核区的位置,以及 H3K4me3、H3K4me1 和 H3K27ac 在基因上的富集都与基因表达相关。总的来说,这些结果表明,ATAC-seq 鉴定的基因间开放染色质区域可以被认为是 V.vinifera 中顺式调控区域的潜在候选者。我们的研究结果增强了对有价值的农业作物的特征描述,并有助于阐明对独特植物生物学的理解。