Liu Hongbo, Liu Xiaojuan, Zhang Shumei, Lv Jie, Li Song, Shang Shipeng, Jia Shanshan, Wei Yanjun, Wang Fang, Su Jianzhong, Wu Qiong, Zhang Yan
College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
Department of Rehabilitation, the First Affiliated Hospital of Harbin Medical University, Harbin 150001, China.
Nucleic Acids Res. 2016 Jan 8;44(1):75-94. doi: 10.1093/nar/gkv1332. Epub 2015 Dec 3.
DNA methylation is a key epigenetic mark that is critical for gene regulation in multicellular eukaryotes. Although various human cell types may have the same genome, these cells have different methylomes. The systematic identification and characterization of methylation marks across cell types are crucial to understand the complex regulatory network for cell fate determination. In this study, we proposed an entropy-based framework termed SMART to integrate the whole genome bisulfite sequencing methylomes across 42 human tissues/cells and identified 757 887 genome segments. Nearly 75% of the segments showed uniform methylation across all cell types. From the remaining 25% of the segments, we identified cell type-specific hypo/hypermethylation marks that were specifically hypo/hypermethylated in a minority of cell types using a statistical approach and presented an atlas of the human methylation marks. Further analysis revealed that the cell type-specific hypomethylation marks were enriched through H3K27ac and transcription factor binding sites in cell type-specific manner. In particular, we observed that the cell type-specific hypomethylation marks are associated with the cell type-specific super-enhancers that drive the expression of cell identity genes. This framework provides a complementary, functional annotation of the human genome and helps to elucidate the critical features and functions of cell type-specific hypomethylation.
DNA甲基化是一种关键的表观遗传标记,对多细胞真核生物中的基因调控至关重要。尽管各种人类细胞类型可能具有相同的基因组,但这些细胞具有不同的甲基化组。跨细胞类型对甲基化标记进行系统的识别和表征对于理解细胞命运决定的复杂调控网络至关重要。在本研究中,我们提出了一种基于熵的框架,称为SMART,以整合42种人类组织/细胞的全基因组亚硫酸氢盐测序甲基化组,并识别出757887个基因组片段。近75%的片段在所有细胞类型中显示出均匀的甲基化。从其余25%的片段中,我们使用统计方法识别出在少数细胞类型中特异性低甲基化/高甲基化的细胞类型特异性低/高甲基化标记,并展示了人类甲基化标记图谱。进一步分析表明,细胞类型特异性低甲基化标记以细胞类型特异性方式通过H3K27ac和转录因子结合位点富集。特别是,我们观察到细胞类型特异性低甲基化标记与驱动细胞身份基因表达的细胞类型特异性超级增强子相关。该框架为人类基因组提供了一种互补的功能注释,并有助于阐明细胞类型特异性低甲基化的关键特征和功能。