Department of Chemistry, Purdue University, 560 Oval Dr., West Lafayette, IN, 47907, USA.
BMC Genomics. 2020 May 31;21(1):378. doi: 10.1186/s12864-020-6688-8.
Genomic imprinting is a process thereby a subset of genes is expressed in a parent-of-origin specific manner. This evolutionary novelty is restricted to mammals and controlled by genomic DNA segments known as Imprinting Control Regions (ICRs) and germline Differentially Methylated Regions (gDMRs). Previously, I showed that in the mouse genome, the fully characterized ICRs/gDMRs often includes clusters of 2 or more of a set of composite-DNA-elements known as ZFBS-morph overlaps.
Because of the importance of the ICRs to regulating parent-of-origin specific gene expression, I developed a genome-wide strategy for predicting their positions in the human genome. My strategy consists of creating plots to display the density of ZFBS-morph overlaps along the entire chromosomal DNA sequences. In initial evaluations, I found that peaks in these plots pinpointed several of the known ICRs/gDMRs along the DNA in chromosomal bands. I deduced that in density-plots, robust peaks corresponded to actual or candidate ICRs in the DNA. By locating the genes in the vicinity of candidate ICRs, I could discover potential imprinting genes. Additionally, my assessments revealed a connection between several of the potential imprinted genes and human developmental anomalies. Examples include Leber congenital amaurosis 11, Coffin-Siris syndrome, progressive myoclonic epilepsy-10, microcephalic osteodysplastic primordial dwarfism type II, and microphthalmia, cleft lip and palate, and agenesis of the corpus callosum.
With plots displaying the density of ZFBS-morph overlaps, researchers could locate candidate ICRs and imprinted genes. Since the datafiles are available for download and display at the UCSC genome browser, it is possible to examine the plots in the context of Single nucleotide polymorphisms (SNPs) to design experiments to discover novel ICRs and imprinted genes in the human genome.
基因组印迹是一种过程,即一组基因以亲本来源特异性的方式表达。这种进化新事物仅限于哺乳动物,并受称为印迹控制区(ICR)和生殖系差异甲基化区(gDMR)的基因组 DNA 片段控制。以前,我表明在小鼠基因组中,完全表征的 ICR/gDMR 通常包括 2 个或更多一组称为 ZFBS-morph 重叠的复合 DNA 元件的簇。
由于 ICR 对调节亲本来源特异性基因表达的重要性,我开发了一种在人类基因组中预测其位置的全基因组策略。我的策略包括创建图来显示整个染色体 DNA 序列中 ZFBS-morph 重叠的密度。在初步评估中,我发现这些图中的峰值精确地指出了 DNA 中染色体带中的几个已知的 ICR/gDMR。我推断在密度图中,强大的峰值对应于 DNA 中的实际或候选 ICR。通过在候选 ICR 附近定位基因,我可以发现潜在的印迹基因。此外,我的评估揭示了一些潜在的印迹基因与人类发育异常之间的联系。例如,Leber 先天性黑矇 11、Coffin-Siris 综合征、进行性肌阵挛性癫痫 10、小头骨发育不良原基型 II 和小头症、唇裂和腭裂以及胼胝体发育不全。
通过显示 ZFBS-morph 重叠密度的图,研究人员可以定位候选 ICR 和印迹基因。由于可下载并在 UCSC 基因组浏览器中显示数据文件,因此可以在单核苷酸多态性 (SNP) 的上下文中检查图,以设计实验在人类基因组中发现新的 ICR 和印迹基因。