Sun Shuying, Lee Yu Ri, Enfield Brittany
Department of Mathematics, Texas State University, San Marcos, TX, USA.
Global Engineering Systems, Cypress Semiconductor, Austin, TX, USA.
Cancer Inform. 2019 Aug 29;18:1176935119872959. doi: 10.1177/1176935119872959. eCollection 2019.
DNA methylation is an epigenetic event that involves adding a methyl group to the cytosine (C) site, especially the one that pairs with a guanine (G) site (ie, CG or CpG site), in a human genome. This event plays an important role in both cancerous and normal cell development. Previous studies often assume symmetric methylation on both DNA strands. However, asymmetric methylation, or hemimethylation (methylation that occurs only on 1 DNA strand), does exist and has been reported in several studies. Due to the limitation of previous DNA methylation sequencing technologies, researchers could only study hemimethylation on specific genes, but the overall genomic hemimethylation landscape remains relatively unexplored. With the development of advanced next-generation sequencing techniques, it is now possible to measure methylation levels on both forward and reverse strands at all CpG sites in an entire genome. Analyzing hemimethylation patterns may potentially reveal regions related to undergoing tumor growth. For our research, we first identify hemimethylated CpG sites in breast cancer cell lines using Wilcoxon signed rank tests. We then identify hemimethylation patterns by grouping consecutive hemimethylated CpG sites based on their methylation states, methylation "M" or unmethylation "U." These patterns include regular (or consecutive) hemimethylation clusters (eg, "MMM" on one strand and "UUU" on another strand) and polarity (or reverse) clusters (eg, "MU" on one strand and "UM" on another strand). Our results reveal that most hemimethylation clusters are the polarity type, and hemimethylation does occur across the entire genome with notably higher numbers in the breast cancer cell lines. The lengths or sizes of most hemimethylation clusters are very short, often less than 50 base pairs. After mapping hemimethylation clusters and sites to corresponding genes, we study the functions of these genes and find that several of the highly hemimethylated genes may influence tumor growth or suppression. These genes may also indicate a progressing transition to a new tumor stage.
DNA甲基化是一种表观遗传事件,涉及在人类基因组中胞嘧啶(C)位点,特别是与鸟嘌呤(G)位点配对的位点(即CG或CpG位点)上添加甲基基团。这一事件在癌细胞和正常细胞发育中都起着重要作用。以往的研究通常假设DNA两条链上的甲基化是对称的。然而,不对称甲基化,即半甲基化(仅发生在一条DNA链上的甲基化)确实存在,并且在一些研究中已有报道。由于以往DNA甲基化测序技术的局限性,研究人员只能研究特定基因上的半甲基化,但整个基因组的整体半甲基化情况仍相对未被探索。随着先进的下一代测序技术的发展,现在有可能在整个基因组的所有CpG位点上测量正向和反向链上的甲基化水平。分析半甲基化模式可能潜在地揭示与肿瘤生长相关的区域。在我们的研究中,我们首先使用Wilcoxon符号秩检验在乳腺癌细胞系中鉴定半甲基化的CpG位点。然后,我们根据甲基化状态(甲基化“M”或未甲基化“U)对连续的半甲基化CpG位点进行分组,从而确定半甲基化模式。这些模式包括规则(或连续)半甲基化簇(例如,一条链上为“MMM”,另一条链上为“UUU”)和极性(或反向)簇(例如,一条链上为“MU”,另一条链上为“UM”)。我们的结果表明,大多数半甲基化簇是极性类型,并且整个基因组中确实存在半甲基化,在乳腺癌细胞系中的数量明显更高。大多数半甲基化簇的长度或大小非常短,通常小于50个碱基对。在将半甲基化簇和位点映射到相应基因后,我们研究这些基因的功能,发现一些高度半甲基化的基因可能影响肿瘤生长或抑制。这些基因也可能表明向新肿瘤阶段的进展性转变。