Department of Mathematics, Texas State University, San Marcos, TX, USA.
Department of Statistics, Texas A&M University, College Station, TX, USA.
BMC Cancer. 2021 Mar 12;21(1):268. doi: 10.1186/s12885-021-07990-7.
DNA methylation is an epigenetic event involving the addition of a methyl-group to a cytosine-guanine base pair (i.e., CpG site). It is associated with different cancers. Our research focuses on studying non-small cell lung cancer hemimethylation, which refers to methylation occurring on only one of the two DNA strands. Many studies often assume that methylation occurs on both DNA strands at a CpG site. However, recent publications show the existence of hemimethylation and its significant impact. Therefore, it is important to identify cancer hemimethylation patterns.
In this paper, we use the Wilcoxon signed rank test to identify hemimethylated CpG sites based on publicly available non-small cell lung cancer methylation sequencing data. We then identify two types of hemimethylated CpG clusters, regular and polarity clusters, and genes with large numbers of hemimethylated sites. Highly hemimethylated genes are then studied for their biological interactions using available bioinformatics tools.
In this paper, we have conducted the first-ever investigation of hemimethylation in lung cancer. Our results show that hemimethylation does exist in lung cells either as singletons or clusters. Most clusters contain only two or three CpG sites. Polarity clusters are much shorter than regular clusters and appear less frequently. The majority of clusters found in tumor samples have no overlap with clusters found in normal samples, and vice versa. Several genes that are known to be associated with cancer are hemimethylated differently between the cancerous and normal samples. Furthermore, highly hemimethylated genes exhibit many different interactions with other genes that may be associated with cancer. Hemimethylation has diverse patterns and frequencies that are comparable between normal and tumorous cells. Therefore, hemimethylation may be related to both normal and tumor cell development.
Our research has identified CpG clusters and genes that are hemimethylated in normal and lung tumor samples. Due to the potential impact of hemimethylation on gene expression and cell function, these clusters and genes may be important to advance our understanding of the development and progression of non-small cell lung cancer.
DNA 甲基化是一种涉及向胞嘧啶-鸟嘌呤碱基对(即 CpG 位点)添加甲基基团的表观遗传事件。它与不同的癌症有关。我们的研究重点是研究非小细胞肺癌半甲基化,即仅在两条 DNA 链中的一条上发生甲基化。许多研究通常假设 CpG 位点的两条 DNA 链上都发生甲基化。然而,最近的出版物表明半甲基化的存在及其重要影响。因此,识别癌症半甲基化模式很重要。
在本文中,我们使用 Wilcoxon 符号秩检验根据公开的非小细胞肺癌甲基化测序数据识别半甲基化的 CpG 位点。然后,我们识别出两种类型的半甲基化 CpG 簇,即规则簇和极性簇,以及具有大量半甲基化位点的基因。然后使用可用的生物信息学工具研究高度半甲基化基因的生物学相互作用。
在本文中,我们首次对肺癌中的半甲基化进行了调查。我们的结果表明,半甲基化确实存在于肺细胞中,无论是作为单一位点还是簇。大多数簇仅包含两个或三个 CpG 位点。极性簇比规则簇短得多,出现的频率也低得多。在肿瘤样本中发现的大多数簇与在正常样本中发现的簇没有重叠,反之亦然。一些已知与癌症相关的基因在癌性和正常样本之间的半甲基化方式不同。此外,高度半甲基化基因与其他可能与癌症相关的基因表现出许多不同的相互作用。半甲基化具有在正常和肿瘤细胞之间相似的不同模式和频率。因此,半甲基化可能与正常和肿瘤细胞的发育都有关。
我们的研究已经确定了在正常和肺肿瘤样本中半甲基化的 CpG 簇和基因。由于半甲基化对基因表达和细胞功能的潜在影响,这些簇和基因可能对我们理解非小细胞肺癌的发生和发展很重要。