Department of Physics, Chuo University, 1-13-27, Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan.
Department of Computer Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
Genomics. 2023 Mar;115(2):110577. doi: 10.1016/j.ygeno.2023.110577. Epub 2023 Feb 15.
In contrast to RNA-seq analysis, which has various standard methods, no standard methods for identifying differentially methylated cytosines (DMCs) exist. To identify DMCs, we tested principal component analysis and tensor decomposition-based unsupervised feature extraction with optimized standard deviation, which has been shown to be effective for differentially expressed gene (DEG) identification. The proposed method outperformed certain conventional methods, including those that assume beta-binomial distribution for methylation as the proposed method does not require this, especially when applied to methylation profiles measured using high throughput sequencing. DMCs identified by the proposed method also significantly overlapped with various functional sites, including known differentially methylated regions, enhancers, and DNase I hypersensitive sites. The proposed method was applied to data sets retrieved from The Cancer Genome Atlas to identify DMCs using American Joint Committee on Cancer staging system edition labels. This suggests that the proposed method is a promising standard method for identifying DMCs.
与具有各种标准方法的 RNA-seq 分析不同,目前尚无用于鉴定差异甲基化胞嘧啶 (DMC) 的标准方法。为了鉴定 DMC,我们测试了基于主成分分析和张量分解的无监督特征提取,并优化了标准偏差,事实证明,该方法对于鉴定差异表达基因 (DEG) 非常有效。与某些传统方法相比,该方法表现更优,包括那些假设甲基化的 beta-二项式分布的方法,因为该方法不需要这样做,特别是当应用于使用高通量测序测量的甲基化谱时。该方法鉴定的 DMC 还与各种功能位点显著重叠,包括已知的差异甲基化区域、增强子和 DNase I 超敏位点。该方法还应用于从癌症基因组图谱中检索的数据集中,使用美国癌症联合委员会分期系统版本标签来识别 DMC。这表明该方法是一种很有前途的鉴定 DMC 的标准方法。