• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过跨维度马尔可夫链蒙特卡罗和隐马尔可夫模型揭示癌症表观遗传学的改变。

Uncovering Alterations in Cancer Epigenetics via Trans-Dimensional Markov Chain Monte Carlo and Hidden Markov Models.

作者信息

Shokoohi Farhad, Khaniki Saeedeh Hajebi

机构信息

Department of Mathematical Sciences, University of Nevada-Las Vegas, Las Vega, NV 89154, USA.

Department of Biostatistics, School of Health, Mashhad University of Medical Sciences, Mashhad, Iran.

出版信息

bioRxiv. 2023 Jun 15:2023.06.15.545168. doi: 10.1101/2023.06.15.545168.

DOI:10.1101/2023.06.15.545168
PMID:37398181
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10312753/
Abstract

Epigenetic alterations are key drivers in the development and progression of cancer. Identifying differentially methylated cytosines (DMCs) in cancer samples is a crucial step toward understanding these changes. In this paper, we propose a trans-dimensional Markov chain Monte Carlo (TMCMC) approach that uses hidden Markov models (HMMs) with binomial emission, and bisulfite sequencing (BS-Seq) data, called DMCTHM, to identify DMCs in cancer epigenetic studies. We introduce the Expander-Collider penalty to tackle under and over-estimation in TMCMC-HMMs. We address all known challenges inherent in BS-Seq data by introducing novel approaches for capturing functional patterns and autocorrelation structure of the data, as well as for handling missing values, multiple covariates, multiple comparisons, and family-wise errors. We demonstrate the effectiveness of DMCTHM through comprehensive simulation studies. The results show that our proposed method outperforms other competing methods in identifying DMCs. Notably, with DMCTHM, we uncovered new DMCs and genes in Colorectal cancer that were significantly enriched in the Tp53 pathway.

摘要

表观遗传改变是癌症发生和发展的关键驱动因素。识别癌症样本中差异甲基化的胞嘧啶(DMC)是理解这些变化的关键一步。在本文中,我们提出了一种跨维度马尔可夫链蒙特卡罗(TMCMC)方法,该方法使用具有二项式发射的隐马尔可夫模型(HMM)和亚硫酸氢盐测序(BS-Seq)数据(称为DMCTHM)来识别癌症表观遗传研究中的DMC。我们引入扩展器-碰撞器惩罚来解决TMCMC-HMM中的估计不足和过度估计问题。我们通过引入新颖的方法来捕捉数据的功能模式和自相关结构,以及处理缺失值、多个协变量、多重比较和家族性错误,来解决BS-Seq数据中固有的所有已知挑战。我们通过全面的模拟研究证明了DMCTHM的有效性。结果表明,我们提出的方法在识别DMC方面优于其他竞争方法。值得注意的是,通过DMCTHM,我们在结直肠癌中发现了新的DMC和基因,这些基因在Tp53通路中显著富集。

相似文献

1
Uncovering Alterations in Cancer Epigenetics via Trans-Dimensional Markov Chain Monte Carlo and Hidden Markov Models.通过跨维度马尔可夫链蒙特卡罗和隐马尔可夫模型揭示癌症表观遗传学的改变。
bioRxiv. 2023 Jun 15:2023.06.15.545168. doi: 10.1101/2023.06.15.545168.
2
A hidden markov model for identifying differentially methylated sites in bisulfite sequencing data.一种用于识别亚硫酸氢盐测序数据中差异甲基化位点的隐马尔可夫模型。
Biometrics. 2019 Mar;75(1):210-221. doi: 10.1111/biom.12965. Epub 2018 Oct 9.
3
LuxHMM: DNA methylation analysis with genome segmentation via hidden Markov model.LuxHMM:基于隐马尔可夫模型的基因组分割的 DNA 甲基化分析。
BMC Bioinformatics. 2023 Feb 22;24(1):58. doi: 10.1186/s12859-023-05174-7.
4
A Bayesian hidden Markov model for detecting differentially methylated regions.一种用于检测差异甲基化区域的贝叶斯隐马尔可夫模型。
Biometrics. 2019 Jun;75(2):663-673. doi: 10.1111/biom.13000. Epub 2019 Mar 29.
5
Bayesian restoration of a hidden Markov chain with applications to DNA sequencing.应用于DNA测序的隐马尔可夫链的贝叶斯恢复
J Comput Biol. 1999 Summer;6(2):261-77. doi: 10.1089/cmb.1999.6.261.
6
Principal component analysis- and tensor decomposition-based unsupervised feature extraction to select more suitable differentially methylated cytosines: Optimization of standard deviation versus state-of-the-art methods.基于主成分分析和张量分解的无监督特征提取,以选择更合适的差异甲基化胞嘧啶:标准偏差与最先进方法的优化。
Genomics. 2023 Mar;115(2):110577. doi: 10.1016/j.ygeno.2023.110577. Epub 2023 Feb 15.
7
Bayesian adaptive group lasso with semiparametric hidden Markov models.贝叶斯自适应分组 lasso 与半参数隐马尔可夫模型。
Stat Med. 2019 Apr 30;38(9):1634-1650. doi: 10.1002/sim.8051. Epub 2018 Nov 28.
8
Detection of differentially methylated regions from bisulfite-seq data by hidden Markov models incorporating genome-wide methylation level distributions.通过结合全基因组甲基化水平分布的隐马尔可夫模型从亚硫酸氢盐测序数据中检测差异甲基化区域。
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S3. doi: 10.1186/1471-2164-16-S12-S3. Epub 2015 Dec 9.
9
Identification of Differentially Methylated Sites with Weak Methylation Effects.具有微弱甲基化效应的差异甲基化位点的鉴定
Genes (Basel). 2018 Feb 8;9(2):75. doi: 10.3390/genes9020075.
10
A Bayesian hierarchical model for analyzing methylated RNA immunoprecipitation sequencing data.一种用于分析甲基化RNA免疫沉淀测序数据的贝叶斯分层模型。
Quant Biol. 2018 Sep;6(3):275-286. doi: 10.1007/s40484-018-0149-2. Epub 2018 Aug 30.

本文引用的文献

1
Identifying Differential Methylation in Cancer Epigenetics via a Bayesian Functional Regression Model.通过贝叶斯功能回归模型识别癌症表观遗传学中的差异甲基化。
Biomolecules. 2024 May 29;14(6):639. doi: 10.3390/biom14060639.
2
FUNCTION-ON-FUNCTION REGRESSION FOR THE IDENTIFICATION OF EPIGENETIC REGIONS EXHIBITING WINDOWS OF SUSCEPTIBILITY TO ENVIRONMENTAL EXPOSURES.用于识别对环境暴露具有易感性窗口的表观遗传区域的函数对函数回归法
Ann Appl Stat. 2021 Sep;15(3):1366-1385. doi: 10.1214/20-aoas1425. Epub 2021 Sep 23.
3
Expression characteristics of long non-coding RNA in colon adenocarcinoma and its potential value for judging the survival and prognosis of patients: bioinformatics analysis based on The Cancer Genome Atlas database.
长链非编码RNA在结肠腺癌中的表达特征及其对患者生存和预后判断的潜在价值:基于癌症基因组图谱数据库的生物信息学分析
J Gastrointest Oncol. 2022 Jun;13(3):1178-1187. doi: 10.21037/jgo-22-384.
4
DNA methylation-based diagnostic, prognostic, and predictive biomarkers in colorectal cancer.基于 DNA 甲基化的结直肠癌诊断、预后和预测生物标志物。
Biochim Biophys Acta Rev Cancer. 2022 May;1877(3):188722. doi: 10.1016/j.bbcan.2022.188722. Epub 2022 Mar 17.
5
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.《全球癌症统计数据 2020:全球 185 个国家和地区 36 种癌症的发病率和死亡率估计》。
CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.
6
Detect differentially methylated regions using non-homogeneous hidden Markov model for bisulfite sequencing data.使用非齐次隐马尔可夫模型检测亚硫酸氢盐测序数据中的差异甲基化区域。
Methods. 2021 May;189:34-43. doi: 10.1016/j.ymeth.2020.09.009. Epub 2020 Sep 17.
7
Integration analysis of long non-coding RNA (lncRNA) role in tumorigenesis of colon adenocarcinoma.长链非编码 RNA(lncRNA)在结肠腺癌发生中的作用的综合分析。
BMC Med Genomics. 2020 Jul 29;13(1):108. doi: 10.1186/s12920-020-00757-2.
8
A comprehensive evaluation of computational tools to identify differential methylation regions using RRBS data.基于 RRBS 数据的差异甲基化区域识别的计算工具的综合评估。
Genomics. 2020 Nov;112(6):4567-4576. doi: 10.1016/j.ygeno.2020.07.032. Epub 2020 Jul 24.
9
Identification of cell type-specific methylation signals in bulk whole genome bisulfite sequencing data.批量全基因组亚硫酸氢盐测序数据中细胞类型特异性甲基化信号的鉴定。
Genome Biol. 2020 Jul 1;21(1):156. doi: 10.1186/s13059-020-02065-5.
10
Role of long non-coding RNA TP73-AS1 in cancer.长链非编码 RNA TP73-AS1 在癌症中的作用。
Biosci Rep. 2019 Oct 30;39(10). doi: 10.1042/BSR20192274.