Suppr超能文献

多组DNA甲基化数据的非参数贝叶斯差异分析

Nonparametric Bayes Differential Analysis of Multigroup DNA Methylation Data.

作者信息

Gu Chiyu, Baladandayuthapani Veerabhadran, Guha Subharup

机构信息

Formerly at the University of Missouri. Currently employed at Bayer Crop Science, 700 Chesterfield Pkwy W, Chesterfield, MO 63017.

Department of Biostatistics, University of Michigan.

出版信息

Bayesian Anal. 2025 Jun;20(2):489-518. doi: 10.1214/23-ba1407. Epub 2023 Nov 23.

Abstract

DNA methylation datasets in cancer studies are comprised of measurements on a large number of genomic locations called cytosine-phosphate-guanine (CpG) sites with complex correlation structures. A fundamental goal of these studies is the development of statistical techniques that can identify disease genomic signatures across multiple patient groups defined by different experimental or biological conditions. We propose , a nonparametric Bayesian approach for differential analysis relying on a novel class of first order mixture models called the Sticky Pitman-Yor process or two-restaurant two-cuisine franchise (2R2CF). The BayesDiff methodology flexibly utilizes information from all CpG sites or biomarker probes, adaptively accommodates any serial dependence due to the widely varying inter-probe distances, and makes posterior inferences about the differential genomic signature of patient groups. Using simulation studies, we demonstrate the effectiveness of the BayesDiff procedure relative to existing statistical techniques for differential DNA methylation. The methodology is applied to analyze a gastrointestinal (GI) cancer dataset exhibiting serial correlation and complex interaction patterns. The results support and complement known aspects of DNA methylation and gene association in upper GI cancers.

摘要

癌症研究中的DNA甲基化数据集由对大量称为胞嘧啶-磷酸-鸟嘌呤(CpG)位点的基因组位置的测量组成,这些位点具有复杂的相关结构。这些研究的一个基本目标是开发统计技术,以识别由不同实验或生物学条件定义的多个患者群体中的疾病基因组特征。我们提出了一种非参数贝叶斯方法用于差异分析,该方法依赖于一类称为粘性皮特曼-约尔过程或两餐厅两菜系特许经营(2R2CF)的新型一阶混合模型。贝叶斯差异分析方法(BayesDiff)灵活地利用来自所有CpG位点或生物标志物探针的信息,自适应地适应由于探针间距离差异很大而产生的任何序列依赖性,并对患者群体的差异基因组特征进行后验推断。通过模拟研究,我们证明了BayesDiff程序相对于现有的差异DNA甲基化统计技术的有效性。该方法被应用于分析一个表现出序列相关性和复杂相互作用模式的胃肠道(GI)癌症数据集。结果支持并补充了上消化道癌症中DNA甲基化和基因关联的已知方面。

相似文献

4
Robustifying Bayesian nonparametric mixtures for count data.增强用于计数数据的贝叶斯非参数混合模型
Biometrics. 2017 Mar;73(1):174-184. doi: 10.1111/biom.12538. Epub 2016 Apr 28.
6
Epigenetics, heritability and longitudinal analysis.表观遗传学、遗传力与纵向分析。
BMC Genet. 2018 Sep 17;19(Suppl 1):77. doi: 10.1186/s12863-018-0648-1.
8
Pan-cancer analysis of differential DNA methylation patterns.泛癌症分析中差异 DNA 甲基化模式。
BMC Med Genomics. 2020 Oct 22;13(Suppl 10):154. doi: 10.1186/s12920-020-00780-3.

引用本文的文献

1
A clustering approach to integrative analyses of multiomic cancer data.一种用于多组学癌症数据综合分析的聚类方法。
J Appl Stat. 2024 Nov 29;52(8):1539-1560. doi: 10.1080/02664763.2024.2431742. eCollection 2025.

本文引用的文献

4
Cancer Statistics, 2017.《2017 年癌症统计》
CA Cancer J Clin. 2017 Jan;67(1):7-30. doi: 10.3322/caac.21387. Epub 2017 Jan 5.
5
Toward a Shared Vision for Cancer Genomic Data.迈向癌症基因组数据的共同愿景。
N Engl J Med. 2016 Sep 22;375(12):1109-12. doi: 10.1056/NEJMp1607591.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验