重计数甲基化可实现对公共血液DNA甲基化阵列数据的灵活分析。

recountmethylation enables flexible analysis of public blood DNA methylation array data.

作者信息

Maden Sean K, Walsh Brian, Ellrott Kyle, Hansen Kasper D, Thompson Reid F, Nellore Abhinav

机构信息

Computational Biology Program, Oregon Health & Science University, Portland, OR 97239, USA.

Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR 97239, USA.

出版信息

Bioinform Adv. 2023 Feb 20;3(1):vbad020. doi: 10.1093/bioadv/vbad020. eCollection 2023.

DOI:10.1093/bioadv/vbad020

PMID:36874953

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9976962/

Abstract

SUMMARY

Thousands of DNA methylation (DNAm) array samples from human blood are publicly available on the Gene Expression Omnibus (GEO), but they remain underutilized for experiment planning, replication and cross-study and cross-platform analyses. To facilitate these tasks, we augmented our recountmethylation R/Bioconductor package with 12 537 uniformly processed EPIC and HM450K blood samples on GEO as well as several new features. We subsequently used our updated package in several illustrative analyses, finding (i) study ID bias adjustment increased variation explained by biological and demographic variables, (ii) most variation in autosomal DNAm was explained by genetic ancestry and CD4+ T-cell fractions and (iii) the dependence of power to detect differential methylation on sample size was similar for each of peripheral blood mononuclear cells (PBMC), whole blood and umbilical cord blood. Finally, we used PBMC and whole blood to perform independent validations, and we recovered 38-46% of differentially methylated probes between sexes from two previously published epigenome-wide association studies.

AVAILABILITY AND IMPLEMENTATION

Source code to reproduce the main results are available on GitHub (repo: recountmethylation_flexible-blood-analysis_manuscript; url: https://github.com/metamaden/recountmethylation_flexible-blood-analysis_manuscript). All data was publicly available and downloaded from the Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/). Compilations of the analyzed public data can be accessed from the website recount.bio/data (preprocessed HM450K array data: https://recount.bio/data/remethdb_h5se-gm_epic_0-0-2_1589820348/; preprocessed EPIC array data: https://recount.bio/data/remethdb_h5se-gm_epic_0-0-2_1589820348/).

SUPPLEMENTARY INFORMATION

Supplementary data are available at online.

摘要

来自人类血液的数千个DNA甲基化（DNAm）阵列样本在基因表达综合数据库（GEO）上是公开可用的，但它们在实验规划、复制以及跨研究和跨平台分析中仍未得到充分利用。为便于开展这些任务，我们用GEO上12537个经过统一处理的EPIC和HM450K血液样本以及若干新功能增强了我们的recountmethylation R/Bioconductor软件包。随后，我们在若干说明性分析中使用了更新后的软件包，发现（i）研究ID偏差调整增加了由生物学和人口统计学变量解释的变异，（ii）常染色体DNAm中的大部分变异由遗传血统和CD4 + T细胞比例解释，并且（iii）检测差异甲基化的功效对样本量的依赖性在每个外周血单核细胞（PBMC）、全血和脐带血中相似。最后，我们使用PBMC和全血进行独立验证，并且我们从两项先前发表的全表观基因组关联研究中找回了38 - 46%的性别间差异甲基化探针。

可用性与实现

重现主要结果的源代码可在GitHub上获取（仓库：recountmethylation_flexible - blood - analysis_manuscript；网址：https://github.com/metamaden/recountmethylation_flexible - blood - analysis_manuscript）。所有数据均公开可用并从基因表达综合数据库（https://www.ncbi.nlm.nih.gov/geo/）下载。分析的公共数据汇编可从网站recount.bio/data访问（预处理的HM450K阵列数据：https://recount.bio/data/remethdb_h5se - gm_epic_0 - 0 - 2_1589820348/；预处理的EPIC阵列数据：https://recount.bio/data/remethdb_h5se - gm_epic_0 - 0 - 2_1589820348/）。

补充信息

补充数据可在网上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f430/9976962/b1198528e439/vbad020f1.jpg

相似文献

recountmethylation enables flexible analysis of public blood DNA methylation array data.重计数甲基化可实现对公共血液DNA甲基化阵列数据的灵活分析。

Bioinform Adv. 2023 Feb 20;3(1):vbad020. doi: 10.1093/bioadv/vbad020. eCollection 2023.

Human methylome variation across Infinium 450K data on the Gene Expression Omnibus.基于基因表达综合数据库中Infinium 450K数据的人类甲基化组变异情况

NAR Genom Bioinform. 2021 Apr 22;3(2):lqab025. doi: 10.1093/nargab/lqab025. eCollection 2021 Jun.

pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS).pwrEWAS：用于全基因组关联研究（EWAS）中全面估计功效的用户友好工具。

BMC Bioinformatics. 2019 Apr 29;20(1):218. doi: 10.1186/s12859-019-2804-7.

Systematic evaluation of DNA methylation age estimation with common preprocessing methods and the Infinium MethylationEPIC BeadChip array.采用常见预处理方法和 Infinium MethylationEPIC BeadChip 阵列进行 DNA 甲基化年龄估算的系统评价。

Clin Epigenetics. 2018 Oct 16;10(1):123. doi: 10.1186/s13148-018-0556-2.

Bigmelon: tools for analysing large DNA methylation datasets.Bigmelon：用于分析大型甲基化数据集的工具。

Bioinformatics. 2019 Mar 15;35(6):981-986. doi: 10.1093/bioinformatics/bty713.

ebGSEA: an improved Gene Set Enrichment Analysis method for Epigenome-Wide-Association Studies.ebGSEA：一种改进的用于全基因组关联研究的基因集富集分析方法。

Bioinformatics. 2019 Sep 15;35(18):3514-3516. doi: 10.1093/bioinformatics/btz073.

CoMeBack: DNA methylation array data analysis for co-methylated regions.CoMeBack：共甲基化区域的 DNA 甲基化阵列数据分析。

Bioinformatics. 2020 May 1;36(9):2675-2683. doi: 10.1093/bioinformatics/btaa049.

cnAnalysis450k: an R package for comparative analysis of 450k/EPIC Illumina methylation array derived copy number data.cnAnalysis450k：一个用于对Illumina 450k/EPIC甲基化芯片衍生的拷贝数数据进行比较分析的R软件包。

Bioinformatics. 2017 Aug 1;33(15):2266-2272. doi: 10.1093/bioinformatics/btx156.

Genetic, epigenetic and genomic effects on variation of gene expression among grape varieties.遗传、表观遗传和基因组对不同葡萄品种间基因表达变异性的影响。

Plant J. 2019 Sep;99(5):895-909. doi: 10.1111/tpj.14370. Epub 2019 Jun 7.

methylclock: a Bioconductor package to estimate DNA methylation age.methylclock：一个用于估计 DNA 甲基化年龄的 Bioconductor 软件包。

Bioinformatics. 2021 Jul 19;37(12):1759-1760. doi: 10.1093/bioinformatics/btaa825.

引用本文的文献

Ultrasensitive Amplification-Free Quantification of a Methyl CpG-Rich Cancer Biomarker by Single-Molecule Kinetic Fingerprinting.通过单分子动力学指纹技术对富含甲基化 CpG 的癌症生物标志物进行超灵敏无扩增定量检测。

Anal Chem. 2024 Oct 29;96(43):17209-17216. doi: 10.1021/acs.analchem.4c03002. Epub 2024 Oct 19.

本文引用的文献

Meta-analysis of epigenome-wide association studies in newborns and children show widespread sex differences in blood DNA methylation.对新生儿和儿童的全基因组关联研究的荟萃分析表明，血液 DNA 甲基化存在广泛的性别差异。

Mutat Res Rev Mutat Res. 2022 Jan-Jun;789:108415. doi: 10.1016/j.mrrev.2022.108415. Epub 2022 Mar 14.

Characterising sex differences of autosomal DNA methylation in whole blood using the Illumina EPIC array.使用 Illumina EPIC 阵列描述全血中常染色体 DNA 甲基化的性别差异。

Clin Epigenetics. 2022 May 14;14(1):62. doi: 10.1186/s13148-022-01279-7.

Enhanced cell deconvolution of peripheral blood using DNA methylation for high-resolution immune profiling.利用 DNA 甲基化增强外周血的细胞去卷积，实现高分辨率免疫分析。

Nat Commun. 2022 Feb 9;13(1):761. doi: 10.1038/s41467-021-27864-7.

Cell-Free DNA Methylation as Blood-Based Biomarkers for Pancreatic Adenocarcinoma-A Literature Update.游离DNA甲基化作为胰腺癌基于血液的生物标志物——文献综述

Epigenomes. 2021 Apr 9;5(2):8. doi: 10.3390/epigenomes5020008.

Circulating tumor DNA methylation marker MYO1-G for diagnosis and monitoring of colorectal cancer.循环肿瘤 DNA 甲基化标志物 MYO1-G 用于结直肠癌的诊断和监测。

Clin Epigenetics. 2021 Dec 27;13(1):232. doi: 10.1186/s13148-021-01216-0.

Methylation of FBN1, SPG20, ITF2, RUNX3, SNCA, MLH1, and SEPT9 genes in circulating cell-free DNA as biomarkers of colorectal cancer.循环无细胞 DNA 中 FBN1、SPG20、ITF2、RUNX3、SNCA、MLH1 和 SEPT9 基因甲基化作为结直肠癌的生物标志物。

Cancer Biomark. 2022;34(2):221-250. doi: 10.3233/CBM-210315.

Reproducibility standards for machine learning in the life sciences.生命科学中机器学习的可重复性标准。

Nat Methods. 2021 Oct;18(10):1132-1135. doi: 10.1038/s41592-021-01256-7.

Sustainable data analysis with Snakemake.使用 Snakemake 进行可持续数据分析。

F1000Res. 2021 Jan 18;10:33. doi: 10.12688/f1000research.29032.2. eCollection 2021.

DNA Methylation Patterning and the Regulation of Beta Cell Homeostasis.DNA 甲基化模式与β细胞稳态的调控。

Front Endocrinol (Lausanne). 2021 May 7;12:651258. doi: 10.3389/fendo.2021.651258. eCollection 2021.

Human methylome variation across Infinium 450K data on the Gene Expression Omnibus.基于基因表达综合数据库中Infinium 450K数据的人类甲基化组变异情况

NAR Genom Bioinform. 2021 Apr 22;3(2):lqab025. doi: 10.1093/nargab/lqab025. eCollection 2021 Jun.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

重计数甲基化可实现对公共血液DNA甲基化阵列数据的灵活分析。

recountmethylation enables flexible analysis of public blood DNA methylation array data.

作者信息

机构信息

出版信息

SUMMARY

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

摘要

可用性与实现

补充信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献