Suppr超能文献

可靠检测随机表观遗传突变及其与心血管衰老的关联。

Reliable detection of stochastic epigenetic mutations and associations with cardiovascular aging.

机构信息

Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA.

Altos Labs, San Diego Institute of Sciences, San Diego, CA, USA.

出版信息

Geroscience. 2024 Dec;46(6):5745-5765. doi: 10.1007/s11357-024-01191-3. Epub 2024 May 13.

Abstract

Stochastic epigenetic mutations (SEMs) have been proposed as novel aging biomarkers to capture heterogeneity in age-related DNA methylation changes. SEMs are defined as outlier methylation patterns at cytosine-guanine dinucleotide sites, categorized as hypermethylated (hyperSEM) or hypomethylated (hypoSEM) relative to a reference. Because SEMs are defined by their outlier status, it is critical to differentiate extreme values due to technical noise or data artifacts from those due to real biology. Using technical replicate data, we found SEM detection is not reliable: across 3 datasets, 24 to 39% of hypoSEM and 46 to 67% of hyperSEM are not shared between replicates. We identified factors influencing SEM reliability-including blood cell type composition, probe beta-value statistics, genomic location, and presence of SNPs. We used these factors in a training dataset to build a machine learning-based filter that removes unreliable SEMs, and found this filter enhances reliability in two independent validation datasets. We assessed associations between SEM loads and aging phenotypes in the Framingham Heart Study and discovered that associations with aging outcomes were in large part driven by hypoSEMs at baseline methylated probes and hyperSEMs at baseline unmethylated probes, which are the same subsets that demonstrate highest technical reliability. These aging associations were preserved after filtering out unreliable SEMs and were enhanced after adjusting for blood cell composition. Finally, we utilized these insights to formulate best practices for SEM detection and introduce a novel R package, SEMdetectR, which uses parallel programming for efficient SEM detection with comprehensive options for detection, filtering, and analysis.

摘要

随机表观遗传突变(SEM)被提议作为新的衰老生物标志物,以捕捉与年龄相关的 DNA 甲基化变化中的异质性。SEM 被定义为胞嘧啶-鸟嘌呤二核苷酸位点的异常甲基化模式,相对于参考值,分为超甲基化(hyperSEM)或低甲基化(hypoSEM)。由于 SEM 是通过其异常状态定义的,因此区分由于技术噪声或数据伪影导致的极值与由于真实生物学导致的极值至关重要。使用技术重复数据,我们发现 SEM 检测不可靠:在 3 个数据集之间,24%至 39%的 hypoSEM 和 46%至 67%的 hyperSEM 不在重复之间共享。我们确定了影响 SEM 可靠性的因素,包括血细胞类型组成、探针β值统计、基因组位置和 SNP 的存在。我们在训练数据集中使用这些因素构建了基于机器学习的过滤器,以去除不可靠的 SEM,并在两个独立的验证数据集中发现该过滤器提高了可靠性。我们评估了 SEM 负荷与弗雷明汉心脏研究中衰老表型之间的关联,并发现与衰老结果的关联在很大程度上是由基线甲基化探针的 hypoSEM 和基线未甲基化探针的 hyperSEM 驱动的,这与表现出最高技术可靠性的相同子集相同。在过滤掉不可靠的 SEM 后,这些衰老关联得以保留,在调整血细胞组成后,关联得到增强。最后,我们利用这些见解制定了 SEM 检测的最佳实践,并引入了一个新的 R 包 SEMdetectR,该包使用并行编程来实现高效的 SEM 检测,并提供全面的检测、过滤和分析选项。

相似文献

本文引用的文献

2
Making sense of the ageing methylome.解读衰老甲基化组。
Nat Rev Genet. 2022 Oct;23(10):585-605. doi: 10.1038/s41576-022-00477-6. Epub 2022 May 2.
3
Somatic mutation rates scale with lifespan across mammals.哺乳动物的体细胞突变率与寿命成正比。
Nature. 2022 Apr;604(7906):517-524. doi: 10.1038/s41586-022-04618-z. Epub 2022 Apr 13.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验