• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BoostMe 能够准确预测多种人类组织全基因组亚硫酸氢盐测序中的 DNA 甲基化值。

BoostMe accurately predicts DNA methylation values in whole-genome bisulfite sequencing of multiple human tissues.

机构信息

National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA.

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK.

出版信息

BMC Genomics. 2018 May 23;19(1):390. doi: 10.1186/s12864-018-4766-y.

DOI:10.1186/s12864-018-4766-y
PMID:29792182
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5966887/
Abstract

BACKGROUND

Bisulfite sequencing is widely employed to study the role of DNA methylation in disease; however, the data suffer from biases due to coverage depth variability. Imputation of methylation values at low-coverage sites may mitigate these biases while also identifying important genomic features associated with predictive power.

RESULTS

Here we describe BoostMe, a method for imputing low-quality DNA methylation estimates within whole-genome bisulfite sequencing (WGBS) data. BoostMe uses a gradient boosting algorithm, XGBoost, and leverages information from multiple samples for prediction. We find that BoostMe outperforms existing algorithms in speed and accuracy when applied to WGBS of human tissues. Furthermore, we show that imputation improves concordance between WGBS and the MethylationEPIC array at low WGBS depth, suggesting improved WGBS accuracy after imputation.

CONCLUSIONS

Our findings support the use of BoostMe as a preprocessing step for WGBS analysis.

摘要

背景

亚硫酸氢盐测序被广泛用于研究 DNA 甲基化在疾病中的作用;然而,由于覆盖深度的变化,数据会存在偏差。在低覆盖区域对甲基化值进行插补可以减轻这些偏差,同时还可以识别与预测能力相关的重要基因组特征。

结果

这里我们描述了 BoostMe,一种用于在全基因组亚硫酸氢盐测序(WGBS)数据中插补低质量 DNA 甲基化估计值的方法。BoostMe 使用梯度提升算法 XGBoost,并利用来自多个样本的信息进行预测。我们发现,在应用于人体组织的 WGBS 时,BoostMe 在速度和准确性方面优于现有的算法。此外,我们表明,在 WGBS 深度较低时,插补可以提高 WGBS 与 MethylationEPIC 阵列之间的一致性,这表明插补后 WGBS 的准确性有所提高。

结论

我们的研究结果支持将 BoostMe 用作 WGBS 分析的预处理步骤。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/a76c2ea0e4bc/12864_2018_4766_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/3757aebed097/12864_2018_4766_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/13187cfe6142/12864_2018_4766_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/21b84cc6ef1b/12864_2018_4766_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/77b204874a93/12864_2018_4766_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/d4aa9c720741/12864_2018_4766_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/a76c2ea0e4bc/12864_2018_4766_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/3757aebed097/12864_2018_4766_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/13187cfe6142/12864_2018_4766_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/21b84cc6ef1b/12864_2018_4766_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/77b204874a93/12864_2018_4766_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/d4aa9c720741/12864_2018_4766_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac96/5966887/a76c2ea0e4bc/12864_2018_4766_Fig6_HTML.jpg

相似文献

1
BoostMe accurately predicts DNA methylation values in whole-genome bisulfite sequencing of multiple human tissues.BoostMe 能够准确预测多种人类组织全基因组亚硫酸氢盐测序中的 DNA 甲基化值。
BMC Genomics. 2018 May 23;19(1):390. doi: 10.1186/s12864-018-4766-y.
2
An integrative approach for efficient analysis of whole genome bisulfite sequencing data.一种用于全基因组亚硫酸氢盐测序数据高效分析的综合方法。
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S14. doi: 10.1186/1471-2164-16-S12-S14. Epub 2015 Dec 9.
3
BCREval: a computational method to estimate the bisulfite conversion ratio in WGBS.BCREval:一种估算 WGBS 中亚硫酸氢盐转化率的计算方法。
BMC Bioinformatics. 2020 Jan 31;21(1):38. doi: 10.1186/s12859-019-3334-z.
4
Whole-Genome Bisulfite Sequencing for the Methylation Analysis of Insect Genomes.用于昆虫基因组甲基化分析的全基因组亚硫酸氢盐测序
Methods Mol Biol. 2019;1858:141-156. doi: 10.1007/978-1-4939-8775-7_11.
5
Estimating Global Methylation and Erasure Using Low-Coverage Whole-Genome Bisulfite Sequencing (WGBS ).使用低覆盖度全基因组亚硫酸氢盐测序(WGBS)估计全球甲基化和擦除。
Methods Mol Biol. 2021;2272:29-44. doi: 10.1007/978-1-0716-1294-1_3.
6
Resolution of the DNA methylation state of single CpG dyads using in silico strand annealing and WGBS data.使用计算机模拟链退火和 WGBS 数据解析单个 CpG 二联体的 DNA 甲基化状态。
Nat Protoc. 2019 Jan;14(1):202-216. doi: 10.1038/s41596-018-0090-x.
7
Guidelines for whole genome bisulphite sequencing of intact and FFPET DNA on the Illumina HiSeq X Ten.Illumina HiSeq X Ten 平台上完整 DNA 和 FFPET DNA 的全基因组亚硫酸氢盐测序指南。
Epigenetics Chromatin. 2018 May 28;11(1):24. doi: 10.1186/s13072-018-0194-0.
8
MethylStar: A fast and robust pre-processing pipeline for bulk or single-cell whole-genome bisulfite sequencing data.MethylStar:一个快速且稳健的用于批量或单细胞全基因组 bisulfite 测序数据的预处理流水线。
BMC Genomics. 2020 Jul 13;21(1):479. doi: 10.1186/s12864-020-06886-3.
9
Whole-Genome Bisulfite Sequencing Using the Ovation® Ultralow Methyl-Seq Protocol.使用Ovation®超微量甲基化测序方案进行全基因组亚硫酸氢盐测序。
Methods Mol Biol. 2018;1708:83-104. doi: 10.1007/978-1-4939-7481-8_5.
10
Evaluation of preprocessing, mapping and postprocessing algorithms for analyzing whole genome bisulfite sequencing data.用于分析全基因组亚硫酸氢盐测序数据的预处理、映射和后处理算法的评估。
Brief Bioinform. 2016 Nov;17(6):938-952. doi: 10.1093/bib/bbv103. Epub 2015 Dec 1.

引用本文的文献

1
Direct genetic transformation bypasses tumor-associated DNA methylation alterations.直接基因转化绕过了肿瘤相关的DNA甲基化改变。
Genome Biol. 2025 Jul 17;26(1):212. doi: 10.1186/s13059-025-03650-2.
2
Improved epigenetic age prediction models by combining sex chromosome and autosomal markers.通过结合性染色体和常染色体标记改进表观遗传年龄预测模型。
Epigenetics Chromatin. 2025 Jul 15;18(1):45. doi: 10.1186/s13072-025-00606-5.
3
UNet with Attention Networks: A Novel Deep Learning Approach for DNA Methylation Prediction in HeLa Cells.

本文引用的文献

1
Atropos: specific, sensitive, and speedy trimming of sequencing reads.Atropos:对测序读数进行特定、灵敏且快速的修剪。
PeerJ. 2017 Aug 30;5:e3720. doi: 10.7717/peerj.3720. eCollection 2017.
2
DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning.DeepCpG:利用深度学习准确预测单细胞DNA甲基化状态
Genome Biol. 2017 Apr 11;18(1):67. doi: 10.1186/s13059-017-1189-z.
3
Predicting the impact of non-coding variants on DNA methylation.预测非编码变异对DNA甲基化的影响。
带有注意力网络的UNet:一种用于预测HeLa细胞中DNA甲基化的新型深度学习方法。
Genes (Basel). 2025 May 28;16(6):655. doi: 10.3390/genes16060655.
4
Robust molecular subgrouping and reference-free aneuploidy detection in medulloblastoma using low-depth whole genome bisulfite sequencing.利用低深度全基因组亚硫酸氢盐测序在髓母细胞瘤中进行稳健的分子亚组分析和无参考的非整倍体检测。
Acta Neuropathol Commun. 2025 Jun 24;13(1):132. doi: 10.1186/s40478-025-02049-1.
5
CMImpute: cross-species and tissue imputation of species-level DNA methylation samples across mammalian species.CMImpute:跨哺乳动物物种的物种水平DNA甲基化样本的跨物种和组织插补
Genome Biol. 2025 May 20;26(1):133. doi: 10.1186/s13059-025-03561-2.
6
Data-Driven Identification of Early Cancer-Associated Genes via Penalized Trans-Dimensional Hidden Markov Models.通过惩罚跨维度隐马尔可夫模型进行数据驱动的早期癌症相关基因识别。
Biomolecules. 2025 Feb 16;15(2):294. doi: 10.3390/biom15020294.
7
Advancing epigenetic profiling in cervical cancer: machine learning techniques for classifying DNA methylation patterns.宫颈癌中表观遗传谱分析的进展:用于分类DNA甲基化模式的机器学习技术
3 Biotech. 2024 Nov;14(11):264. doi: 10.1007/s13205-024-04107-2. Epub 2024 Oct 9.
8
The role of epigenetics in women's reproductive health: the impact of environmental factors.表观遗传学在女性生殖健康中的作用:环境因素的影响。
Front Endocrinol (Lausanne). 2024 Sep 13;15:1399757. doi: 10.3389/fendo.2024.1399757. eCollection 2024.
9
Cross-species and tissue imputation of species-level DNA methylation samples across mammalian species.跨哺乳动物物种的物种水平DNA甲基化样本的跨物种和组织推算。
bioRxiv. 2023 Nov 27:2023.11.26.568769. doi: 10.1101/2023.11.26.568769.
10
iCpG-Pos: an accurate computational approach for identification of CpG sites using positional features on single-cell whole genome sequence data.iCpG-Pos:一种使用单细胞全基因组序列数据上的位置特征来识别 CpG 位点的准确计算方法。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad474.
Nucleic Acids Res. 2017 Jun 20;45(11):e99. doi: 10.1093/nar/gkx177.
4
Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA.甲基化单倍型块的识别有助于对异质组织样本进行解卷积,并从血浆DNA中进行肿瘤组织起源映射。
Nat Genet. 2017 Apr;49(4):635-642. doi: 10.1038/ng.3805. Epub 2017 Mar 6.
5
Genetic regulatory signatures underlying islet gene expression and type 2 diabetes.胰岛基因表达和2型糖尿病背后的遗传调控特征。
Proc Natl Acad Sci U S A. 2017 Feb 28;114(9):2301-2306. doi: 10.1073/pnas.1621192114. Epub 2017 Feb 13.
6
Ascl2 inhibits myogenesis by antagonizing the transcriptional activity of myogenic regulatory factors.Ascl2通过拮抗成肌调节因子的转录活性来抑制肌生成。
Development. 2017 Jan 15;144(2):235-247. doi: 10.1242/dev.138099. Epub 2016 Dec 19.
7
Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling.对Illumina MethylationEPIC BeadChip微阵列用于全基因组DNA甲基化分析的批判性评估。
Genome Biol. 2016 Oct 7;17(1):208. doi: 10.1186/s13059-016-1066-1.
8
Next-generation genotype imputation service and methods.下一代基因型填充服务和方法。
Nat Genet. 2016 Oct;48(10):1284-1287. doi: 10.1038/ng.3656. Epub 2016 Aug 29.
9
A reference panel of 64,976 haplotypes for genotype imputation.用于基因型插补的64976个单倍型参考面板。
Nat Genet. 2016 Oct;48(10):1279-83. doi: 10.1038/ng.3643. Epub 2016 Aug 22.
10
The genetic architecture of type 2 diabetes.2型糖尿病的遗传结构
Nature. 2016 Aug 4;536(7614):41-47. doi: 10.1038/nature18642. Epub 2016 Jul 11.