• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

作物全基因组亚硫酸氢盐测序数据的图谱绘制方法性能

Performance of Mapping Approaches for Whole-Genome Bisulfite Sequencing Data in Crop Plants.

作者信息

Grehl Claudius, Wagner Marc, Lemnian Ioana, Glaser Bruno, Grosse Ivo

机构信息

Institute of Computer Science, Bioinformatics, Martin Luther University Halle-Wittenberg, Von Seckendorff-Platz 1, Halle (Saale), Germany.

Institute of Agronomy and Nutritional Sciences, Soil Biogeochemistry, Martin Luther University Halle-Wittenberg, Von Seckendorff-Platz 3, Halle (Saale), Germany.

出版信息

Front Plant Sci. 2020 Feb 28;11:176. doi: 10.3389/fpls.2020.00176. eCollection 2020.

DOI:10.3389/fpls.2020.00176
PMID:32256504
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7093021/
Abstract

DNA methylation is involved in many different biological processes in the development and well-being of crop plants such as transposon activation, heterosis, environment-dependent transcriptome plasticity, aging, and many diseases. Whole-genome bisulfite sequencing is an excellent technology for detecting and quantifying DNA methylation patterns in a wide variety of species, but optimized data analysis pipelines exist only for a small number of species and are missing for many important crop plants. This is especially important as most existing benchmark studies have been performed on mammals with hardly any repetitive elements and without CHG and CHH methylation. Pipelines for the analysis of whole-genome bisulfite sequencing data usually consists of four steps: read trimming, read mapping, quantification of methylation levels, and prediction of differentially methylated regions (DMRs). Here we focus on read mapping, which is challenging because un-methylated cytosines are transformed to uracil during bisulfite treatment and to thymine during the subsequent polymerase chain reaction, and read mappers must be capable of dealing with this cytosine/thymine polymorphism. Several read mappers have been developed over the last years, with different strengths and weaknesses, but their performances have not been critically evaluated. Here, we compare eight read mappers: Bismark, BismarkBwt2, BSMAP, BS-Seeker2, Bwameth, GEM3, Segemehl, and GSNAP to assess the impact of the read-mapping results on the prediction of DMRs. We used simulated data generated from the genomes of , , , , and , monitored the effects of the bisulfite conversion rate, the sequencing error rate, the maximum number of allowed mismatches, as well as the genome structure and size, and calculated precision, number of uniquely mapped reads, distribution of the mapped reads, run time, and memory consumption as features for benchmarking the eight read mappers mentioned above. Furthermore, we validated our findings using real-world data of and showed the influence of the mapping step on DMR calling in WGBS pipelines. We found that the conversion rate had only a minor impact on the mapping quality and the number of uniquely mapped reads, whereas the error rate and the maximum number of allowed mismatches had a strong impact and leads to differences of the performance of the eight read mappers. In conclusion, we recommend BSMAP which needs the shortest run time and yields the highest precision, and Bismark which requires the smallest amount of memory and yields precision and high numbers of uniquely mapped reads.

摘要

DNA甲基化参与了作物植物发育和健康过程中的许多不同生物学过程,如转座子激活、杂种优势、环境依赖性转录组可塑性、衰老以及多种疾病。全基因组亚硫酸氢盐测序是一种用于检测和定量多种物种中DNA甲基化模式的优秀技术,但仅针对少数物种存在优化的数据分析流程,许多重要的作物植物则没有。这一点尤为重要,因为大多数现有的基准研究是在几乎没有重复元件且不存在CHG和CHH甲基化的哺乳动物上进行的。全基因组亚硫酸氢盐测序数据分析流程通常包括四个步骤:读段修剪、读段比对、甲基化水平定量以及差异甲基化区域(DMR)预测。在这里,我们重点关注读段比对,这具有挑战性,因为未甲基化的胞嘧啶在亚硫酸氢盐处理过程中会转化为尿嘧啶,在随后的聚合酶链反应中又会转化为胸腺嘧啶,并且读段比对工具必须能够处理这种胞嘧啶/胸腺嘧啶多态性。在过去几年中已经开发了几种读段比对工具,各有优缺点,但它们的性能尚未得到严格评估。在这里,我们比较了八种读段比对工具:Bismark、BismarkBwt2、BSMAP、BS-Seeker2、Bwameth、GEM3、Segemehl和GSNAP,以评估读段比对结果对DMR预测的影响。我们使用了从 、 、 、 和 的基因组生成的模拟数据,监测了亚硫酸氢盐转化率、测序错误率、允许的最大错配数以及基因组结构和大小的影响,并计算了精度、唯一比对读段数、比对读段的分布、运行时间和内存消耗,作为对上述八种读段比对工具进行基准测试的特征。此外,我们使用 的实际数据验证了我们的发现,并展示了比对步骤对全基因组亚硫酸氢盐测序流程中DMR调用的影响。我们发现转化率对比对质量和唯一比对读段数的影响较小,而错误率和允许的最大错配数有很大影响,并导致八种读段比对工具的性能存在差异。总之,我们推荐运行时间最短且精度最高的BSMAP,以及内存需求最小且精度高且唯一比对读段数多的Bismark。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/83edbbd45562/fpls-11-00176-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/a4673a5073c8/fpls-11-00176-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/c722a43a606d/fpls-11-00176-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/c0c4c630b6ef/fpls-11-00176-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/5d0457b69b78/fpls-11-00176-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/08946b0124ff/fpls-11-00176-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/edb4e5c8d716/fpls-11-00176-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/e4c0dd8fbe5f/fpls-11-00176-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/89ba18c0249d/fpls-11-00176-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/83edbbd45562/fpls-11-00176-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/a4673a5073c8/fpls-11-00176-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/c722a43a606d/fpls-11-00176-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/c0c4c630b6ef/fpls-11-00176-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/5d0457b69b78/fpls-11-00176-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/08946b0124ff/fpls-11-00176-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/edb4e5c8d716/fpls-11-00176-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/e4c0dd8fbe5f/fpls-11-00176-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/89ba18c0249d/fpls-11-00176-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a14b/7093021/83edbbd45562/fpls-11-00176-g009.jpg

相似文献

1
Performance of Mapping Approaches for Whole-Genome Bisulfite Sequencing Data in Crop Plants.作物全基因组亚硫酸氢盐测序数据的图谱绘制方法性能
Front Plant Sci. 2020 Feb 28;11:176. doi: 10.3389/fpls.2020.00176. eCollection 2020.
2
Comprehensive benchmarking of software for mapping whole genome bisulfite data: from read alignment to DNA methylation analysis.全面比较全基因组亚硫酸氢盐数据图谱绘制软件:从读取序列比对到 DNA 甲基化分析。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab021.
3
An integrative approach for efficient analysis of whole genome bisulfite sequencing data.一种用于全基因组亚硫酸氢盐测序数据高效分析的综合方法。
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S14. doi: 10.1186/1471-2164-16-S12-S14. Epub 2015 Dec 9.
4
Benchmarking DNA methylation analysis of 14 alignment algorithms for whole genome bisulfite sequencing in mammals.哺乳动物全基因组亚硫酸氢盐测序的14种比对算法的DNA甲基化分析基准测试
Comput Struct Biotechnol J. 2022 Aug 27;20:4704-4716. doi: 10.1016/j.csbj.2022.08.051. eCollection 2022.
5
Systematic and benchmarking studies of pipelines for mammal WGBS data in the novel NGS platform.针对新型 NGS 平台中哺乳动物 WGBS 数据的管道进行系统和基准研究。
BMC Bioinformatics. 2023 Jan 31;24(1):33. doi: 10.1186/s12859-023-05163-w.
6
Evaluation of preprocessing, mapping and postprocessing algorithms for analyzing whole genome bisulfite sequencing data.用于分析全基因组亚硫酸氢盐测序数据的预处理、映射和后处理算法的评估。
Brief Bioinform. 2016 Nov;17(6):938-952. doi: 10.1093/bib/bbv103. Epub 2015 Dec 1.
7
Efficient and accurate determination of genome-wide DNA methylation patterns in Arabidopsis thaliana with enzymatic methyl sequencing.利用酶甲基测序技术高效、准确地测定拟南芥全基因组 DNA 甲基化模式。
Epigenetics Chromatin. 2020 Oct 7;13(1):42. doi: 10.1186/s13072-020-00361-9.
8
Objective and comprehensive evaluation of bisulfite short read mapping tools.亚硫酸氢盐短读长比对工具的客观全面评估。
Adv Bioinformatics. 2014;2014:472045. doi: 10.1155/2014/472045. Epub 2014 Apr 15.
9
MethylC-analyzer: a comprehensive downstream pipeline for the analysis of genome-wide DNA methylation.甲基化C分析器:用于全基因组DNA甲基化分析的综合下游流程
Bot Stud. 2023 Jan 6;64(1):1. doi: 10.1186/s40529-022-00366-5.
10
Comparison and quantitative verification of mapping algorithms for whole-genome bisulfite sequencing.全基因组亚硫酸氢盐测序映射算法的比较与定量验证
Nucleic Acids Res. 2014 Apr;42(6):e43. doi: 10.1093/nar/gkt1325. Epub 2014 Jan 3.

引用本文的文献

1
DNA methylation in wheat: current understanding and future potential for enhancing biotic and abiotic stress tolerance.小麦中的DNA甲基化:当前的认识以及增强生物和非生物胁迫耐受性的未来潜力
Physiol Mol Biol Plants. 2024 Dec;30(12):1921-1933. doi: 10.1007/s12298-024-01539-1. Epub 2024 Dec 10.
2
Efficiently quantifying DNA methylation for bulk- and single-cell bisulfite data.高效定量批量和单细胞亚硫酸氢盐数据中的 DNA 甲基化。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad386.
3
Systematic and benchmarking studies of pipelines for mammal WGBS data in the novel NGS platform.

本文引用的文献

1
DNA methylation analysis in plants: review of computational tools and future perspectives.植物中的 DNA 甲基化分析:计算工具的综述与未来展望。
Brief Bioinform. 2020 May 21;21(3):906-918. doi: 10.1093/bib/bbz039.
2
A reference-grade wild soybean genome.一份参照级别的野生大豆基因组。
Nat Commun. 2019 Mar 14;10(1):1216. doi: 10.1038/s41467-019-09142-9.
3
Plasticity of DNA methylation and gene expression under zinc deficiency in Arabidopsis roots.在拟南芥根中锌缺乏条件下 DNA 甲基化和基因表达的可塑性。
针对新型 NGS 平台中哺乳动物 WGBS 数据的管道进行系统和基准研究。
BMC Bioinformatics. 2023 Jan 31;24(1):33. doi: 10.1186/s12859-023-05163-w.
4
Multi-Omics Approaches and Resources for Systems-Level Gene Function Prediction in the Plant Kingdom.植物王国中用于系统水平基因功能预测的多组学方法与资源
Plants (Basel). 2022 Oct 5;11(19):2614. doi: 10.3390/plants11192614.
5
Benchmarking DNA methylation analysis of 14 alignment algorithms for whole genome bisulfite sequencing in mammals.哺乳动物全基因组亚硫酸氢盐测序的14种比对算法的DNA甲基化分析基准测试
Comput Struct Biotechnol J. 2022 Aug 27;20:4704-4716. doi: 10.1016/j.csbj.2022.08.051. eCollection 2022.
6
Analysis and Performance Assessment of the Whole Genome Bisulfite Sequencing Data Workflow: Currently Available Tools and a Practical Guide to Advance DNA Methylation Studies.全基因组亚硫酸氢盐测序数据分析和性能评估:现有工具及推进 DNA 甲基化研究的实用指南。
Small Methods. 2022 Mar;6(3):e2101251. doi: 10.1002/smtd.202101251. Epub 2022 Jan 22.
7
Challenges and Perspectives in the Epigenetics of Climate Change-Induced Forests Decline.气候变化导致森林衰退的表观遗传学中的挑战与展望
Front Plant Sci. 2022 Jan 4;12:797958. doi: 10.3389/fpls.2021.797958. eCollection 2021.
8
Low guanine content and biased nucleotide distribution in vertebrate mtDNA can cause overestimation of non-CpG methylation.脊椎动物线粒体DNA中鸟嘌呤含量低和核苷酸分布偏向性会导致非CpG甲基化的高估。
NAR Genom Bioinform. 2022 Jan 14;4(1):lqab119. doi: 10.1093/nargab/lqab119. eCollection 2022 Mar.
9
Performance of methods to detect genetic variants from bisulphite sequencing data in a non-model species.检测非模式物种亚硫酸氢盐测序数据中遗传变异的方法性能。
Mol Ecol Resour. 2022 Feb;22(2):834-846. doi: 10.1111/1755-0998.13493. Epub 2021 Sep 6.
10
Rapid identification of methylase specificity (RIMS-seq) jointly identifies methylated motifs and generates shotgun sequencing of bacterial genomes.快速鉴定甲基化酶特异性(RIMS-seq)联合鉴定甲基化基序,并对细菌基因组进行鸟枪法测序。
Nucleic Acids Res. 2021 Nov 8;49(19):e113. doi: 10.1093/nar/gkab705.
Plant Cell Physiol. 2018 Sep 1;59(9):1790-1802. doi: 10.1093/pcp/pcy100.
4
Analysis of DNA methylation in cancer: location revisited.癌症中 DNA 甲基化分析:位置再探。
Nat Rev Clin Oncol. 2018 Jul;15(7):459-466. doi: 10.1038/s41571-018-0004-4.
5
A comprehensive evaluation of alignment software for reduced representation bisulfite sequencing data.简化基因组重亚硫酸盐测序数据比对软件的综合评估。
Bioinformatics. 2018 Aug 15;34(16):2715-2723. doi: 10.1093/bioinformatics/bty174.
6
Defiant: (DMRs: easy, fast, identification and ANnoTation) identifies differentially Methylated regions from iron-deficient rat hippocampus. defiant:(dmrs:简便、快速、鉴定和注释)从缺铁性大鼠海马体中识别差异甲基化区域。
BMC Bioinformatics. 2018 Feb 5;19(1):31. doi: 10.1186/s12859-018-2037-1.
7
Parental DNA Methylation States Are Associated with Heterosis in Epigenetic Hybrids.亲本 DNA 甲基化状态与表观遗传杂种的杂种优势有关。
Plant Physiol. 2018 Feb;176(2):1627-1645. doi: 10.1104/pp.17.01054. Epub 2017 Dec 1.
8
A MBD-seq protocol for large-scale methylome-wide studies with (very) low amounts of DNA.一种用于在(非常)少量 DNA 条件下进行大规模全基因组甲基化研究的 MBD-seq 方案。
Epigenetics. 2017 Sep;12(9):743-750. doi: 10.1080/15592294.2017.1335849. Epub 2017 Nov 6.
9
Divergent cytosine DNA methylation patterns in single-cell, soybean root hairs.单细胞大豆根毛中不同的胞嘧啶DNA甲基化模式
New Phytol. 2017 Apr;214(2):808-819. doi: 10.1111/nph.14421. Epub 2017 Jan 20.
10
A comparison of tools for the simulation of genomic next-generation sequencing data.用于模拟基因组下一代测序数据的工具比较。
Nat Rev Genet. 2016 Aug;17(8):459-69. doi: 10.1038/nrg.2016.57. Epub 2016 Jun 20.