Rohde Christian, Zhang Yingying, Jurkowski Tomasz P, Stamerjohanns Heinrich, Reinhardt Richard, Jeltsch Albert
School of Engineering and Science, Jacobs University Bremen, Campus Ring 1, 28725 Bremen, Germany.
Nucleic Acids Res. 2008 Mar;36(5):e34. doi: 10.1093/nar/gkn083. Epub 2008 Feb 22.
During bisulfite genomic sequencing projects large amount of data are generated. The Bisulfite sequencing Data Presentation and Compilation (BDPC) web interface (http://biochem.jacobs-university.de/BDPC/) automatically analyzes bisulfite datasets prepared using the BiQ Analyzer. BDPC provides the following output: (i) MS-Excel compatible files compiling for each PCR product (a) the average methylation level, the number of clones analyzed and the percentage of CG sites analyzed (which is an indicator of data quality), (b) the methylation level observed at each CG site and (c) the methylation level of each clone. (ii) A methylation overview table compiling the methylation of all amplicons in all tissues. (iii) Publication grade figures in PNG format showing the methylation pattern for each PCR product embedded in an HMTL file summarizing the methylation data, the DNA sequence and some basic statistics. (iv) A summary file compiling the methylation pattern of different tissues, which is linked to the individual HTML result files, and can be directly used for presentation of the data in the internet. (v) A condensed file, containing all primary data in simplified format for further downstream data analysis and (vi) a custom track file for display of the results in the UCSC genome browser.
在亚硫酸氢盐基因组测序项目中会产生大量数据。亚硫酸氢盐测序数据呈现与汇编(BDPC)网络界面(http://biochem.jacobs-university.de/BDPC/)会自动分析使用BiQ Analyzer制备的亚硫酸氢盐数据集。BDPC提供以下输出:(i)与MS-Excel兼容的文件,针对每个PCR产物汇编(a)平均甲基化水平、分析的克隆数以及分析的CG位点百分比(这是数据质量的一个指标),(b)在每个CG位点观察到的甲基化水平,以及(c)每个克隆的甲基化水平。(ii)一个甲基化概况表,汇编所有组织中所有扩增子的甲基化情况。(iii)PNG格式的可用于发表的图表,展示每个PCR产物的甲基化模式,该图表嵌入在一个HTML文件中,该文件汇总了甲基化数据、DNA序列和一些基本统计信息。(iv)一个汇总文件,汇编不同组织的甲基化模式,该文件与各个HTML结果文件相关联,可直接用于在互联网上展示数据。(v)一个精简文件,包含所有简化格式的原始数据,用于进一步的下游数据分析;以及(vi)一个自定义轨迹文件,用于在UCSC基因组浏览器中显示结果。