Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands.
Anal Chem. 2011 Apr 1;83(7):2786-93. doi: 10.1021/ac2000994. Epub 2011 Mar 14.
The recent proliferation of high-resolution mass spectrometers has generated a wealth of new data analysis methods. However, flexible integration of these methods into configurations best suited to the research question is hampered by heterogeneous file formats and monolithic software development. The mzXML, mzData, and mzML file formats have enabled uniform access to unprocessed raw data. In this paper we present our efforts to produce an equally simple and powerful format, PeakML, to uniformly exchange processed intermediary and result data. To demonstrate the versatility of PeakML, we have developed an open source Java toolkit for processing, filtering, and annotating mass spectra in a customizable pipeline (mzMatch), as well as a user-friendly data visualization environment (PeakML Viewer). The PeakML format in particular enables the flexible exchange of processed data between software created by different groups or companies, as we illustrate by providing a PeakML-based integration of the widely used XCMS package with mzMatch data processing tools. As an added advantage, downstream analysis can benefit from direct access to the full mass trace information underlying summarized mass spectrometry results, providing the user with the means to rapidly verify results. The PeakML/mzMatch software is freely available at http://mzmatch.sourceforge.net, with documentation, tutorials, and a community forum.
近年来,高分辨率质谱仪的大量涌现产生了丰富的新数据分析方法。然而,由于文件格式的异构性和单一软件开发,这些方法很难灵活地集成到最适合研究问题的配置中。mzXML、mzData 和 mzML 文件格式已经实现了对未处理原始数据的统一访问。在本文中,我们介绍了我们为生成同样简单而强大的格式 PeakML 所做的努力,以便统一交换处理后的中间和结果数据。为了展示 PeakML 的多功能性,我们开发了一个用于处理、过滤和注释质谱图的开源 Java 工具包,可以在可定制的管道中进行处理(mzMatch),以及一个用户友好的数据可视化环境(PeakML Viewer)。特别地,PeakML 格式允许在不同组或公司创建的软件之间灵活地交换处理后的数据,我们通过提供基于 PeakML 的广泛使用的 XCMS 包与 mzMatch 数据处理工具的集成来展示这一点。作为一个额外的优势,下游分析可以从直接访问汇总质谱结果所依据的完整质量轨迹信息中受益,为用户提供了快速验证结果的手段。PeakML/mzMatch 软件可在 http://mzmatch.sourceforge.net 上免费获得,提供文档、教程和社区论坛。