proBAM 和 proBed 标准格式:实现基因组学和蛋白质组学数据的无缝集成。
The proBAM and proBed standard formats: enabling a seamless integration of genomics and proteomics data.
机构信息
Department of Mathematical Modeling, Statistics and Bioinformatics, Ghent University, Coupure links 653, 9000, Gent, Belgium.
Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, TX, USA.
出版信息
Genome Biol. 2018 Jan 31;19(1):12. doi: 10.1186/s13059-017-1377-x.
On behalf of The Human Proteome Organization (HUPO) Proteomics Standards Initiative, we introduce here two novel standard data formats, proBAM and proBed, that have been developed to address the current challenges of integrating mass spectrometry-based proteomics data with genomics and transcriptomics information in proteogenomics studies. proBAM and proBed are adaptations of the well-defined, widely used file formats SAM/BAM and BED, respectively, and both have been extended to meet the specific requirements entailed by proteomics data. Therefore, existing popular genomics tools such as SAMtools and Bedtools, and several widely used genome browsers, can already be used to manipulate and visualize these formats "out-of-the-box." We also highlight that a number of specific additional software tools, properly supporting the proteomics information available in these formats, are now available providing functionalities such as file generation, file conversion, and data analysis. All the related documentation, including the detailed file format specifications and example files, are accessible at http://www.psidev.info/probam and at http://www.psidev.info/probed .
代表人类蛋白质组组织(HUPO)蛋白质组学标准倡议,我们在此引入两种新的标准数据格式,proBAM 和 proBed,旨在解决在蛋白质基因组学研究中将基于质谱的蛋白质组学数据与基因组学和转录组学信息整合所面临的当前挑战。proBAM 和 proBed 分别是经过充分定义的、广泛使用的文件格式 SAM/BAM 和 BED 的改编版,并且都经过扩展以满足蛋白质组学数据所需要的特定要求。因此,现有的流行基因组学工具,如 SAMtools 和 Bedtools,以及几个广泛使用的基因组浏览器,已经可以“开箱即用”地用于操作和可视化这些格式。我们还强调,现在有许多专门的附加软件工具,正确支持这些格式中提供的蛋白质组学信息,提供了诸如文件生成、文件转换和数据分析等功能。所有相关文档,包括详细的文件格式规范和示例文件,均可在 http://www.psidev.info/probam 和 http://www.psidev.info/probed 上获取。