Leibniz-Institut für Analytische Wissenschaften-ISAS-e.V. , Otto-Hahn-Straße 6b , 44227 Dortmund , Germany.
Wellcome Sanger Institute , Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA , United Kingdom.
Anal Chem. 2019 Mar 5;91(5):3302-3310. doi: 10.1021/acs.analchem.8b04310. Epub 2019 Feb 13.
Mass spectrometry (MS) is one of the primary techniques used for large-scale analysis of small molecules in metabolomics studies. To date, there has been little data format standardization in this field, as different software packages export results in different formats represented in XML or plain text, making data sharing, database deposition, and reanalysis highly challenging. Working within the consortia of the Metabolomics Standards Initiative, Proteomics Standards Initiative, and the Metabolomics Society, we have created mzTab-M to act as a common output format from analytical approaches using MS on small molecules. The format has been developed over several years, with input from a wide range of stakeholders. mzTab-M is a simple tab-separated text format, but importantly, the structure is highly standardized through the design of a detailed specification document, tightly coupled to validation software, and a mandatory controlled vocabulary of terms to populate it. The format is able to represent final quantification values from analyses, as well as the evidence trail in terms of features measured directly from MS (e.g., LC-MS, GC-MS, DIMS, etc.) and different types of approaches used to identify molecules. mzTab-M allows for ambiguity in the identification of molecules to be communicated clearly to readers of the files (both people and software). There are several implementations of the format available, and we anticipate widespread adoption in the field.
质谱 (MS) 是代谢组学研究中小分子大规模分析的主要技术之一。迄今为止,该领域几乎没有数据格式标准化,因为不同的软件包以 XML 或纯文本表示的不同格式导出结果,这使得数据共享、数据库存储和重新分析极具挑战性。在代谢组学标准倡议、蛋白质组学标准倡议和代谢组学学会的联合会内,我们创建了 mzTab-M,用作使用 MS 分析小分子的通用输出格式。该格式经过多年的发展,得到了广泛利益相关者的投入。mzTab-M 是一种简单的制表符分隔文本格式,但重要的是,通过设计详细的规范文档、紧密耦合的验证软件以及用于填充它的强制性术语控制词汇表,使结构高度标准化。该格式能够表示分析的最终定量值,以及通过直接从 MS(例如 LC-MS、GC-MS、DIMS 等)测量的特征和用于识别分子的不同类型的方法来表示的证据线索。mzTab-M 允许分子鉴定中的歧义清晰地传达给文件的读者(包括人和软件)。该格式有几个实现,我们预计它将在该领域得到广泛采用。