Department of Chemistry, University of British Columbia, Vancouver, British Columbia V6T 1Z3, Canada.
Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia V5Z 1L3, Canada.
J Proteome Res. 2019 Feb 1;18(2):700-708. doi: 10.1021/acs.jproteome.8b00721. Epub 2018 Dec 3.
Optimizing the quality of proteomics data collected from a mass spectrometer (MS) requires careful selection of acquisition parameters and proper assessment of instrument performance. Software tools capable of extracting a broad set of information from raw files, including metadata, scan, quantification, and identification data, are needed to provide guidance for MS system management. In this work, direct extraction and utilization of these data are demonstrated using RawTools, a standalone tool for extracting metadata and scan data directly from raw MS files generated on Thermo Orbitrap instruments. After parsing individual raw files, RawTools generates summarized and detailed plain-text outputs, including scan rates and durations, duty cycle characteristics, precursor and reporter ion quantification, and chromatography performance. RawTools also contains a diagnostic module that includes an optional "preview" database search, which supports informed decisions about optimizing MS performance based on a variety of metrics. RawTools has been developed in C# and utilizes the Thermo RawFileReader library, and thus can process raw MS files quickly and efficiently on all major operating systems (Windows, macOS, Linux). To demonstrate the utility of RawTools, metadata and scan data were extracted from both individual raw MS files and large collections of them to identify problematic characteristics of instrument performance. Taken together, the rich feature set of RawTools, combined with its capability to interrogate MS and experiment performance, makes this software a valuable tool for proteomics researchers.