Zhang Wenchao, Zhao Patrick X
BMC Bioinformatics. 2014;15 Suppl 11(Suppl 11):S5. doi: 10.1186/1471-2105-15-S11-S5. Epub 2014 Oct 21.
Extracted ion chromatogram (EIC) extraction and chromatographic peak detection are two important processing procedures in liquid chromatography/mass spectrometry (LC/MS)-based metabolomics data analysis. Most commonly, the LC/MS technique employs electrospray ionization as the ionization method. The EICs from LC/MS data are often noisy and contain high background signals. Furthermore, the chromatographic peak quality varies with respect to its location in the chromatogram and most peaks have zigzag shapes. Therefore, there is a critical need to develop effective metrics for quality evaluation of EICs and chromatographic peaks in LC/MS based metabolomics data analysis.
We investigated a comprehensive set of potential quality evaluation metrics for extracted EICs and detected chromatographic peaks. Specifically, for EIC quality evaluation, we analyzed the mass chromatographic quality index (MCQ index) and propose a novel quality evaluation metric, the EIC-related global zigzag index, which is based on an EIC's first order derivatives. For chromatographic peak quality evaluation, we analyzed and compared six metrics: sharpness, Gaussian similarity, signal-to-noise ratio, peak significance level, triangle peak area similarity ratio and the local peak-related local zigzag index.
Although the MCQ index is suited for selecting and aligning analyte components, it cannot fairly evaluate EICs with high background signals or those containing only a single peak. Our proposed EIC related global zigzag index is robust enough to evaluate EIC qualities in both scenarios. Of the six peak quality evaluation metrics, the sharpness, peak significance level, and zigzag index outperform the others due to the zigzag nature of LC/MS chromatographic peaks. Furthermore, using several peak quality metrics in combination is more efficient than individual metrics in peak quality evaluation.
提取离子色谱图(EIC)提取和色谱峰检测是基于液相色谱/质谱(LC/MS)的代谢组学数据分析中的两个重要处理程序。最常见的是,LC/MS技术采用电喷雾电离作为电离方法。来自LC/MS数据的EIC通常有噪声且包含高背景信号。此外,色谱峰质量因其在色谱图中的位置而异,并且大多数峰具有锯齿形状。因此,迫切需要开发有效的指标来评估基于LC/MS的代谢组学数据分析中EIC和色谱峰的质量。
我们研究了一套全面的针对提取的EIC和检测到的色谱峰的潜在质量评估指标。具体而言,对于EIC质量评估,我们分析了质量色谱质量指数(MCQ指数),并提出了一种新的质量评估指标,即基于EIC一阶导数的EIC相关全局锯齿指数。对于色谱峰质量评估,我们分析并比较了六个指标:尖锐度、高斯相似度、信噪比、峰显著性水平、三角形峰面积相似度和局部峰相关局部锯齿指数。
虽然MCQ指数适用于选择和对齐分析物成分,但它不能公平地评估具有高背景信号或仅包含单个峰的EIC。我们提出的EIC相关全局锯齿指数足够稳健,能够在两种情况下评估EIC质量。在六个峰质量评估指标中,由于LC/MS色谱峰的锯齿性质,尖锐度、峰显著性水平和锯齿指数比其他指标表现更好。此外,在峰质量评估中,结合使用几个峰质量指标比单独使用指标更有效。