State Key Laboratory of Southwestern Chinese Medicine Resources, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China; School of Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan 611137, China.
School of Basic Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan 611137, China.
J Chromatogr A. 2022 Jun 21;1673:463086. doi: 10.1016/j.chroma.2022.463086. Epub 2022 Apr 21.
A new 'shape-orientated' continuous wavelet transform (CWT)-based algorithm employing an adapted Marr wavelet (AMW) with a shape matching index (SMI), defined as peak height normalized wavelet coefficient ( [Formula: see text] ) for feature filtering, was developed for chromatographic peak detection and quantification. Exploiting the chromatographic profile of a candidate peak, AMW-SMI algorithm emphasizes more on the matching of the overall chromatographic profile to a reference Gaussian shape, while partly alleviates the requirement on the signal intensity derived from single or several data points, thus it allows the detection of low-intensity features from metabolites at low abundance. AMW-SMI imposes maximum and minimum thresholds on the ridgeline width and length to define a valid ridgeline, which corresponds to a more stably shaped chromatographic profile. The maximum wavelet coefficient C(a,b) on the valid ridgeline determines the translation b as the peak center. AMW-SMI detects the valley lines to define the peak boundaries, which is important to obtain accurate peak quantification. As a more 'shape-orientated' peak detection algorithm, various methods related to the 'shape' are introduced for feature filtering, out of which, the effective SNR (eSNR) is defined to evaluate if the shape is strong or good enough relative to the 'shape noise', and the SMI, which can quantitatively evaluate the shape quality regardless of the data intensities and peak width, is applied to filter out the poorly shaped false positives. AMW-SMI performs Gaussian fitting of all data points between the defined peak boundaries to refine the peak parameters, and the refined SMI, SNR and peak width can be applied for further feature filtering and reinforce the 'shape-quality' of final selected peaks. The performance of AMW-SMI is evaluated qualitatively (by recall, precision and F-score) and quantitatively (by ratio of isotopic features and triplicate RSD) on the LC-MS data of model mixtures of 21 human metabolite standards and 8 plant metabolite standards, and of serum sample spiked with the 21 human metabolite standards, and on the triplicate LC-MS data of the same sample of cell metabolomic extracts. Compared with XCMS (centWave) and MZmine 2 (ADAP), the proposed AMW-SMI algorithm can faithfully identify chromatographic peaks with significantly fewer false positives and demonstrated general superiority in terms of qualitative precision (robustness) and quantitative accuracy (by ratio of isotopic features), and comparable recall (sensitivity) and quantitative stability (by RSD of triplicate measurements).
一种新的基于连续小波变换(CWT)的“形状导向”算法,使用了一种经过改进的 Marr 小波(AMW)和形状匹配指数(SMI),其特征过滤的定义为峰值高度归一化小波系数([公式:见正文])。该算法用于色谱峰检测和定量。利用候选峰的色谱轮廓,AMW-SMI 算法更强调整体色谱轮廓与参考高斯形状的匹配,同时部分减轻了对来自单个或几个数据点的信号强度的要求,从而允许从低丰度代谢物中检测到低强度特征。AMW-SMI 对脊线宽度和长度施加最大和最小阈值,以定义有效的脊线,这对应于更稳定的色谱轮廓。有效脊线上的最大小波系数 C(a,b)确定平移 b 作为峰中心。AMW-SMI 检测谷线以定义峰边界,这对于获得准确的峰定量很重要。作为一种更“形状导向”的峰检测算法,引入了各种与“形状”相关的方法进行特征过滤,其中定义了有效信噪比(eSNR)来评估形状相对于“形状噪声”的强弱,以及 SMI,它可以不考虑数据强度和峰宽,定量评估形状质量,用于过滤掉形状较差的假阳性。AMW-SMI 对定义的峰边界之间的所有数据点进行高斯拟合,以优化峰参数,优化后的 SMI、SNR 和峰宽可用于进一步的特征过滤,并加强最终选择的峰的“形状质量”。通过对 21 个人类代谢物标准和 8 种植物代谢物标准的 LC-MS 数据模型混合物、21 个人类代谢物标准添加的血清样本的 LC-MS 数据以及相同细胞代谢组提取物样本的三重复 LC-MS 数据进行定性(通过召回率、精度和 F 分数)和定量(通过同位素特征比和三重复 RSD)评估,与 XCMS(centWave)和 MZmine 2(ADAP)相比,所提出的 AMW-SMI 算法可以更准确地识别色谱峰,假阳性更少,并在定性精度(稳健性)和定量准确性(通过同位素特征比)方面具有明显优势,以及可比较的召回率(灵敏度)和定量稳定性(通过三重复测量的 RSD)。