Malm Louise, Kruve Anneli
Department of Materials and Environmental Chemistry, Stockholm University, 11418 Stockholm, Sweden.
Department of Environmental Science, Stockholm University, 11418 Stockholm, Sweden.
Analyst. 2025 Jul 16. doi: 10.1039/d5an00323g.
Retention time (RT) is essential in evaluating the likelihood of candidate structures in nontarget screening (NTS) with liquid chromatography high resolution mass spectrometry (LC/HRMS). Approaches for estimating the RTs of candidate structures can broadly be divided into projection and prediction methods. The first approach takes advantage of public databases of RTs measured on similar chromatographic systems (CS) and projects these to the chromatographic system applied in the NTS (CS) based on a small set of commonly analyzed chemicals. The second approach leverages machine learning (ML) model(s) trained on publicly available retention time data measured on one or more chromatographic systems (CS). Nevertheless, the CS and CS might differ substantially from CS. Therefore, it is of interest to evaluate the generalizability of projection models and prediction models in CSs routinely applied in NTS. Here we take advantage of the recent NORMAN interlaboratory comparison where 41 known calibration chemicals and 45 suspects were analyzed to evaluate both the projection and prediction approaches on 37 CSs. The accuracy of both approaches was directly linked to the similarity of the CS, and the pH of the mobile phase and the column chemistry were found to be most impactful. Furthermore, for cases where CS and CS differ substantially but CS and CS are similar, prediction models often performed on par with the projection models. These findings highlight the need to account for the mobile phase and column chemistry in ML model training and select the prediction model for RT.
保留时间(RT)在利用液相色谱高分辨率质谱(LC/HRMS)进行非目标筛查(NTS)时评估候选结构的可能性方面至关重要。估计候选结构保留时间的方法大致可分为投影法和预测法。第一种方法利用在类似色谱系统(CS)上测得的保留时间公共数据库,并基于一小组常用分析化学品将这些数据投影到NTS中应用的色谱系统(CS)上。第二种方法利用在一个或多个色谱系统(CS)上测得的公开可用保留时间数据训练的机器学习(ML)模型。然而,CS和CS可能与CS有很大差异。因此,评估投影模型和预测模型在NTS中常规应用的CS中的通用性是很有意义的。在此,我们利用最近的诺曼实验室间比较,其中分析了41种已知校准化学品和45种可疑物,以评估37种CS上的投影法和预测法。两种方法的准确性都与CS的相似性直接相关,发现流动相的pH值和柱化学性质影响最大。此外,对于CS和CS差异很大但CS和CS相似的情况,预测模型的表现通常与投影模型相当。这些发现凸显了在ML模型训练中考虑流动相和柱化学性质以及选择保留时间预测模型的必要性。