Department of Engineering and Chemical Sciences, Karlstad University, SE-651 88 Karlstad, Sweden.
Faculty of Pharmaceutical Sciences, University of Iceland, Hagi, Hofsvallargata 53, 107 Reykjavik, Iceland; ArcticMass, Sturlugata 8, 101 Reykjavík, Iceland.
J Chromatogr A. 2019 Aug 2;1598:92-100. doi: 10.1016/j.chroma.2019.03.043. Epub 2019 Mar 29.
A strategy for determining a suitable solvent gradient in silico in preparative peptide separations is presented. The strategy utilizes a machine-learning-based method, called ELUDE, for peptide retention time predictions based on the amino acid sequences of the peptides. A suitable gradient is calculated according to linear solvent strength theory by predicting the retention times of the peptides being purified at three different gradient slopes. The advantage of this strategy is that fewer experiments are needed to develop a purification method, making it useful for labs conducting many separations but with limited resources for method development. The preparative separation of met-enkephalin and leu-enkephalin was used as model solutes on two stationary phases: XBridge C18 and CSH C18. The ELUDE algorithm contains a support vector regression and is pre-trained, meaning that only 10-50 peptides are needed to calibrate a model for a certain stationary phase and gradient. The calibration is done once and the model can then be used for new peptides similar in size to those in the calibration set. We found that the accuracy of the retention time predictions is good enough to usefully estimate a suitable gradient and that it was possible to compare the selectivity on different stationary phases in silico. The absolute relative errors in retention time for the predicted gradients were 4.2% and 3.7% for met-enkephalin and leu-enkephalin, respectively, on the XBridge C18 column and 2.0% and 2.8% on the CSH C18 column. The predicted retention times were also used as initial values for adsorption isotherm parameter determination, facilitating the numerical calculation of overloaded elution profiles. Changing the trifluoroacetic acid (TFA) concentration from 0.05% to 0.15% in the eluent did not seriously affect the error in the retention time predictions for the XBridge C18 column, an increase of 1.0 min (in retention factor, 1.3). For the CSH C18 column the error was, on average, 2.6 times larger. This indicates that the model needs to be recalibrated when changing the TFA concentration for the CSH column. Studying possible scale-up complications from UHPLC to HPLC such as pressure, viscous heating (i.e., temperature gradients), and stationary-phase properties (e.g., packing heterogeneity and surface chemistry) revealed that all these factors were minor to negligible. The pressure effect had the largest effect on the retention, but increased retention by only 3%. In the presented case, method development can therefore proceed using UHPLC and then be robustly transferred to HPLC.
提出了一种在制备肽分离中在计算机中确定合适溶剂梯度的策略。该策略利用基于机器学习的方法 ELUDE,根据肽的氨基酸序列进行肽保留时间预测。根据线性溶剂强度理论,通过预测在三种不同梯度斜率下纯化的肽的保留时间,计算合适的梯度。这种策略的优点是,开发一种纯化方法所需的实验更少,因此对于进行许多分离但方法开发资源有限的实验室很有用。在两种固定相上,使用 met-enkephalin 和 leu-enkephalin 作为模型溶质进行制备分离:XBridge C18 和 CSH C18。ELUDE 算法包含一个支持向量回归并且是预训练的,这意味着仅需 10-50 个肽即可为特定固定相和梯度校准模型。只需进行一次校准,然后就可以将模型用于与校准集中的肽大小相似的新肽。我们发现,保留时间预测的准确性足以有效地估计合适的梯度,并且可以在计算机中比较不同固定相的选择性。对于 XBridge C18 柱,met-enkephalin 和 leu-enkephalin 的预测梯度的保留时间的绝对相对误差分别为 4.2%和 3.7%,对于 CSH C18 柱分别为 2.0%和 2.8%。预测的保留时间也被用作吸附等温线参数确定的初始值,便于对过载洗脱曲线进行数值计算。改变洗脱液中的三氟乙酸(TFA)浓度从 0.05%增加到 0.15%,对 XBridge C18 柱保留时间预测的误差影响不大,增加了 1.0 分钟(保留因子增加 1.3)。对于 CSH C18 柱,误差平均增加了 2.6 倍。这表明当为 CSH 柱改变 TFA 浓度时,模型需要重新校准。研究从 UHPLC 到 HPLC 的可能放大问题,例如压力、粘性加热(即温度梯度)和固定相性质(例如,填充不均匀性和表面化学),结果表明所有这些因素都是次要的或可以忽略不计的。压力效应对保留的影响最大,但仅增加了 3%的保留。在目前的情况下,因此可以使用 UHPLC 进行方法开发,然后稳健地转移到 HPLC。