Department of Biopharmaceutics and Pharmacodynamics, Medical University of Gdańsk, Al. Gen. Hallera 107, 80-416, Gdańsk, Poland.
Anal Bioanal Chem. 2018 Jun;410(16):3905-3915. doi: 10.1007/s00216-018-1061-3. Epub 2018 Apr 21.
It is relatively easy to collect chromatographic measurements for a large number of analytes, especially with gradient chromatographic methods coupled with mass spectrometry detection. Such data often have a hierarchical or clustered structure. For example, analytes with similar hydrophobicity and dissociation constant tend to be more alike in their retention than a randomly chosen set of analytes. Multilevel models recognize the existence of such data structures by assigning a model for each parameter, with its parameters also estimated from data. In this work, a multilevel model is proposed to describe retention time data obtained from a series of wide linear organic modifier gradients of different gradient duration and different mobile phase pH for a large set of acids and bases. The multilevel model consists of (1) the same deterministic equation describing the relationship between retention time and analyte-specific and instrument-specific parameters, (2) covariance relationships relating various physicochemical properties of the analyte to chromatographically specific parameters through quantitative structure-retention relationship based equations, and (3) stochastic components of intra-analyte and interanalyte variability. The model was implemented in Stan, which provides full Bayesian inference for continuous-variable models through Markov chain Monte Carlo methods. Graphical abstract Relationships between log k and MeOH content for acidic, basic, and neutral compounds with different log P. CI credible interval, PSA polar surface area.
对于大量的分析物,特别是与质谱检测相结合的梯度色谱方法,收集色谱测量数据相对容易。这些数据通常具有层次结构或聚类结构。例如,具有相似疏水性和离解常数的分析物在保留时间上往往比随机选择的一组分析物更为相似。多水平模型通过为每个参数分配一个模型来识别这种数据结构的存在,其参数也从数据中估计。在这项工作中,提出了一种多水平模型来描述从一系列不同梯度时间和不同流动相 pH 的宽线性有机修饰梯度中获得的大量酸和碱的保留时间数据。多水平模型由以下部分组成:(1)描述保留时间与分析物特定和仪器特定参数之间关系的相同确定性方程;(2)通过基于定量结构-保留关系的方程将分析物的各种物理化学性质与色谱特定参数相关联的协方差关系;(3)分析物内和分析物间变异性的随机成分。该模型在 Stan 中实现,Stan 通过马尔可夫链蒙特卡罗方法为连续变量模型提供完全贝叶斯推断。