Department of Physics and Astronomy, University of Manitoba, Winnipeg, R3T 2N2, Canada.
J Chromatogr A. 2010 Jan 22;1217(4):489-97. doi: 10.1016/j.chroma.2009.11.065. Epub 2009 Nov 27.
A model for predicting the slope (S) in the fundamental equation of linear-solvent-strength theory for peptidic compounds was developed. Our approach is based on the novel assumption that three well-defined molecular descriptors: peptide length (N), charge (Z) and hydrophobicity index (HI) are the major contributors to the value of S. Following the definition of the model's variables, the retention of a number of Arg-terminated synthetic peptides was investigated under isocratic elution conditions (100 A pore size C18 phase, 0.1% trifluoroacetic acid as ion-pairing modifier). The peptide sequences were systematically designed to span the properties of the typical tryptic peptides that are analyzed in proteomic experiments. Experimental data show that slopes S increase with the independent increase in both peptide charge and peptide length when the two other parameters are held constant. The influence of peptide hydrophobicity is more complex: depending on peptide length and charge, stronger RP-HPLC retention can either decrease or increase the values of S. We postulate a general function to explain this behavior: S=C1 x Z(C2)+C3 x N(C4)+C5 x HI(C6)+C7/Z+C8/N+C9/HI+C10 x ZN+C11 x ZHI+C12 x NHI+B. A simple optimization using a "random walk" through parameter-space was used to determine the optimal coefficients compared to the measured S-values of 37 peptides. The model gives a approximately 0.97 R(2) correlation between the measured and predicted S-values: it was verified against previously published data on a human growth hormone protein tryptic digest and some synthetic analogues from that mixture.
建立了一个预测线性溶剂强度理论基本方程中肽化合物斜率(S)的模型。我们的方法基于一个新的假设,即三个明确定义的分子描述符:肽长度(N)、电荷(Z)和疏水性指数(HI)是影响 S 值的主要因素。根据模型变量的定义,在等度洗脱条件下(100A 孔径 C18 相,0.1%三氟乙酸作为离子对修饰剂)研究了一系列 Arg 末端合成肽的保留情况。肽序列被系统设计,以跨越在蛋白质组学实验中分析的典型胰蛋白酶肽的性质。实验数据表明,当其他两个参数保持不变时,斜率 S 随着肽电荷和肽长度的独立增加而增加。肽疏水性的影响更为复杂:取决于肽的长度和电荷,更强的反相高效液相色谱(RP-HPLC)保留可以降低或增加 S 的值。我们假设了一个通用函数来解释这种行为:S=C1 x Z(C2)+C3 x N(C4)+C5 x HI(C6)+C7/Z+C8/N+C9/HI+C10 x ZN+C11 x ZHI+C12 x NHI+B。使用“随机漫步”通过参数空间进行简单优化,以确定与 37 个肽的测量 S 值相比的最佳系数。该模型在测量 S 值和预测 S 值之间提供了大约 0.97 的 R(2)相关性:它通过与先前发表的关于人类生长激素蛋白胰蛋白酶消化物和该混合物中一些合成类似物的数据进行验证。