Department of Chemistry, University of Louisville, Louisville, KY 40292, USA.
Biostatistics Core, Karmanos Cancer Institute, Wayne State University School of Medicine, Detroit, MI 48201, USA.
J Chromatogr A. 2014 Apr 11;1337:202-10. doi: 10.1016/j.chroma.2014.02.049. Epub 2014 Feb 24.
We developed a method, iMatch2, for compound identification using retention indices (RI) in NIST11 library. Three-way ANOVA test and Kruskal-Wallis test respectively demonstrate that column class and temperature program type defined by the NIST library are the most dominant factors affecting the magnitude of retention index while the retention index data type does not cause significant difference. The developed linear regression transformation for merging retention indices with different data types, but the same column class and temperature program type, reduces the standard deviation of retention index up to 8%, compared to the simple union approach used in the original iMatch. As for outlier detection methods to remove retention indices having large difference with the remaining data of the same compound, Tietjen-Moore test and generalized extreme studentized deviate test are the strictest methods, while methods such as Dixon's test, Thompson tau approach, and Grubbs' test are more conservative. To improve the accuracy of retention index window, a concept of compound specific retention index window is introduced for compounds with a large number of retention indices in the NIST11 library, while the retention index window is calculated from empirical distributions for the compounds with a small number of retention indices. Analysis of the experimental data of a mixture of compound standards and the metabolite extract from mouse liver show significant improvement of retention index quality in the NIST11 library and the new data analysis methods.
我们开发了一种使用 NIST11 库中的保留指数 (RI) 进行化合物鉴定的方法,iMatch2。三因素方差分析和克鲁斯卡尔-沃利斯检验分别表明,柱类别和 NIST 库定义的温度程序类型是影响保留指数大小的最主要因素,而保留指数数据类型不会造成显著差异。为了合并具有不同数据类型但具有相同柱类别和温度程序类型的保留指数,我们开发了一种线性回归转换方法,与原始 iMatch 中使用的简单联合方法相比,该方法可将保留指数的标准偏差降低 8%。对于用于去除与同一化合物的其余数据具有较大差异的保留指数的离群值检测方法,Tietjen-Moore 检验和广义极端学生化偏差检验是最严格的方法,而 Dixon 检验、Thompson tau 方法和 Grubbs 检验等方法则更为保守。为了提高保留指数窗口的准确性,我们为 NIST11 库中具有大量保留指数的化合物引入了化合物特异性保留指数窗口的概念,而对于具有少量保留指数的化合物,则从经验分布中计算保留指数窗口。对化合物标准混合物和小鼠肝提取物代谢物的实验数据分析表明,NIST11 库中的保留指数质量和新的数据分析方法得到了显著改善。