Spillane William J, Kelly Damien P, Curran Patrick J, Feeney Brendan G
Department of Chemistry, National University of Ireland, Galway, Ireland.
J Agric Food Chem. 2006 Aug 9;54(16):5996-6004. doi: 10.1021/jf0606656.
Forty-two new disubstituted phenylsulfamates have been synthesized, and 30 of these have been combined with 40 already available from earlier work to create a training database of 70 compounds. On the basis of panel taste data these were divided into three categories, N (nonsweet), N/S (nonsweet/sweet), and (S) sweet, and a "sweetness value" or weighting was also calculated for each compound. Using these 70 compounds as a training set and a series of nine predictors derived from Corey-Pauling-Koltun (CPK) models, calculated from the PC SPARTAN PRO program and Hammett sigma values taken from the literature, a classification and regression tree analysis (CART) was carried out leading to a regression tree that correctly classified 62 of the 70 compounds (89% overall correct classification). The tree's predictive ability varies for the different taste categories, and for nonsweet compounds it is virtually 100%; for nonsweet/sweet compounds it is 66%, and for sweet compounds it is approximately 75%. This tree correctly predicted taste categories for 10 compounds from a test set of 12 randomly selected from among the 42 new compounds (83% correct classification). Therefore, it can be used with a good degree of confidence to predict the tastes of disubstituted phenylsulfamates. For the design of new sweeteners, appropriate values or ranges of the descriptors are derived.
已合成了42种新的二取代苯磺酸盐,其中30种与早期工作中已有的40种相结合,创建了一个包含70种化合物的训练数据库。根据小组味觉数据,这些化合物被分为三类:N(非甜味)、N/S(非甜味/甜味)和(S)甜味,并且还为每种化合物计算了“甜度值”或权重。使用这70种化合物作为训练集,以及从Corey-Pauling-Koltun(CPK)模型导出的一系列九个预测变量(由PC SPARTAN PRO程序计算得出)和从文献中获取的哈米特σ值,进行了分类和回归树分析(CART),得到了一棵回归树,该回归树正确分类了70种化合物中的62种(总体正确分类率为89%)。该树对不同味觉类别的预测能力有所不同,对于非甜味化合物,其预测准确率几乎为100%;对于非甜味/甜味化合物,为66%,对于甜味化合物,约为75%。这棵树正确预测了从42种新化合物中随机选出的12种化合物的测试集中10种化合物的味觉类别(正确分类率为83%)。因此,可以相当有信心地使用它来预测二取代苯磺酸盐的味道。对于新甜味剂的设计,得出了描述符的适当值或范围。