Mahé Pierre, Ueda Nobuhisa, Akutsu Tatsuya, Perret Jean-Luc, Vert Jean-Philippe
Ecole des Mines de Paris, 35 rue Saint Honoré, 77305 Fontainebleau, France.
J Chem Inf Model. 2005 Jul-Aug;45(4):939-51. doi: 10.1021/ci050039t.
The support vector machine algorithm together with graph kernel functions has recently been introduced to model structure-activity relationships (SAR) of molecules from their 2D structure, without the need for explicit molecular descriptor computation. We propose two extensions to this approach with the double goal to reduce the computational burden associated with the model and to enhance its predictive accuracy: description of the molecules by a Morgan index process and definition of a second-order Markov model for random walks on 2D structures. Experiments on two mutagenicity data sets validate the proposed extensions, making this approach a possible complementary alternative to other modeling strategies.
支持向量机算法与图核函数最近被引入,用于从二维结构对分子的构效关系(SAR)进行建模,而无需显式计算分子描述符。我们对该方法提出了两种扩展,其双重目标是减轻与模型相关的计算负担并提高其预测准确性:通过摩根指数过程描述分子以及为二维结构上的随机游走定义二阶马尔可夫模型。在两个致突变性数据集上的实验验证了所提出的扩展,使该方法成为其他建模策略的一种可能的补充替代方法。