Ceroni Alessio, Costa Fabrizio, Frasconi Paolo
Machine Learning and Neural Networks Group, Dipartimento di Sistemi e Informatica, Universitá degli Studi di Firenze, Italy.
Bioinformatics. 2007 Aug 15;23(16):2038-45. doi: 10.1093/bioinformatics/btm298. Epub 2007 Jun 5.
Several kernel-based methods have been recently introduced for the classification of small molecules. Most available kernels on molecules are based on 2D representations obtained from chemical structures, but far less work has focused so far on the definition of effective kernels that can also exploit 3D information.
We introduce new ideas for building kernels on small molecules that can effectively use and combine 2D and 3D information. We tested these kernels in conjunction with support vector machines for binary classification on the 60 NCI cancer screening datasets as well as on the NCI HIV data set. Our results show that 3D information leveraged by these kernels can consistently improve prediction accuracy in all datasets.
An implementation of the small molecule classifier is available from http://www.dsi.unifi.it/neural/src/3DDK.
最近引入了几种基于核的方法用于小分子分类。大多数现有的分子核基于从化学结构获得的二维表示,但到目前为止,针对能够利用三维信息的有效核的定义所做的工作要少得多。
我们提出了构建小分子核的新思路,这些核能够有效地使用和结合二维和三维信息。我们将这些核与支持向量机结合起来,在60个NCI癌症筛查数据集以及NCI HIV数据集上进行二元分类测试。我们的结果表明,这些核利用的三维信息能够在所有数据集中持续提高预测准确率。
小分子分类器的实现可从http://www.dsi.unifi.it/neural/src/3DDK获取。