Cai Yu-Dong, Qian Ziliang, Lu Lin, Feng Kai-Yan, Meng Xin, Niu Bing, Zhao Guo-Dong, Lu Wen-Cong
CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China.
Mol Divers. 2008 May;12(2):131-7. doi: 10.1007/s11030-008-9085-9. Epub 2008 Aug 14.
Efficient in silico screening approaches may provide valuable hints on biological functions of the compound-candidates, which could help to screen functional compounds either in basic researches on metabolic pathways or drug discovery. Here, we introduce a machine learning method (Nearest Neighbor Algorithm) based on functional group composition of compounds to the analysis of metabolic pathways. This method can quickly map small chemical molecules to the metabolic pathway that they likely belong to. A set of 2,764 compounds from 11 major classes of metabolic pathways were selected for study. The overall prediction rate reached 73.3%, indicating that functional group composition of compounds was really related to their biological metabolic functions.
高效的计算机筛选方法可为化合物候选物的生物学功能提供有价值的线索,这有助于在代谢途径的基础研究或药物发现中筛选功能性化合物。在此,我们将基于化合物官能团组成的机器学习方法(最近邻算法)引入代谢途径分析。该方法可以快速将小分子化学物质映射到它们可能所属的代谢途径。我们选择了来自11种主要代谢途径的2764种化合物进行研究。总体预测率达到73.3%,表明化合物的官能团组成确实与其生物代谢功能相关。