Organic Pharmaceutical Chemistry, Department of Medicinal Chemistry, BMC, Uppsala University, P.O. Box 574, SE-751 23 Uppsala, Sweden.
J Chem Inf Model. 2012 Jan 23;52(1):225-32. doi: 10.1021/ci2004835. Epub 2011 Dec 22.
Virtual screening is widely applied in drug discovery, and significant effort has been put into improving current methods. In this study, we have evaluated the performance of compound ranking in virtual screening using five different data fusion algorithms on a total of 16 data sets. The data were generated by docking, pharmacophore search, shape similarity, and electrostatic similarity, spanning both structure- and ligand-based methods. The algorithms used for data fusion were sum rank, rank vote, sum score, Pareto ranking, and parallel selection. None of the fusion methods require any prior knowledge or input other than the results from the single methods and, thus, are readily applicable. The results show that compound ranking using data fusion improves the performance and consistency of virtual screening compared to the single methods alone. The best performing data fusion algorithm was parallel selection, but both rank voting and Pareto ranking also have good performance.
虚拟筛选在药物发现中得到了广泛应用,人们已经投入了大量精力来改进当前的方法。在这项研究中,我们使用五种不同的数据融合算法,对总共 16 个数据集进行了虚拟筛选中化合物排序的性能评估。这些数据是通过对接、药效团搜索、形状相似性和静电相似性生成的,涵盖了基于结构和基于配体的方法。用于数据融合的算法包括总和排序、排序投票、总和评分、帕累托排序和并行选择。这些融合方法除了单个方法的结果外,不需要任何先验知识或输入,因此很容易应用。结果表明,与单独使用单一方法相比,使用数据融合进行化合物排序可以提高虚拟筛选的性能和一致性。表现最好的数据融合算法是并行选择,但排序投票和帕累托排序也有很好的性能。