Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, CB2 1GA, Cambridge, United Kingdom.
J Cheminform. 2012 Nov 6;4(1):27. doi: 10.1186/1758-2946-4-27.
Ligand-based virtual screening using molecular shape is an important tool for researchers who wish to find novel chemical scaffolds in compound libraries. The Ultrafast Shape Recognition (USR) algorithm is capable of screening millions of compounds and is therefore suitable for usage in a web service. The algorithm however is agnostic of atom types and cannot discriminate compounds with similar shape but distinct pharmacophoric features. To solve this problem, an extension of USR called USRCAT, has been developed that includes pharmacophoric information whilst retaining the performance benefits of the original method.
The USRCAT extension is shown to outperform the traditional USR method in a retrospective virtual screening benchmark. Also, a relational database implementation is described that is capable of screening a million conformers in milliseconds and allows the inclusion of complex query parameters.
USRCAT provides a solution to the lack of atom type information in the USR algorithm. Researchers, particularly those with only limited resources, who wish to use ligand-based virtual screening in order to discover new hits, will benefit the most. Online chemical databases that offer a shape-based similarity method might also find advantage in using USRCAT due to its accuracy and performance. The source code is freely available and can easily be modified to fit specific needs.
基于配体的虚拟筛选使用分子形状是希望在化合物库中寻找新的化学支架的研究人员的重要工具。超快形状识别(USR)算法能够筛选数百万种化合物,因此非常适合在网络服务中使用。然而,该算法对原子类型是不可知的,无法区分具有相似形状但具有不同药效特征的化合物。为了解决这个问题,开发了一种称为 USRCAT 的 USR 扩展,它保留了原始方法的性能优势,同时包含药效特征信息。
USRCAT 扩展在回顾性虚拟筛选基准测试中被证明优于传统的 USR 方法。此外,还描述了一种关系数据库实现,它能够在毫秒内筛选一百万种构象,并允许包含复杂的查询参数。
USRCAT 为 USR 算法中缺乏原子类型信息提供了解决方案。希望使用基于配体的虚拟筛选来发现新的命中的研究人员,特别是那些资源有限的研究人员,将从中受益最多。提供基于形状的相似性方法的在线化学数据库也可能因其准确性和性能而发现使用 USRCAT 的优势。源代码是免费提供的,可以轻松修改以满足特定需求。