Syahdi Rezi Riadhi, Jasial Swarit, Maeda Itsuki, Miyao Tomoyuki
Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan.
Data Science Center, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan.
ACS Omega. 2024 Sep 3;9(37):38957-38969. doi: 10.1021/acsomega.4c05433. eCollection 2024 Sep 17.
Ligand-based virtual screening (LBVS) and structure-based virtual screening (SBVS), and their combinations, are frequently conducted in modern drug discovery campaigns. As a form of combination, an amalgamation of methods from ligand- and structure-based information, termed hybrid VS approaches, has been extensively investigated such as using interaction fingerprints (IFPs) in combination with machine learning (ML) models. This approach has the potential to prioritize active compounds in terms of protein-ligand binding and ligand structural characteristics, which is assumed to be difficult using either one of the approaches. Herein, we present an IFP, named the fragmented interaction fingerprint (FIFI), for hybrid VS approaches. FIFI is constructed from the extended connectivity fingerprint atom environments of a ligand proximal to the protein residues in the binding site. Each unique ligand substructure within each amino acid residue is encoded as a bit in FIFI while retaining sequence order. From the retrospective evaluation of activity prediction using a limited number and variety of active compounds for six biological targets, FIFI consistently showed higher prediction accuracy than that using previously proposed IFPs. For the same data sets, the screening performance of LBVS, SBVS sequential VS, parallel VS, and other hybrid VS approaches was investigated. Compared to these approaches, FIFI in combination with ML showed overall stable and high prediction accuracy, except for one target: the kappa opioid receptor, where the extended connectivity fingerprint combined with ML models showed better performance than other approaches by wide margins.
基于配体的虚拟筛选(LBVS)和基于结构的虚拟筛选(SBVS)及其组合,在现代药物发现活动中经常进行。作为一种组合形式,将基于配体和基于结构的信息的方法进行融合,即所谓的混合VS方法,已经得到了广泛研究,例如将相互作用指纹(IFP)与机器学习(ML)模型结合使用。这种方法有可能根据蛋白质-配体结合和配体结构特征对活性化合物进行优先级排序,而使用任何一种方法都被认为难以做到这一点。在此,我们提出了一种用于混合VS方法的IFP,名为片段化相互作用指纹(FIFI)。FIFI是由结合位点中与蛋白质残基相邻的配体的扩展连接性指纹原子环境构建而成。每个氨基酸残基内的每个独特配体子结构在FIFI中被编码为一位,同时保留序列顺序。通过对六个生物靶点使用数量有限且种类多样的活性化合物进行活性预测的回顾性评估,FIFI始终显示出比使用先前提出的IFP更高的预测准确性。对于相同的数据集,研究了LBVS、SBVS顺序VS、并行VS和其他混合VS方法的筛选性能。与这些方法相比,FIFI与ML结合显示出总体稳定且较高的预测准确性,但有一个靶点除外:κ阿片受体,在该靶点上,扩展连接性指纹与ML模型结合显示出比其他方法好得多的性能。