Center for Professional Studies, Lahore, Pakistan.
Department of Computer Science, University of Management and Technology, Lahore, Pakistan.
Curr Drug Discov Technol. 2021;18(4):463-472. doi: 10.2174/1570163817666200806165934.
Machine learning is an active area of research in computer science by the availability of big data collection of all sorts prompting interest in the development of novel tools for data mining. Machine learning methods have wide applications in computer-aided drug discovery methods. Most incredible approaches to machine learning are used in drug designing, which further aid the process of biological modelling in drug discovery. Mainly, two main categories are present which are Ligand-Based Virtual Screening (LBVS) and Structure-Based Virtual Screening (SBVS), however, the machine learning approaches fall mostly in the category of LBVS.
This study exposits the major machine learning approaches being used in LBVS. Moreover, we have introduced a protocol named FP-CADD which depicts a 4-steps rule of thumb for drug discovery, the four protocols of computer-aided drug discovery (FP-CADD). Various important aspects along with SWOT analysis of FP-CADD are also discussed in this article.
By this thorough study, we have observed that in LBVS algorithms, Support Vector Machines (SVM) and Random Forest (RF) are those which are widely used due to high accuracy and efficiency. These virtual screening approaches have the potential to revolutionize the drug designing field. Also, we believe that the process flow presented in this study, named FP-CADD, can streamline the whole process of computer-aided drug discovery. By adopting this rule, the studies related to drug discovery can be made homogeneous and this protocol can also be considered as an evaluation criterion in the peer-review process of research articles.
机器学习是计算机科学中一个活跃的研究领域,由于各种大数据的收集,人们对开发新的数据挖掘工具产生了兴趣。机器学习方法在计算机辅助药物发现方法中有广泛的应用。最令人难以置信的机器学习方法被用于药物设计,这进一步辅助了药物发现中的生物建模过程。主要有两种主要类别,配体基虚拟筛选(LBVS)和基于结构的虚拟筛选(SBVS),然而,机器学习方法大多属于 LBVS 类别。
本研究阐述了 LBVS 中使用的主要机器学习方法。此外,我们引入了一种名为 FP-CADD 的方案,描述了药物发现的 4 步经验法则,即计算机辅助药物发现的 4 个协议(FP-CADD)。本文还讨论了 FP-CADD 的各种重要方面以及 SWOT 分析。
通过这项深入研究,我们观察到在 LBVS 算法中,支持向量机(SVM)和随机森林(RF)由于准确性和效率高而被广泛使用。这些虚拟筛选方法有可能彻底改变药物设计领域。此外,我们相信,本研究提出的名为 FP-CADD 的流程可以简化计算机辅助药物发现的整个过程。通过采用这种规则,可以使与药物发现相关的研究变得均匀,并且该方案也可以被视为研究文章同行评审过程中的评估标准。