Charoenkwan Phasit, Schaduangrat Nalini, Lio' Pietro, Moni Mohammad Ali, Shoombuatong Watshara, Manavalan Balachandran
Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai 50200, Thailand.
Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
iScience. 2022 Aug 5;25(9):104883. doi: 10.1016/j.isci.2022.104883. eCollection 2022 Sep 16.
Discovery of potential drugs requires rapid and precise identification of drug targets. Although traditional experimental methodologies can accurately identify drug targets, they are time-consuming and inappropriate for high-throughput screening. Computational approaches based on machine learning (ML) algorithms can expedite the prediction of druggable proteins; however, the performance of the existing computational methods remains unsatisfactory. This study proposes a computational tool, SPIDER, to enhance the accurate prediction of druggable proteins. SPIDER employs various feature descriptors pertaining to several aspects, including physicochemical properties, compositional information, and composition-transition-distribution information, coupled with well-known ML algorithms to facilitate the construction of the final meta-predictor. The experimental results showed that SPIDER enabled more precise and robust prediction of druggable proteins than the baseline models and current existing methods in terms of the independent test dataset. An online web server was established and made freely available online.
发现潜在药物需要快速、精确地识别药物靶点。尽管传统实验方法能够准确识别药物靶点,但它们耗时且不适用于高通量筛选。基于机器学习(ML)算法的计算方法可以加快可成药蛋白的预测;然而,现有计算方法的性能仍不尽人意。本研究提出了一种计算工具SPIDER,以增强对可成药蛋白的准确预测。SPIDER采用了与多个方面相关的各种特征描述符,包括物理化学性质、组成信息以及组成-转变-分布信息,并结合著名的ML算法来构建最终的元预测器。实验结果表明,在独立测试数据集方面,与基线模型和现有方法相比,SPIDER能够对可成药蛋白进行更精确、更稳健的预测。此外,还建立了一个在线网络服务器并免费在线提供。