Research Center, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia; Research Chair in Healthcare Innovation, Information Systems Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia.
College of Computer and Information Sciences, Information Technology Department, King Saud University, Riyadh, Saudi Arabia.
Comput Biol Chem. 2020 Dec;89:107377. doi: 10.1016/j.compbiolchem.2020.107377. Epub 2020 Sep 24.
The rapid development of computational methods and the increasing volume of chemical and biological data have contributed to an immense growth in chemical research. This field of study is known as "chemoinformatics," which is a discipline that uses machine-learning techniques to extract, process, and extrapolate data from chemical structures. One of the significant lines of research in chemoinformatics is the study of blood-brain barrier (BBB) permeability, which aims to identify drug penetration into the central nervous system (CNS). In this research, we attempt to solve the problem of BBB permeability by predicting compounds penetration to the CNS. To accomplish this goal: (i) First, an overview is provided to the field of chemoinformatics, its definition, applications, and challenges, (ii) Second, a broad view is taken to investigate previous machine-learning and deep-learning computational models to solve BBB permeability. Based on the analysis of previous models, three main challenges that collectively affect the classifier performance are identified, which we define as "the triple constraints"; subsequently, we map each constraint to a proposed solution, (iii) Finally, we conclude this endeavor by proposing a deep learning based Recurrent Neural Network model, to predict BBB permeability (RNN-BBB model). Our model outperformed other studies from the literature by scoring an overall accuracy of 96.53%, and a specificity score of 98.08%. The obtained results confirm that addressing the triple constraints substantially improves the classification model capability specifically when predicting compounds with low penetration.
计算方法的快速发展和化学与生物数据量的不断增加,推动了化学研究的巨大发展。这个研究领域被称为“化学信息学”,它是一门利用机器学习技术从化学结构中提取、处理和推断数据的学科。化学信息学的一个重要研究方向是研究血脑屏障(BBB)通透性,旨在确定药物穿透中枢神经系统(CNS)的能力。在这项研究中,我们试图通过预测化合物穿透中枢神经系统的能力来解决 BBB 通透性的问题。为了实现这一目标:(i)首先,对化学信息学领域进行了概述,介绍了它的定义、应用和挑战;(ii)其次,广泛研究了以前用于解决 BBB 通透性的机器学习和深度学习计算模型。基于对以前模型的分析,确定了三个共同影响分类器性能的主要挑战,我们将其定义为“三重约束”;随后,我们将每个约束映射到一个提出的解决方案;(iii)最后,我们通过提出一个基于深度学习的循环神经网络模型来预测 BBB 通透性(RNN-BBB 模型)来结束这项研究。我们的模型通过获得总体准确率为 96.53%和特异性分数为 98.08%的成绩,优于文献中的其他研究。结果证实,解决三重约束问题可以显著提高分类模型的能力,特别是在预测穿透能力较低的化合物时。