School of Life Science, Liaoning University, Shenyang 110036, China.
Research Center for Computer Simulating and Information Processing of Bio-macromolecules of Shenyang, Liaoning University, Shenyang 110036, China.
Chem Res Toxicol. 2021 Jun 21;34(6):1456-1467. doi: 10.1021/acs.chemrestox.0c00343. Epub 2021 May 28.
The ability of chemicals to enter the blood-brain barrier (BBB) is a key factor for central nervous system (CNS) drug development. Although many models for BBB permeability prediction have been developed, they have insufficient accuracy (ACC) and sensitivity (SEN). To improve performance, ensemble models were built to predict the BBB permeability of compounds. In this study, in silico ensemble-learning models were developed using 3 machine-learning algorithms and 9 molecular fingerprints from 1757 chemicals (integrated from 2 published data sets) to predict BBB permeability. The best prediction performance of the base classifier models was achieved by a prediction model based on an random forest (RF) and a MACCS molecular fingerprint with an ACC of 0.910, an area under the receiver-operating characteristic (ROC) curve (AUC) of 0.957, a SEN of 0.927, and a specificity of 0.867 in 5-fold cross-validation. The prediction performance of the ensemble models is better than that of most of the base classifiers. The final ensemble model has also demonstrated good accuracy for an external validation and can be used for the early screening of CNS drugs.
化学物质进入血脑屏障(BBB)的能力是中枢神经系统(CNS)药物开发的关键因素。尽管已经开发出许多用于预测 BBB 通透性的模型,但它们的准确性(ACC)和灵敏度(SEN)不足。为了提高性能,构建了集成模型来预测化合物的 BBB 通透性。在这项研究中,使用 3 种机器学习算法和 9 种分子指纹从 1757 种化合物(整合自 2 个已发表的数据集)中开发了基于计算机的集成学习模型,以预测 BBB 通透性。基于随机森林(RF)和 MACCS 分子指纹的预测模型的基本分类器模型取得了最佳预测性能,其 ACC 为 0.910,ROC 曲线(AUC)下面积为 0.957,SEN 为 0.927,特异性为 0.867在 5 折交叉验证中。集成模型的预测性能优于大多数基本分类器。最终的集成模型在外部验证中也表现出良好的准确性,可用于 CNS 药物的早期筛选。