Liang Li, Liu Zhiwen, Yang Xinyi, Zhang Yanmin, Liu Haichun, Chen Yadong
Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198, China.
Mol Inform. 2024 Sep;43(9):e202300327. doi: 10.1002/minf.202300327. Epub 2024 Jun 12.
The assessment of compound blood-brain barrier (BBB) permeability poses a significant challenge in the discovery of drugs targeting the central nervous system. Conventional experimental approaches to measure BBB permeability are labor-intensive, cost-ineffective, and time-consuming. In this study, we constructed six machine learning classification models by combining various machine learning algorithms and molecular representations. The model based on ExtraTree algorithm and random partitioning strategy obtains the best prediction result, with AUC value of 0.932±0.004 and balanced accuracy (BA) of 0.837±0.010 for the test set. We employed the SHAP method to identify important features associated with BBB permeability. In addition, matched molecular pair (MMP) analysis and representative substructure derivation method were utilized to uncover the transformation rules and distinctive structural features of BBB permeable compounds. The machine learning models proposed in this work can serve as an effective tool for assessing BBB permeability in the drug discovery for central nervous system disease.
在发现靶向中枢神经系统的药物过程中,复合血脑屏障(BBB)通透性的评估是一项重大挑战。测量BBB通透性的传统实验方法 labor-intensive、成本低效且耗时。在本研究中,我们通过结合各种机器学习算法和分子表示构建了六个机器学习分类模型。基于ExtraTree算法和随机划分策略的模型获得了最佳预测结果,测试集的AUC值为0.932±0.004,平衡准确率(BA)为0.837±0.010。我们采用SHAP方法来识别与BBB通透性相关的重要特征。此外,利用匹配分子对(MMP)分析和代表性子结构推导方法来揭示BBB可渗透化合物的转化规则和独特结构特征。本工作中提出的机器学习模型可作为评估中枢神经系统疾病药物发现中BBB通透性的有效工具。 (注:原文中“labor-intensive”未翻译完整,可能是“劳动密集型”之类的意思,需结合完整语境准确理解。)