Jia Hengjian, Sosso Gabriele C
Department of Chemistry, University of Warwick, Coventry CV1 1DT, U.K.
J Chem Inf Model. 2024 Dec 9;64(23):8718-8728. doi: 10.1021/acs.jcim.4c01217. Epub 2024 Nov 18.
The blood-brain barrier (BBB) selectively regulates the passage of chemical compounds into and out of the central nervous system (CNS). As such, understanding the permeability of drug molecules through the BBB is key to treating neurological diseases and evaluating the response of the CNS to medical treatments. Within the last two decades, a diverse portfolio of machine learning (ML) models have been regularly utilized as a tool to predict, and, to a much lesser extent, understand, several functional properties of medicinal drugs, including their propensity to pass through the BBB. However, the most numerically accurate models to date lack in transparency, as they typically rely on complex blends of different descriptors (or features or fingerprints), many of which are not necessarily interpretable in a straightforward fashion. In fact, the "black-box" nature of these models has prevented us from pinpointing any specific design rule to craft the next generation of pharmaceuticals that need to pass (or not) through the BBB. In this work, we have developed a ML model that leverages an uncomplicated, transparent set of descriptors to predict the permeability of drug molecules through the BBB. In addition to its simplicity, our model achieves comparable results in terms of accuracy compared to state-of-the-art models. Moreover, we use a naive Bayes model as an analytical tool to provide further insights into the structure-function relation that underpins the capacity of a given drug molecule to pass through the BBB. Although our results are computational rather than experimental, we have identified several molecular fragments and functional groups that may significantly impact a drug's likelihood of permeating the BBB. This work provides a unique angle to the BBB problem and lays the foundations for future work aimed at leveraging additional transparent descriptors, potentially obtained via bespoke molecular dynamics simulations.
血脑屏障(BBB)选择性地调节化合物进出中枢神经系统(CNS)的过程。因此,了解药物分子透过血脑屏障的通透性是治疗神经疾病以及评估中枢神经系统对医学治疗反应的关键。在过去二十年中,各种各样的机器学习(ML)模型经常被用作预测工具,在较小程度上也用于理解药物的几种功能特性,包括其透过血脑屏障的倾向。然而,迄今为止数值上最精确的模型缺乏透明度,因为它们通常依赖于不同描述符(或特征或指纹)的复杂组合,其中许多不一定能以直接的方式进行解释。事实上,这些模型的“黑箱”性质使我们无法确定任何特定的设计规则来研发下一代需要透过(或不透过)血脑屏障的药物。在这项工作中,我们开发了一种机器学习模型,该模型利用一组简单、透明的描述符来预测药物分子透过血脑屏障的通透性。除了简单性之外,我们的模型在准确性方面与最先进的模型取得了相当的结果。此外,我们使用朴素贝叶斯模型作为分析工具,以进一步深入了解支撑给定药物分子透过血脑屏障能力的结构 - 功能关系。尽管我们的结果是通过计算而非实验得出的,但我们已经确定了几个可能会显著影响药物透过血脑屏障可能性的分子片段和官能团。这项工作为血脑屏障问题提供了一个独特的视角,并为未来旨在利用可能通过定制分子动力学模拟获得的额外透明描述符的工作奠定了基础。