Suppr超能文献

基于自由形态和血中形态数据集的分子描述符预测血脑屏障渗透性(BBBP)。

Prediction of Blood-Brain Barrier Penetration (BBBP) Based on Molecular Descriptors of the Free-Form and In-Blood-Form Datasets.

机构信息

Department of Science, Faculty of Science, Yamagata University, 1-4-12 Kojirakawa, Yamagata 990-8560, Japan.

出版信息

Molecules. 2021 Dec 7;26(24):7428. doi: 10.3390/molecules26247428.

Abstract

The blood-brain barrier (BBB) controls the entry of chemicals from the blood to the brain. Since brain drugs need to penetrate the BBB, rapid and reliable prediction of BBB penetration (BBBP) is helpful for drug development. In this study, free-form and in-blood-form datasets were prepared by modifying the original BBBP dataset, and the effects of the data modification were investigated. For each dataset, molecular descriptors were generated and used for BBBP prediction by machine learning (ML). For ML, the dataset was split into training, validation, and test data by the scaffold split algorithm MoleculeNet used. This creates an unbalanced split and makes the prediction difficult; however, we decided to use that algorithm to evaluate the predictive performance for unknown compounds dissimilar to existing ones. The highest prediction score was obtained by the random forest model using 212 descriptors from the free-form dataset, and this score was higher than the existing best score using the same split algorithm without using any external database. Furthermore, using a deep neural network, a comparable result was obtained with only 11 descriptors from the free-form dataset, and the resulting descriptors suggested the importance of recognizing the glucose-like characteristics in BBBP prediction.

摘要

血脑屏障 (BBB) 控制着血液中的化学物质进入大脑。由于脑药物需要穿透 BBB,因此快速可靠地预测 BBB 穿透 (BBBP) 有助于药物开发。在这项研究中,通过修改原始 BBBP 数据集来准备自由格式和血液形式的数据集,并研究了数据修改的效果。对于每个数据集,生成分子描述符,并通过机器学习 (ML) 用于 BBBP 预测。对于 ML,数据集通过使用 MoleculeNet 的支架拆分算法拆分为训练、验证和测试数据。这会造成不平衡的拆分,使预测变得困难;但是,我们决定使用该算法来评估对与现有化合物不相似的未知化合物的预测性能。使用自由格式数据集的 212 个描述符的随机森林模型获得了最高的预测分数,并且该分数高于使用相同拆分算法但不使用任何外部数据库的现有最佳分数。此外,使用深度神经网络,仅从自由格式数据集使用 11 个描述符即可获得可比的结果,并且得到的描述符表明在 BBBP 预测中识别葡萄糖样特征的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2616/8708321/221b7b22ed55/molecules-26-07428-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验