Suppr超能文献

利用广泛且经过整理的数据集实现更好的血脑屏障通透性预测

Towards Better BBB Passage Prediction Using an Extensive and Curated Data Set.

作者信息

Brito-Sánchez Yoan, Marrero-Ponce Yovani, Barigye Stephen J, Yaber-Goenaga Iván, Morell Pérez Carlos, Le-Thi-Thu Huong, Cherkasov Artem

机构信息

Vancouver Prostate Centre, University of British Columbia, Vancouver, British Columbia, V6H 3Z6, Canada.

Unit of Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research, International Network (CAMD-BIR International Network), Los Laureles L76MD, Nuevo Bosque, 130015, Cartagena de Indias, Bolivar, Colombia. Homepage: http://www.uv.es/yoma/ Homepage: http://sites.google.com/site/ymponce/home.

出版信息

Mol Inform. 2015 May;34(5):308-30. doi: 10.1002/minf.201400118. Epub 2015 May 7.

Abstract

In the present report, the challenging task of drug delivery across the blood-brain barrier (BBB) is addressed via a computational approach. The BBB passage was modeled using classification and regression schemes on a novel extensive and curated data set (the largest to the best of our knowledge) in terms of log BB. Prior to the model development, steps of data analysis that comprise chemical data curation, structural, cutoff and cluster analysis (CA) were conducted. Linear Discriminant Analysis (LDA) and Multiple Linear Regression (MLR) were used to fit classification and correlation functions. The best LDA-based model showed overall accuracies over 85 % and 83 % for the training and test sets, respectively. Also a MLR-based model with acceptable explanation of more than 69 % of the variance in the experimental log BB was developed. A brief and general interpretation of proposed models allowed the estimation on how 'near' our computational approach is to the factors that determine the passage of molecules through the BBB. In a final effort some popular and powerful Machine Learning methods were considered. Comparable or similar performance was observed respect to the simpler linear techniques. Most of the compounds with anomalous behavior were put aside into a set denoted as controversial set and discussion regarding to these compounds is provided. Finally, our results were compared with methodologies previously reported in the literature showing comparable to better results. The results could represent useful tools available and reproducible by all scientific community in the early stages of neuropharmaceutical drug discovery/development projects.

摘要

在本报告中,通过计算方法解决了跨越血脑屏障(BBB)进行药物递送这一具有挑战性的任务。使用分类和回归方案,在一个关于log BB的新颖且广泛整理的数据集(据我们所知是最大的数据集)上对BBB通透情况进行建模。在模型开发之前,进行了包括化学数据整理、结构分析、截断分析和聚类分析(CA)在内的数据分析步骤。使用线性判别分析(LDA)和多元线性回归(MLR)来拟合分类和相关函数。基于LDA的最佳模型在训练集和测试集上的总体准确率分别超过85%和83%。还开发了一个基于MLR的模型,该模型对实验log BB中超过69%的方差具有可接受的解释度。对所提出模型的简要通用解释使得能够估计我们的计算方法与决定分子通过BBB的因素有多“接近”。最后考虑了一些流行且强大的机器学习方法。与更简单的线性技术相比,观察到了可比或相似的性能。大多数具有异常行为的化合物被归入一个称为有争议集的集合,并对这些化合物进行了讨论。最后,将我们的结果与文献中先前报道的方法进行了比较,结果显示与之相当或更好。这些结果可能代表了所有科学界在神经药物发现/开发项目早期阶段可获得且可重复使用的有用工具。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验