Data visualization during the early stages of drug discovery.

Author information

Maniyar Dharmesh M, Nabney Ian T, Williams Bruce S, Sewing Andreas

Affiliations

Neural Computing Research Group, Information Engineering, Aston University, Birmingham, B4 7ET, United Kingdom.

Publication information

J Chem Inf Model. 2006 Jul-Aug;46(4):1806-18. doi: 10.1021/ci050471a.

DOI: 10.1021/ci050471a

PMID: 16859312

Abstract

Multidimensional compound optimization is a new paradigm in the drug discovery process, yielding efficiencies during early stages and reducing attrition in the later stages of drug development. The success of this strategy relies heavily on understanding this multidimensional data and extracting useful information from it. This paper demonstrates how principled visualization algorithms can be used to understand and explore a large data set created in the early stages of drug discovery. The experiments presented are performed on a real-world data set comprising biological activity data and some whole-molecular physicochemical properties. Data visualization is a popular way of presenting complex data in a simpler form. We have applied powerful principled visualization methods, such as generative topographic mapping (GTM) and hierarchical GTM (HGTM), to help the domain experts (screening scientists, chemists, biologists, etc.) understand and draw meaningful decisions. We also benchmark these principled methods against relatively better known visualization approaches, principal component analysis (PCA), Sammon's mapping, and self-organizing maps (SOMs), to demonstrate their enhanced power to help the user visualize the large multidimensional data sets one has to deal with during the early stages of the drug discovery process. The results reported clearly show that the GTM and HGTM algorithms allow the user to cluster active compounds for different targets and understand them better than the benchmarks. An interactive software tool supporting these visualization algorithms was provided to the domain experts. The tool facilitates the domain experts by exploration of the projection obtained from the visualization algorithms providing facilities such as parallel coordinate plots, magnification factors, directional curvatures, and integration with industry standard software.
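As a rough illustration of the kind of linear projection the abstract uses as a benchmark, the following is a minimal sketch of PCA via singular value decomposition. It is not the authors' implementation or their software tool. The function name `pca_project` is hypothetical, and the data are randomly generated stand-ins for compound descriptors, not values from the paper.

```python
import numpy as np


def pca_project(X, n_components=2):
    """Project the rows of X onto the top principal components.

    Hypothetical helper for illustration only.
    """
    # Center each column so the components describe variance, not the mean.
    X_centered = X - X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by singular value.
    _, _, Vt = np.linalg.svd(X_centered, full_matrices=False)
    return X_centered @ Vt[:n_components].T


# Synthetic data (an assumption): 200 "compounds" with 5 numeric descriptors.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
X[:100] += 3.0  # shift half of the rows to mimic two groups

# Z holds 2-D coordinates that could be drawn as a scatter plot.
Z = pca_project(X)
```

A nonlinear method such as GTM would replace the linear projection step with a latent-variable model fitted to the data; that is not shown here.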


Similar articles

Data visualization during the early stages of drug discovery.

J Chem Inf Model. 2006 Jul-Aug;46(4):1806-18. doi: 10.1021/ci050471a.

Generative topographic mapping applied to clustering and visualization of motor unit action potentials.

Biosystems. 2005 Dec;82(3):273-84. doi: 10.1016/j.biosystems.2005.09.004. Epub 2005 Oct 19.

Visualization of large-scale aqueous solubility data using a novel hierarchical data visualization technique.

J Chem Inf Model. 2006 May-Jun;46(3):1054-9. doi: 10.1021/ci0504770.

InfVis--platform-independent visual data mining of multidimensional chemical data sets.

J Chem Inf Model. 2005 Sep-Oct;45(5):1456-67. doi: 10.1021/ci050202k.

Visualization of molecular fingerprints.

J Chem Inf Model. 2011 Jul 25;51(7):1552-63. doi: 10.1021/ci1004042. Epub 2011 Jul 8.

CareVis: integrated visualization of computerized protocols and temporal patient data.

Artif Intell Med. 2006 Jul;37(3):203-18. doi: 10.1016/j.artmed.2006.04.002.

Assessing the predictive power of unsupervised visualization techniques to improve the identification of GPCR-focused compound libraries.

J Chem Inf Model. 2006 Jul-Aug;46(4):1580-7. doi: 10.1021/ci060037o.

Molecular Property eXplorer: a novel approach to visualizing SAR using tree-maps and heatmaps.

J Chem Inf Model. 2005 Mar-Apr;45(2):523-32. doi: 10.1021/ci0496954.

Improving cluster visualization in self-organizing maps: application in gene expression data analysis.

Comput Biol Med. 2007 Dec;37(12):1677-89. doi: 10.1016/j.compbiomed.2007.04.003. Epub 2007 Jun 4.

Computational mapping tools for drug discovery.

Drug Discov Today. 2009 Aug;14(15-16):767-75. doi: 10.1016/j.drudis.2009.05.016. Epub 2009 Jun 9.

Cited by

Leveraging Artificial Intelligence for Synergies in Drug Discovery: From Computers to Clinics.

Curr Pharm Des. 2024;30(28):2187-2205. doi: 10.2174/0113816128308066240529121148.

Natural product drug discovery in the artificial intelligence era.

Chem Sci. 2021 Dec 13;13(6):1526-1546. doi: 10.1039/d1sc04471k. eCollection 2022 Feb 9.

Discovery of novel chemical reactions by deep generative recurrent neural network.

Sci Rep. 2021 Feb 4;11(1):3178. doi: 10.1038/s41598-021-81889-y.

Scaffold Hunter: a comprehensive visual analytics framework for drug discovery.

J Cheminform. 2017 May 11;9(1):28. doi: 10.1186/s13321-017-0213-3.

Predictive cartography of metal binders using generative topographic mapping.

J Comput Aided Mol Des. 2017 Aug;31(8):701-714. doi: 10.1007/s10822-017-0033-6. Epub 2017 Jul 7.

Supervised extensions of chemography approaches: case studies of chemical liabilities assessment.

J Cheminform. 2014 May 7;6:20. doi: 10.1186/1758-2946-6-20. eCollection 2014.

Impact of distance-based metric learning on classification and visualization model performance and structure-activity landscapes.

J Comput Aided Mol Des. 2014 Feb;28(2):61-73. doi: 10.1007/s10822-014-9719-1. Epub 2014 Feb 4.

Chemical space: missing pieces in cheminformatics.

Pharm Res. 2010 Oct;27(10):2035-9. doi: 10.1007/s11095-010-0229-0. Epub 2010 Aug 4.

In silico pharmacology for drug discovery: methods for virtual ligand screening and profiling.

Br J Pharmacol. 2007 Sep;152(1):9-20. doi: 10.1038/sj.bjp.0707305. Epub 2007 Jun 4.
