使用卷积神经网络的活性景观图像分析

Activity landscape image analysis using convolutional neural networks.

作者信息

Iqbal Javed, Vogt Martin, Bajorath Jürgen

机构信息

Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Endenicher Allee 19c, 53115, Bonn, Germany.

出版信息

J Cheminform. 2020 May 18;12(1):34. doi: 10.1186/s13321-020-00436-5.

DOI:10.1186/s13321-020-00436-5

PMID:33431003

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7236149/

Abstract

Activity landscapes (ALs) are graphical representations that combine compound similarity and activity data. ALs are constructed for visualizing local and global structure-activity relationships (SARs) contained in compound data sets. Three-dimensional (3D) ALs are reminiscent of geographical maps where differences in landscape topology mirror different SAR characteristics. 3D AL models can be stored as differently formatted images and are thus amenable to image analysis approaches, which have thus far not been considered in the context of graphical SAR analysis. In this proof-of-concept study, 3D ALs were constructed for a variety of compound activity classes and 3D AL image variants of varying topology and information content were generated and classified. To these ends, convolutional neural networks (CNNs) were initially applied to images of original 3D AL models with color-coding reflecting compound potency information that were taken from different viewpoints. Images of 3D AL models were transformed into variants from which one-dimensional features were extracted. Other machine learning approaches including support vector machine (SVM) and random forest (RF) algorithms were applied to derive models on the basis of such features. In addition, SVM and RF models were trained using other features obtained from images through edge filtering. Machine learning was able to accurately distinguish between 3D AL image variants with different topology and information content. Overall, CNNs which directly learned feature representations from 3D AL images achieved highest classification accuracy. Predictive performance for CNN, SVM, and RF models was highest for image variants emphasizing topological elevation. In addition, SVM models trained on rudimentary images from edge filtering classified such images with high accuracy, which further supported the critical role of altitude-dependent topological features for image analysis and predictions. Taken together, the findings of our proof-of-concept investigation indicate that image analysis has considerable potential for graphical SAR exploration to systematically infer different SAR characteristics from topological features of 3D ALs.

摘要

活性景观图（ALs）是结合化合物相似性和活性数据的图形表示。构建活性景观图是为了可视化化合物数据集中包含的局部和全局结构-活性关系（SARs）。三维（3D）活性景观图让人联想到地理地图，其中景观拓扑结构的差异反映了不同的SAR特征。3D AL模型可以存储为不同格式的图像，因此适合采用图像分析方法，而在图形SAR分析的背景下，迄今为止尚未考虑过这些方法。在这项概念验证研究中，针对各种化合物活性类别构建了3D ALs，并生成和分类了具有不同拓扑结构和信息内容的3D AL图像变体。为此，卷积神经网络（CNNs）最初应用于原始3D AL模型的图像，这些图像通过颜色编码反映从不同视角获取的化合物效力信息。3D AL模型的图像被转换为变体，从中提取一维特征。其他机器学习方法，包括支持向量机（SVM）和随机森林（RF）算法，被应用于基于这些特征推导模型。此外，使用通过边缘滤波从图像中获得的其他特征对SVM和RF模型进行训练。机器学习能够准确区分具有不同拓扑结构和信息内容的3D AL图像变体。总体而言，直接从3D AL图像中学习特征表示的CNNs实现了最高的分类准确率。对于强调拓扑高程的图像变体，CNN、SVM和RF模型的预测性能最高。此外，在边缘滤波得到的基础图像上训练的SVM模型能够高精度地对这类图像进行分类，这进一步支持了高度依赖的拓扑特征在图像分析和预测中的关键作用。综上所述，我们的概念验证研究结果表明，图像分析在图形SAR探索方面具有巨大潜力，能够从3D ALs的拓扑特征系统地推断不同的SAR特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e8/7236149/69aba2896d10/13321_2020_436_Fig1_HTML.jpg

相似文献

Activity landscape image analysis using convolutional neural networks.

J Cheminform. 2020 May 18;12(1):34. doi: 10.1186/s13321-020-00436-5.

Computational Method for Quantitative Comparison of Activity Landscapes on the Basis of Image Data.

Molecules. 2020 Aug 29;25(17):3952. doi: 10.3390/molecules25173952.

Quantitative Comparison of Three-Dimensional Activity Landscapes of Compound Data Sets Based upon Topological Features.

ACS Omega. 2020 Sep 10;5(37):24111-24117. doi: 10.1021/acsomega.0c03659. eCollection 2020 Sep 22.

Automated Identification of Hookahs (Waterpipes) on Instagram: An Application in Feature Extraction Using Convolutional Neural Network and Support Vector Machine Classification.

J Med Internet Res. 2018 Nov 21;20(11):e10513. doi: 10.2196/10513.

Rationalizing three-dimensional activity landscapes and the influence of molecular representations on landscape topology and the formation of activity cliffs.

J Chem Inf Model. 2010 Jun 28;50(6):1021-33. doi: 10.1021/ci100091e.

Three-Dimensional Activity Landscape Models of Different Design and Their Application to Compound Mapping and Potency Prediction.

J Chem Inf Model. 2019 Mar 25;59(3):993-1004. doi: 10.1021/acs.jcim.8b00661. Epub 2018 Dec 12.

Co-trained convolutional neural networks for automated detection of prostate cancer in multi-parametric MRI.

Med Image Anal. 2017 Dec;42:212-227. doi: 10.1016/j.media.2017.08.006. Epub 2017 Aug 24.

Combining deep residual neural network features with supervised machine learning algorithms to classify diverse food image datasets.

Comput Biol Med. 2018 Apr 1;95:217-233. doi: 10.1016/j.compbiomed.2018.02.008. Epub 2018 Feb 17.

Deep learning approaches using 2D and 3D convolutional neural networks for generating male pelvic synthetic computed tomography from magnetic resonance imaging.

Med Phys. 2019 Sep;46(9):3788-3798. doi: 10.1002/mp.13672. Epub 2019 Jul 26.

Prediction of activity cliffs on the basis of images using convolutional neural networks.

J Comput Aided Mol Des. 2021 Dec;35(12):1157-1164. doi: 10.1007/s10822-021-00380-y. Epub 2021 Mar 19.

引用本文的文献

Exploring the Role of Chemoinformatics in Accelerating Drug Discovery: A Computational Approach.

Methods Mol Biol. 2024;2714:203-213. doi: 10.1007/978-1-0716-3441-7_12.

Prediction of activity cliffs on the basis of images using convolutional neural networks.

J Comput Aided Mol Des. 2021 Dec;35(12):1157-1164. doi: 10.1007/s10822-021-00380-y. Epub 2021 Mar 19.

From Big Data to Artificial Intelligence: chemoinformatics meets new challenges.

J Cheminform. 2020 Dec 18;12(1):74. doi: 10.1186/s13321-020-00475-y.

Recent progress on cheminformatics approaches to epigenetic drug discovery.

Drug Discov Today. 2020 Dec;25(12):2268-2276. doi: 10.1016/j.drudis.2020.09.021. Epub 2020 Sep 30.

Quantitative Comparison of Three-Dimensional Activity Landscapes of Compound Data Sets Based upon Topological Features.

ACS Omega. 2020 Sep 10;5(37):24111-24117. doi: 10.1021/acsomega.0c03659. eCollection 2020 Sep 22.

Computational Method for Quantitative Comparison of Activity Landscapes on the Basis of Image Data.

Molecules. 2020 Aug 29;25(17):3952. doi: 10.3390/molecules25173952.

本文引用的文献

Rationalizing the Formation of Activity Cliffs in Different Compound Data Sets.

ACS Omega. 2018 Jul 11;3(7):7736-7744. doi: 10.1021/acsomega.8b01188. eCollection 2018 Jul 31.

KekuleScope: prediction of cancer cell line sensitivity and compound potency using convolutional neural networks trained on compound images.

J Cheminform. 2019 Jun 19;11(1):41. doi: 10.1186/s13321-019-0364-5.

Accurate Prediction of Biological Assays with High-Throughput Microscopy Images and Convolutional Networks.

J Chem Inf Model. 2019 Mar 25;59(3):1163-1171. doi: 10.1021/acs.jcim.8b00670. Epub 2019 Mar 6.

Three-Dimensional Activity Landscape Models of Different Design and Their Application to Compound Mapping and Potency Prediction.

J Chem Inf Model. 2019 Mar 25;59(3):993-1004. doi: 10.1021/acs.jcim.8b00661. Epub 2018 Dec 12.

Machine learning and image-based profiling in drug discovery.

Curr Opin Syst Biol. 2018 Aug;10:43-52. doi: 10.1016/j.coisb.2018.05.004.

Toxic Colors: The Use of Deep Learning for Predicting Toxicity of Compounds Merely from Their Graphic Images.

J Chem Inf Model. 2018 Aug 27;58(8):1533-1543. doi: 10.1021/acs.jcim.8b00338. Epub 2018 Aug 15.

Repurposing High-Throughput Image Assays Enables Biological Activity Prediction for Drug Discovery.

Cell Chem Biol. 2018 May 17;25(5):611-618.e3. doi: 10.1016/j.chembiol.2018.01.015. Epub 2018 Mar 1.

Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals.

Comput Biol Med. 2018 Sep 1;100:270-278. doi: 10.1016/j.compbiomed.2017.09.017. Epub 2017 Sep 27.

Cell segmentation in histopathological images with deep learning algorithms by utilizing spatial relationships.

Med Biol Eng Comput. 2017 Oct;55(10):1829-1848. doi: 10.1007/s11517-017-1630-1. Epub 2017 Feb 28.

The ChEMBL database in 2017.

Nucleic Acids Res. 2017 Jan 4;45(D1):D945-D954. doi: 10.1093/nar/gkw1074. Epub 2016 Nov 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用卷积神经网络的活性景观图像分析

Activity landscape image analysis using convolutional neural networks.

作者信息

Iqbal Javed, Vogt Martin, Bajorath Jürgen

机构信息

Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Endenicher Allee 19c, 53115, Bonn, Germany.

出版信息

J Cheminform. 2020 May 18;12(1):34. doi: 10.1186/s13321-020-00436-5.

DOI:10.1186/s13321-020-00436-5

PMID:33431003

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7236149/

Abstract

摘要

使用卷积神经网络的活性景观图像分析

Activity landscape image analysis using convolutional neural networks.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献

使用卷积神经网络的活性景观图像分析

Activity landscape image analysis using convolutional neural networks.

作者信息

机构信息

出版信息