Suppr超能文献

利用机器学习进展进行药物发现和分子生物学中的数据整合

Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.

作者信息

Hudson Irene Lena

机构信息

Mathematical Sciences, School of Science, RMIT University, Melbourne, VIC, Australia.

出版信息

Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7.

Abstract

While the term artificial intelligence and the concept of deep learning are not new, recent advances in high-performance computing, the availability of large annotated data sets required for training, and novel frameworks for implementing deep neural networks have led to an unprecedented acceleration of the field of molecular (network) biology and pharmacogenomics. The need to align biological data to innovative machine learning has stimulated developments in both data integration (fusion) and knowledge representation, in the form of heterogeneous, multiplex, and biological networks or graphs. In this chapter we briefly introduce several popular neural network architectures used in deep learning, namely, the fully connected deep neural network, recurrent neural network, convolutional neural network, and the autoencoder. Deep learning predictors, classifiers, and generators utilized in modern feature extraction may well assist interpretability and thus imbue AI tools with increased explication, potentially adding insights and advancements in novel chemistry and biology discovery.The capability of learning representations from structures directly without using any predefined structure descriptor is an important feature distinguishing deep learning from other machine learning methods and makes the traditional feature selection and reduction procedures unnecessary. In this chapter we briefly show how these technologies are applied for data integration (fusion) and analysis in drug discovery research covering these areas: (1) application of convolutional neural networks to predict ligand-protein interactions; (2) application of deep learning in compound property and activity prediction; (3) de novo design through deep learning. We also: (1) discuss some aspects of future development of deep learning in drug discovery/chemistry; (2) provide references to published information; (3) provide recently advocated recommendations on using artificial intelligence and deep learning in -omics research and drug discovery.

摘要

虽然人工智能这个术语和深度学习的概念并不新鲜,但高性能计算的最新进展、训练所需的大量带注释数据集的可用性以及用于实现深度神经网络的新颖框架,已导致分子(网络)生物学和药物基因组学领域前所未有的加速发展。将生物数据与创新的机器学习相结合的需求,刺激了数据集成(融合)和知识表示方面的发展,其形式为异构、多路复用和生物网络或图形。在本章中,我们简要介绍深度学习中使用的几种流行神经网络架构,即全连接深度神经网络、循环神经网络、卷积神经网络和自动编码器。现代特征提取中使用的深度学习预测器、分类器和生成器很可能有助于可解释性,从而使人工智能工具具有更强的解释力,有可能为新化学和生物学发现增添见解和进展。直接从结构中学习表示而不使用任何预定义结构描述符的能力,是深度学习区别于其他机器学习方法的一个重要特征,使得传统的特征选择和简化程序不再必要。在本章中,我们简要展示这些技术如何应用于药物发现研究中的数据集成(融合)和分析,涵盖以下领域:(1)应用卷积神经网络预测配体 - 蛋白质相互作用;(2)深度学习在化合物性质和活性预测中的应用;(3)通过深度学习进行从头设计。我们还:(1)讨论深度学习在药物发现/化学领域未来发展的一些方面;(2)提供已发表信息的参考文献;(3)提供最近在 -omics 研究和药物发现中使用人工智能和深度学习的倡导建议。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验