Suppr超能文献

一种深度图网络增强的采样方法,用于高效探索蛋白质简化表示空间。

A Deep Graph Network-Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins.

作者信息

Errica Federico, Giulini Marco, Bacciu Davide, Menichetti Roberto, Micheli Alessio, Potestio Raffaello

机构信息

Department of Computer Science, University of Pisa, Pisa, Italy.

Physics Department, University of Trento, Trento, Italy.

出版信息

Front Mol Biosci. 2021 Apr 29;8:637396. doi: 10.3389/fmolb.2021.637396. eCollection 2021.

Abstract

The limits of molecular dynamics (MD) simulations of macromolecules are steadily pushed forward by the relentless development of computer architectures and algorithms. The consequent explosion in the number and extent of MD trajectories induces the need for automated methods to rationalize the raw data and make quantitative sense of them. Recently, an algorithmic approach was introduced by some of us to identify the subset of a protein's atoms, or mapping, that enables the most informative description of the system. This method relies on the computation, for a given reduced representation, of the associated mapping entropy, that is, a measure of the information loss due to such simplification; albeit relatively straightforward, this calculation can be time-consuming. Here, we describe the implementation of a deep learning approach aimed at accelerating the calculation of the mapping entropy. We rely on Deep Graph Networks, which provide extreme flexibility in handling structured input data and whose predictions prove to be accurate and-remarkably efficient. The trained network produces a speedup factor as large as 10 with respect to the algorithmic computation of the mapping entropy, enabling the reconstruction of its landscape by means of the Wang-Landau sampling scheme. Applications of this method reach much further than this, as the proposed pipeline is easily transferable to the computation of arbitrary properties of a molecular structure.

摘要

计算机架构和算法的不断发展稳步推动了大分子分子动力学(MD)模拟的极限。MD轨迹数量和范围的激增引发了对自动化方法的需求,以便对原始数据进行合理化处理并使其具有定量意义。最近,我们中的一些人引入了一种算法方法,用于识别蛋白质原子的子集或映射,从而能够对系统进行最具信息性的描述。该方法依赖于针对给定简化表示计算相关映射熵,即由于这种简化导致的信息损失的度量;尽管相对简单,但这种计算可能很耗时。在这里,我们描述了一种旨在加速映射熵计算的深度学习方法的实现。我们依赖于深度图网络,它在处理结构化输入数据方面具有极大的灵活性,并且其预测被证明是准确且非常高效的。相对于映射熵的算法计算,训练后的网络产生的加速因子高达10,从而能够通过Wang-Landau采样方案重建其景观。该方法的应用远不止于此,因为所提出的管道很容易转移到分子结构任意属性的计算中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932d/8116519/896dad914cb4/fmolb-08-637396-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验