Suppr超能文献

加权系综模拟中进展坐标的无监督学习:应用于NTL9蛋白质折叠

Unsupervised Learning of Progress Coordinates during Weighted Ensemble Simulations: Application to NTL9 Protein Folding.

作者信息

Leung Jeremy M G, Frazee Nicolas C, Brace Alexander, Bogetti Anthony T, Ramanathan Arvind, Chong Lillian T

机构信息

Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States.

Data Science and Learning Division, Argonne National Laboratory, Lemont, Illinois 60439, United States.

出版信息

J Chem Theory Comput. 2025 Apr 8;21(7):3691-3699. doi: 10.1021/acs.jctc.4c01136. Epub 2025 Mar 19.

Abstract

A major challenge for many rare-event sampling strategies is the identification of progress coordinates that capture the slowest relevant motions. Machine-learning methods that can identify progress coordinates in an unsupervised manner have therefore been of great interest to the simulation community. Here, we developed a general method for identifying progress coordinates "on-the-fly" during weighted ensemble (WE) rare-event sampling via deep learning (DL) of outliers among sampled conformations. Our method identifies outliers in a latent space model of the system's sampled conformations that is periodically trained using a convolutional variational autoencoder. As a proof of principle, we applied our DL-enhanced WE method to simulate the NTL9 protein folding process. To enable rapid tests, our simulations propagated discrete-state synthetic molecular dynamics trajectories using a generative, fine-grained Markov state model. Results revealed that our on-the-fly DL of outliers enhanced the efficiency of WE by >3-fold in estimating the folding rate constant. Our efforts are a significant step forward in the unsupervised learning of slow coordinates during rare event sampling.

摘要

对于许多稀有事件采样策略而言,一个主要挑战是识别能够捕捉最慢相关运动的进展坐标。因此,能够以无监督方式识别进展坐标的机器学习方法引起了模拟社区的极大兴趣。在此,我们开发了一种通用方法,通过对采样构象中的异常值进行深度学习(DL),在加权系综(WE)稀有事件采样过程中“即时”识别进展坐标。我们的方法在系统采样构象的潜在空间模型中识别异常值,该模型使用卷积变分自动编码器进行定期训练。作为原理验证,我们将深度学习增强的加权系综方法应用于模拟NTL9蛋白折叠过程。为了实现快速测试,我们的模拟使用生成式、细粒度马尔可夫状态模型传播离散状态合成分子动力学轨迹。结果表明,我们对异常值的即时深度学习在估计折叠速率常数方面将加权系综的效率提高了3倍以上。我们的工作在稀有事件采样过程中慢坐标的无监督学习方面向前迈出了重要一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199c/11983707/89ef1777a423/ct4c01136_0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验