Suppr超能文献

mdCATH:一个用于数据驱动计算生物物理学的大规模分子动力学数据集。

mdCATH: A Large-Scale MD Dataset for Data-Driven Computational Biophysics.

作者信息

Mirarchi Antonio, Giorgino Toni, Fabritiis Gianni De

机构信息

Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, Barcelona, 08003, Spain.

Biophysics Institute, National Research Council (CNR-IBF), Via Celoria 26, Milan, 20133, Italy.

出版信息

ArXiv. 2024 Dec 3:arXiv:2407.14794v2.

Abstract

Recent advancements in protein structure determination are revolutionizing our understanding of proteins. Still, a significant gap remains in the availability of comprehensive datasets that focus on the dynamics of proteins, which are crucial for understanding protein function, folding, and interactions. To address this critical gap, we introduce mdCATH, a dataset generated through an extensive set of all-atom molecular dynamics simulations of a diverse and representative collection of protein domains. This dataset comprises all-atom systems for 5,398 domains, modeled with a state-of-the-art classical force field, and simulated in five replicates each at five temperatures from 320 K to 450 K. The mdCATH dataset records coordinates and forces every 1 ns, for over 62 ms of accumulated simulation time, effectively capturing the dynamics of the various classes of domains and providing a unique resource for proteome-wide statistical analyses of protein unfolding thermodynamics and kinetics. We outline the dataset structure and showcase its potential through four easily reproducible case studies, highlighting its capabilities in advancing protein science.

摘要

蛋白质结构测定的最新进展正在彻底改变我们对蛋白质的理解。然而,在专注于蛋白质动力学的全面数据集的可用性方面,仍然存在重大差距,而蛋白质动力学对于理解蛋白质功能、折叠和相互作用至关重要。为了弥补这一关键差距,我们引入了mdCATH数据集,该数据集是通过对各种具有代表性的蛋白质结构域进行广泛的全原子分子动力学模拟生成的。该数据集包含5398个结构域的全原子系统,采用最先进的经典力场进行建模,并在从320 K到450 K的五个温度下各进行五次重复模拟。mdCATH数据集每1 ns记录一次坐标和力,累积模拟时间超过62 ms,有效地捕捉了各类结构域的动力学,并为蛋白质组范围内蛋白质解折叠热力学和动力学的统计分析提供了独特的资源。我们概述了数据集结构,并通过四个易于重现的案例研究展示了其潜力,突出了其在推动蛋白质科学发展方面的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fce/11643217/9ef06f799e1a/nihpp-2407.14794v2-f0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验