Institute for Research in Biomedicine (IRB Barcelona). The Barcelona Institute of Science and Technology, Barcelona, Spain.
Department of Biochemistry and Biomedicine. University of Barcelona, Barcelona, Spain.
Nucleic Acids Res. 2024 Jan 5;52(D1):D393-D403. doi: 10.1093/nar/gkad991.
Molecular dynamics (MD) simulations are keeping computers busy around the world, generating a huge amount of data that is typically not open to the scientific community. Pioneering efforts to ensure the safety and reusability of MD data have been based on the use of simple databases providing a limited set of standard analyses on single-short trajectories. Despite their value, these databases do not offer a true solution for the current community of MD users, who want a flexible analysis pipeline and the possibility to address huge non-Markovian ensembles of large systems. Here we present a new paradigm for MD databases, resilient to large systems and long trajectories, and designed to be compatible with modern MD simulations. The data are offered to the community through a web-based graphical user interface (GUI), implemented with state-of-the-art technology, which incorporates system-specific analysis designed by the trajectory providers. A REST API and associated Jupyter Notebooks are integrated into the platform, allowing fully customized meta-analysis by final users. The new technology is illustrated using a collection of trajectories obtained by the community in the context of the effort to fight the COVID-19 pandemic. The server is accessible at https://bioexcel-cv19.bsc.es/#/. It is free and open to all users and there are no login requirements. It is also integrated into the simulations section of the BioExcel-MolSSI COVID-19 Molecular Structure and Therapeutics Hub: https://covid.molssi.org/simulations/ and is part of the MDDB effort (https://mddbr.eu).
分子动力学(MD)模拟让全球的计算机都忙个不停,产生了大量通常对科学界不开放的数据。确保 MD 数据的安全性和可重复性的开创性努力是基于使用简单的数据库,对单短轨迹提供有限的标准分析。尽管这些数据库具有一定的价值,但它们并不能为 MD 用户提供真正的解决方案,因为用户希望有一个灵活的分析管道,并有可能处理大型非马尔可夫系统的庞大非马尔可夫集合。在这里,我们提出了一种新的 MD 数据库范例,该范例能够抵抗大型系统和长轨迹,并且与现代 MD 模拟兼容。通过基于网络的图形用户界面(GUI)向社区提供数据,该界面采用最先进的技术实现,其中包含由轨迹提供者设计的特定于系统的分析。该平台集成了一个 REST API 和相关的 Jupyter Notebook,允许最终用户进行完全自定义的元分析。该新技术使用社区在 COVID-19 大流行背景下努力获取的一系列轨迹进行说明。该服务器可在 https://bioexcel-cv19.bsc.es/#/ 访问。它对所有用户免费开放,无需登录。它还集成到了 BioExcel-MolSSI COVID-19 分子结构和治疗学中心的模拟部分:https://covid.molssi.org/simulations/,并且是 MDDB 工作的一部分(https://mddbr.eu)。