Suppr超能文献

通过可重现的打包和核心编程接口将 Rosetta 套件与 Python 软件堆栈集成,用于分布式模拟。

Integration of the Rosetta suite with the python software stack via reproducible packaging and core programming interfaces for distributed simulation.

机构信息

Institute for Protein Innovation, Boston, Massachusetts.

Institute for Protein Design, University of Washington, Seattle, Washington.

出版信息

Protein Sci. 2020 Jan;29(1):43-51. doi: 10.1002/pro.3721. Epub 2019 Dec 2.

Abstract

The Rosetta software suite for macromolecular modeling is a powerful computational toolbox for protein design, structure prediction, and protein structure analysis. The development of novel Rosetta-based scientific tools requires two orthogonal skill sets: deep domain-specific expertise in protein biochemistry and technical expertise in development, deployment, and analysis of molecular simulations. Furthermore, the computational demands of molecular simulation necessitate large scale cluster-based or distributed solutions for nearly all scientifically relevant tasks. To reduce the technical barriers to entry for new development, we integrated Rosetta with modern, widely adopted computational infrastructure. This allows simplified deployment in large-scale cluster and cloud computing environments, and effective reuse of common libraries for simulation execution and data analysis. To achieve this, we integrated Rosetta with the Conda package manager; this simplifies installation into existing computational environments and packaging as docker images for cloud deployment. Then, we developed programming interfaces to integrate Rosetta with the PyData stack for analysis and distributed computing, including the popular tools Jupyter, Pandas, and Dask. We demonstrate the utility of these components by generating a library of a thousand de novo disulfide-rich miniproteins in a hybrid simulation that included cluster-based design and interactive notebook-based analyses. Our new tools enable users, who would otherwise not have access to the necessary computational infrastructure, to perform state-of-the-art molecular simulation and design with Rosetta.

摘要

用于大分子建模的 Rosetta 软件套件是蛋白质设计、结构预测和蛋白质结构分析的强大计算工具包。开发新的基于 Rosetta 的科学工具需要两种正交的技能集:蛋白质生物化学的深入领域专业知识和分子模拟的开发、部署和分析技术专长。此外,分子模拟的计算需求几乎需要所有科学相关任务的大规模集群或分布式解决方案。为了降低新开发的技术障碍,我们将 Rosetta 与现代、广泛采用的计算基础设施集成在一起。这允许在大规模集群和云计算环境中简化部署,并有效地重复使用用于模拟执行和数据分析的通用库。为此,我们将 Rosetta 与 Conda 包管理器集成在一起;这简化了在现有计算环境中的安装,并将其打包为用于云部署的 docker 映像。然后,我们开发了编程接口,将 Rosetta 与用于分析和分布式计算的 PyData 堆栈集成在一起,包括流行的工具 Jupyter、Pandas 和 Dask。我们通过在包括基于集群的设计和交互式笔记本分析的混合模拟中生成一千个新的富含二硫键的迷你蛋白库来展示这些组件的实用性。我们的新工具使没有必要计算基础设施的用户能够使用 Rosetta 进行最先进的分子模拟和设计。

相似文献

3
Real-time PyMOL visualization for Rosetta and PyRosetta.用于 Rosetta 和 PyRosetta 的实时 PyMOL 可视化。
PLoS One. 2011;6(8):e21931. doi: 10.1371/journal.pone.0021931. Epub 2011 Aug 16.
7
Google-Accelerated Biomolecular Simulations.谷歌加速生物分子模拟
Methods Mol Biol. 2019;2022:291-309. doi: 10.1007/978-1-4939-9608-7_12.

引用本文的文献

2
De novo design of immunoglobulin-like domains.从头设计免疫球蛋白样结构域。
Nat Commun. 2022 Oct 3;13(1):5661. doi: 10.1038/s41467-022-33004-6.

本文引用的文献

7
Accurate de novo design of hyperstable constrained peptides.超稳定约束肽的精确从头设计。
Nature. 2016 Oct 20;538(7625):329-335. doi: 10.1038/nature19791. Epub 2016 Sep 14.
8
Protocols for Molecular Modeling with Rosetta3 and RosettaScripts.使用Rosetta3和RosettaScripts进行分子建模的协议。
Biochemistry. 2016 Aug 30;55(34):4748-63. doi: 10.1021/acs.biochem.6b00444. Epub 2016 Aug 16.
9
3Dmol.js: molecular visualization with WebGL.3Dmol.js:使用WebGL进行分子可视化。
Bioinformatics. 2015 Apr 15;31(8):1322-4. doi: 10.1093/bioinformatics/btu829. Epub 2014 Dec 12.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验