• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

协同合作:分布式协作社区中成功的科学软件开发要素。

Better together: Elements of successful scientific software development in a distributed collaborative community.

机构信息

Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America.

Dept of Biology, New York University, New York, NY, United States of America.

出版信息

PLoS Comput Biol. 2020 May 4;16(5):e1007507. doi: 10.1371/journal.pcbi.1007507. eCollection 2020 May.

DOI:10.1371/journal.pcbi.1007507
PMID:32365137
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7197760/
Abstract

Many scientific disciplines rely on computational methods for data analysis, model generation, and prediction. Implementing these methods is often accomplished by researchers with domain expertise but without formal training in software engineering or computer science. This arrangement has led to underappreciation of sustainability and maintainability of scientific software tools developed in academic environments. Some software tools have avoided this fate, including the scientific library Rosetta. We use this software and its community as a case study to show how modern software development can be accomplished successfully, irrespective of subject area. Rosetta is one of the largest software suites for macromolecular modeling, with 3.1 million lines of code and many state-of-the-art applications. Since the mid 1990s, the software has been developed collaboratively by the RosettaCommons, a community of academics from over 60 institutions worldwide with diverse backgrounds including chemistry, biology, physiology, physics, engineering, mathematics, and computer science. Developing this software suite has provided us with more than two decades of experience in how to effectively develop advanced scientific software in a global community with hundreds of contributors. Here we illustrate the functioning of this development community by addressing technical aspects (like version control, testing, and maintenance), community-building strategies, diversity efforts, software dissemination, and user support. We demonstrate how modern computational research can thrive in a distributed collaborative community. The practices described here are independent of subject area and can be readily adopted by other software development communities.

摘要

许多科学学科都依赖于计算方法来进行数据分析、模型生成和预测。这些方法的实现通常由具有领域专业知识但没有软件工程或计算机科学正式培训的研究人员来完成。这种安排导致人们对学术环境中开发的科学软件工具的可持续性和可维护性认识不足。一些软件工具避免了这种命运,包括科学库 Rosetta。我们使用这个软件及其社区作为案例研究,展示了即使在不同的学科领域,现代软件开发也可以成功完成。 Rosetta 是用于大分子建模的最大软件套件之一,拥有 310 万行代码和许多最先进的应用程序。自 20 世纪 90 年代中期以来,该软件一直由 RosettaCommons 共同开发,这是一个由来自全球 60 多个机构的学者组成的社区,他们的背景包括化学、生物学、生理学、物理学、工程学、数学和计算机科学。开发这个软件套件为我们提供了二十多年的经验,了解如何在拥有数百名贡献者的全球社区中有效地开发先进的科学软件。在这里,我们通过解决技术方面(如版本控制、测试和维护)、社区建设策略、多样性工作、软件传播和用户支持来展示这个开发社区的运作方式。我们展示了现代计算研究如何在分布式协作社区中蓬勃发展。这里描述的实践独立于学科领域,可以被其他软件开发社区轻易采用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/868140388191/pcbi.1007507.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/9360c1379363/pcbi.1007507.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/99bd3ee89c9e/pcbi.1007507.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/7fd23cd9651c/pcbi.1007507.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/2f0b1c74542a/pcbi.1007507.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/5d07b56eecbb/pcbi.1007507.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/868140388191/pcbi.1007507.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/9360c1379363/pcbi.1007507.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/99bd3ee89c9e/pcbi.1007507.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/7fd23cd9651c/pcbi.1007507.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/2f0b1c74542a/pcbi.1007507.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/5d07b56eecbb/pcbi.1007507.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69f2/7197760/868140388191/pcbi.1007507.g006.jpg

相似文献

1
Better together: Elements of successful scientific software development in a distributed collaborative community.协同合作:分布式协作社区中成功的科学软件开发要素。
PLoS Comput Biol. 2020 May 4;16(5):e1007507. doi: 10.1371/journal.pcbi.1007507. eCollection 2020 May.
2
Integration of the Rosetta suite with the python software stack via reproducible packaging and core programming interfaces for distributed simulation.通过可重现的打包和核心编程接口将 Rosetta 套件与 Python 软件堆栈集成,用于分布式模拟。
Protein Sci. 2020 Jan;29(1):43-51. doi: 10.1002/pro.3721. Epub 2019 Dec 2.
3
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
4
The PyRosetta Toolkit: a graphical user interface for the Rosetta software suite.PyRosetta 工具包:用于 Rosetta 软件套件的图形用户界面。
PLoS One. 2013 Jul 9;8(7):e66856. doi: 10.1371/journal.pone.0066856. Print 2013.
5
Serverification of molecular modeling applications: the Rosetta Online Server that Includes Everyone (ROSIE).分子建模应用的服务器化:包含每个人的罗塞塔在线服务器(ROSIE)。
PLoS One. 2013 May 22;8(5):e63906. doi: 10.1371/journal.pone.0063906. Print 2013.
6
A call for bioimaging software usability.呼吁关注生物成像软件的可用性。
Nat Methods. 2012 Jun 28;9(7):666-70. doi: 10.1038/nmeth.2073.
7
Proceedings of the Second Workshop on Theory meets Industry (Erwin-Schrödinger-Institute (ESI), Vienna, Austria, 12-14 June 2007).第二届理论与产业研讨会会议录(2007年6月12日至14日,奥地利维也纳埃尔温·薛定谔研究所)
J Phys Condens Matter. 2008 Feb 13;20(6):060301. doi: 10.1088/0953-8984/20/06/060301. Epub 2008 Jan 24.
8
Practical Computational Reproducibility in the Life Sciences.生命科学中的实用计算可重复性。
Cell Syst. 2018 Jun 27;6(6):631-635. doi: 10.1016/j.cels.2018.03.014.
9
Comparative case study of two biomedical research collaboratories.两个生物医学研究合作实验室的比较案例研究
J Med Internet Res. 2005 Oct 25;7(5):e53. doi: 10.2196/jmir.7.5.e53.
10
Workflow based framework for life science informatics.基于工作流的生命科学信息学框架。
Comput Biol Chem. 2007 Oct;31(5-6):305-19. doi: 10.1016/j.compbiolchem.2007.08.009. Epub 2007 Aug 19.

引用本文的文献

1
Solid-Liquid Interfacial Free Energy from Computer Simulations: Challenges and Recent Advances.计算机模拟中的固-液界面自由能:挑战与最新进展
Chem Rev. 2025 May 28;125(10):5003-5053. doi: 10.1021/acs.chemrev.4c00833. Epub 2025 May 11.
2
Local structural flexibility drives oligomorphism in computationally designed protein assemblies.局部结构灵活性驱动计算设计的蛋白质组装体中的多态性。
Nat Struct Mol Biol. 2025 Feb 26. doi: 10.1038/s41594-025-01490-z.
3
Artificial Intelligence Transforming Post-Translational Modification Research.

本文引用的文献

1
How to support open-source software and stay sane.如何支持开源软件并保持理智。
Nature. 2019 Jul;571(7763):133-134. doi: 10.1038/d41586-019-02046-0.
2
Chemical shift-based methods in NMR structure determination.基于化学位移的 NMR 结构测定方法。
Prog Nucl Magn Reson Spectrosc. 2018 Jun-Aug;106-107:1-25. doi: 10.1016/j.pnmrs.2018.03.002. Epub 2018 Mar 11.
3
Creating custom Foldit puzzles for teaching biochemistry.创建用于生物化学教学的定制化Foldit谜题。
人工智能正在改变翻译后修饰研究。
Bioengineering (Basel). 2024 Dec 31;12(1):26. doi: 10.3390/bioengineering12010026.
4
Teaching Research Data Management with DataLad: A Multi-year, Multi-domain Effort.使用DataLad进行教学研究数据管理:一项多年、多领域的工作。
Neuroinformatics. 2024 Oct;22(4):635-645. doi: 10.1007/s12021-024-09665-7. Epub 2024 May 7.
5
Leveraging Artificial Intelligence to Expedite Antibody Design and Enhance Antibody-Antigen Interactions.利用人工智能加速抗体设计并增强抗体-抗原相互作用。
Bioengineering (Basel). 2024 Feb 15;11(2):185. doi: 10.3390/bioengineering11020185.
6
Computational Methods in Immunology and Vaccinology: Design and Development of Antibodies and Immunogens.免疫与疫苗学的计算方法:抗体和免疫原的设计与开发。
J Chem Theory Comput. 2023 Aug 22;19(16):5315-5333. doi: 10.1021/acs.jctc.3c00513. Epub 2023 Aug 1.
7
Advances in the Computational Design of Small-Molecule-Controlled Protein-Based Circuits for Synthetic Biology.用于合成生物学的小分子控制的基于蛋白质的电路的计算设计进展。
Proc IEEE Inst Electr Electron Eng. 2022 May;110(5):659-674. doi: 10.1109/JPROC.2022.3157898. Epub 2022 Apr 8.
8
SP-A binding to the SARS-CoV-2 spike protein using hybrid quantum and classical in silico modeling and molecular pruning by Quantum Approximate Optimization Algorithm (QAOA) Based MaxCut with ZDOCK.使用混合量子和经典的计算建模,通过基于 QAOA 的最大割与 ZDOCK 的分子修剪,对 SP-A 与 SARS-CoV-2 刺突蛋白的结合进行模拟。
Front Immunol. 2022 Sep 13;13:945317. doi: 10.3389/fimmu.2022.945317. eCollection 2022.
9
Insight into Viral Hijacking of CRL4 Ubiquitin Ligase through Structural Analysis of the pUL145-DDB1 Complex.通过 pUL145-DDB1 复合物的结构分析深入了解病毒对 CRL4 泛素连接酶的劫持。
J Virol. 2022 Sep 14;96(17):e0082622. doi: 10.1128/jvi.00826-22. Epub 2022 Aug 8.
10
Protein Design: From the Aspect of Water Solubility and Stability.蛋白质设计:从水溶性和稳定性方面考虑。
Chem Rev. 2022 Sep 28;122(18):14085-14179. doi: 10.1021/acs.chemrev.1c00757. Epub 2022 Aug 3.
Biochem Mol Biol Educ. 2019 Mar;47(2):133-139. doi: 10.1002/bmb.21208. Epub 2019 Jan 14.
4
CS-ROSETTA.CS-罗塞塔
Methods Enzymol. 2019;614:321-362. doi: 10.1016/bs.mie.2018.07.005. Epub 2018 Sep 11.
5
Reproducibility of research: Issues and proposed remedies.研究的可重复性:问题与建议的补救措施。
Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2561-2562. doi: 10.1073/pnas.1802324115. Epub 2018 Mar 12.
6
Comprehensive computational design of ordered peptide macrocycles.有序肽大环的综合计算设计
Science. 2017 Dec 15;358(6369):1461-1466. doi: 10.1126/science.aap7577.
7
A cyber-linked undergraduate research experience in computational biomolecular structure prediction and design.一次关于计算生物分子结构预测与设计的网络连接式本科研究经历。
PLoS Comput Biol. 2017 Dec 7;13(12):e1005837. doi: 10.1371/journal.pcbi.1005837. eCollection 2017 Dec.
8
Foldit Standalone: a video game-derived protein structure manipulation interface using Rosetta.Foldit单机版:一个使用Rosetta的源自电子游戏的蛋白质结构操纵界面。
Bioinformatics. 2017 Sep 1;33(17):2765-2767. doi: 10.1093/bioinformatics/btx283.
9
The Rosetta All-Atom Energy Function for Macromolecular Modeling and Design.用于大分子建模与设计的罗塞塔全原子能量函数。
J Chem Theory Comput. 2017 Jun 13;13(6):3031-3048. doi: 10.1021/acs.jctc.7b00125. Epub 2017 May 12.
10
Problem-Solving Phase Transitions During Team Collaboration.团队协作过程中的问题解决相变
Cogn Sci. 2018 Jan;42(1):129-167. doi: 10.1111/cogs.12482. Epub 2017 Feb 18.