Protabit LLC, 129 N. Hill Avenue, Suite 102, Pasadena, California, 91106.
Department of Biochemistry and Molecular Biology, and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, 16802.
Protein Sci. 2018 Jun;27(6):1113-1124. doi: 10.1002/pro.3406. Epub 2018 Apr 30.
We present ProtaBank, a repository for storing, querying, analyzing, and sharing protein design and engineering data in an actively maintained and updated database. ProtaBank provides a format to describe and compare all types of protein mutational data, spanning a wide range of properties and techniques. It features a user-friendly web interface and programming layer that streamlines data deposition and allows for batch input and queries. The database schema design incorporates a standard format for reporting protein sequences and experimental data that facilitates comparison of results across different data sets. A suite of analysis and visualization tools are provided to facilitate discovery, to guide future designs, and to benchmark and train new predictive tools and algorithms. ProtaBank will provide a valuable resource to the protein engineering community by storing and safeguarding newly generated data, allowing for fast searching and identification of relevant data from the existing literature, and exploring correlations between disparate data sets. ProtaBank invites researchers to contribute data to the database to make it accessible for search and analysis. ProtaBank is available at https://protabank.org.
我们介绍 ProtaBank,这是一个用于存储、查询、分析和共享蛋白质设计和工程数据的存储库,以一个积极维护和更新的数据库呈现。ProtaBank 提供了一种格式,用于描述和比较各种类型的蛋白质突变数据,涵盖了广泛的性质和技术。它具有用户友好的 Web 界面和编程层,简化了数据的存储,并允许批量输入和查询。数据库模式设计采用了一种标准格式来报告蛋白质序列和实验数据,便于在不同数据集之间比较结果。还提供了一套分析和可视化工具,以促进发现、指导未来的设计,并对新的预测工具和算法进行基准测试和培训。ProtaBank 将通过存储和保护新生成的数据,允许从现有文献中快速搜索和识别相关数据,并探索不同数据集之间的相关性,为蛋白质工程界提供宝贵的资源。ProtaBank 邀请研究人员向数据库贡献数据,以便于搜索和分析。ProtaBank 可在 https://protabank.org 上获得。