Suppr超能文献

CoLiDe:用于探测蛋白质序列空间的组合文库设计工具。

CoLiDe: Combinatorial Library Design tool for probing protein sequence space.

机构信息

Department of Cell Biology, Faculty of Science, Charles University, Biocev, Prague, Czech Republic.

Department of Biochemistry, Faculty of Science, Charles University, 128 00 Prague 2, Czech Republic.

出版信息

Bioinformatics. 2021 May 1;37(4):482-489. doi: 10.1093/bioinformatics/btaa804.

Abstract

MOTIVATION

Current techniques of protein engineering focus mostly on re-designing small targeted regions or defined structural scaffolds rather than constructing combinatorial libraries of versatile compositions and lengths. This is a missed opportunity because combinatorial libraries are emerging as a vital source of novel functional proteins and are of interest in diverse research areas.

RESULTS

Here, we present a computational tool for Combinatorial Library Design (CoLiDe) offering precise control over protein sequence composition, length and diversity. The algorithm uses evolutionary approach to provide solutions to combinatorial libraries of degenerate DNA templates. We demonstrate its performance and precision using four different input alphabet distribution on different sequence lengths. In addition, a model design and experimental pipeline for protein library expression and purification is presented, providing a proof-of-concept that our protocol can be used to prepare purified protein library samples of up to 1011-1012 unique sequences. CoLiDe presents a composition-centric approach to protein design towards different functional phenomena.

AVAILABILITYAND IMPLEMENTATION

CoLiDe is implemented in Python and freely available at https://github.com/voracva1/CoLiDe.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

目前的蛋白质工程技术主要集中在重新设计小的靶向区域或定义的结构支架上,而不是构建组合文库,这些文库具有多种组成和长度。这是一个错失的机会,因为组合文库正在成为新型功能蛋白的重要来源,并且在不同的研究领域都有兴趣。

结果

在这里,我们提出了一种用于组合文库设计(CoLiDe)的计算工具,它可以精确控制蛋白质序列的组成、长度和多样性。该算法使用进化方法为简并 DNA 模板的组合文库提供解决方案。我们使用不同的序列长度和 4 种不同的输入字母分布来演示其性能和精度。此外,还提出了一种用于蛋白质文库表达和纯化的模型设计和实验方案,证明了我们的方案可以用于制备多达 1011-1012 个独特序列的纯化蛋白质文库样品。CoLiDe 提出了一种以组合为中心的蛋白质设计方法,用于不同的功能现象。

可用性和实现

CoLiDe 是用 Python 实现的,可以在 https://github.com/voracva1/CoLiDe 上免费获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b536/8088326/ef37e6e43db6/btaa804f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验