• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于模块化的并行蛋白质设计算法及其使用共享内存编程的实现。

Modularity-based parallel protein design algorithm with an implementation using shared memory programming.

作者信息

Pal Abantika, Mulumudy Rohith, Mitra Pralay

机构信息

Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India.

出版信息

Proteins. 2022 Mar;90(3):658-669. doi: 10.1002/prot.26263. Epub 2021 Oct 23.

DOI:10.1002/prot.26263
PMID:34651333
Abstract

Given a target protein structure, the prime objective of protein design is to find amino acid sequences that will fold/acquire to the given three-dimensional structure. The protein design problem belongs to the non-deterministic polynomial-time-hard class as sequence search space increases exponentially with protein length. To ensure better search space exploration and faster convergence, we propose a protein modularity-based parallel protein design algorithm. The modular architecture of the protein structure is exploited by considering an intermediate structural organization between secondary structure and domain defined as protein unit (PU). Here, we have incorporated a divide-and-conquer approach where a protein is split into PUs and each PU region is explored in a parallel fashion. It has been further analyzed that our shared memory implementation of modularity-based parallel sequence search leads to better search space exploration compared to the case of traditional full protein design. Sequence-based analysis on design sequences depicts an average of 39.7% sequence similarity on the benchmark data set. Structure-based comparison of the modeled structures of the design protein with the target structure exhibited an average root-mean-square deviation of 1.17 Å and an average template modeling score of 0.89. The selected modeled structures of the design protein sequences are validated using 100 ns molecular dynamics simulations where 80% of the proteins have shown better or similar stability to the respective target proteins. Our study informs that our modularity-based protein design algorithm can be extended to protein interaction design as well.

摘要

给定一个目标蛋白质结构,蛋白质设计的主要目标是找到能够折叠/获得给定三维结构的氨基酸序列。随着序列搜索空间随蛋白质长度呈指数增长,蛋白质设计问题属于非确定性多项式时间难问题。为了确保更好地探索搜索空间并实现更快的收敛,我们提出了一种基于蛋白质模块化的并行蛋白质设计算法。通过考虑二级结构和定义为蛋白质单元(PU)的结构域之间的中间结构组织,利用了蛋白质结构的模块化架构。在这里,我们采用了分治法,将蛋白质拆分为蛋白质单元,每个蛋白质单元区域以并行方式进行探索。进一步分析表明,与传统的全蛋白质设计相比,我们基于模块化的并行序列搜索的共享内存实现能够更好地探索搜索空间。对设计序列进行的基于序列的分析表明,在基准数据集上平均序列相似度为39.7%。设计蛋白质的建模结构与目标结构的基于结构的比较显示,平均均方根偏差为1.17 Å,平均模板建模分数为0.89。使用100 ns分子动力学模拟对设计蛋白质序列的选定建模结构进行了验证,其中80%的蛋白质对各自的目标蛋白质表现出更好或相似的稳定性。我们的研究表明,我们基于模块化的蛋白质设计算法也可以扩展到蛋白质相互作用设计。

相似文献

1
Modularity-based parallel protein design algorithm with an implementation using shared memory programming.基于模块化的并行蛋白质设计算法及其使用共享内存编程的实现。
Proteins. 2022 Mar;90(3):658-669. doi: 10.1002/prot.26263. Epub 2021 Oct 23.
2
An Evolutionary Profile Guided Greedy Parallel Replica-Exchange Monte Carlo Search Algorithm for Rapid Convergence in Protein Design.一种进化特征指导的贪婪并行副本交换蒙特卡罗搜索算法,用于蛋白质设计中的快速收敛。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):489-499. doi: 10.1109/TCBB.2019.2928809. Epub 2021 Apr 8.
3
A sequence space search engine for computational protein design to modulate molecular functionality.一种用于计算蛋白质设计的序列空间搜索引擎,用于调节分子功能。
J Biomol Struct Dyn. 2023 Apr;41(7):2937-2946. doi: 10.1080/07391102.2022.2042386. Epub 2022 Feb 26.
4
Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space.原子水平蛋白质设计的准确预测及其在扩展近最优序列空间中的应用。
Proteins. 2009 May 15;75(3):682-705. doi: 10.1002/prot.22280.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Parallel Computational Protein Design.并行计算蛋白质设计
Methods Mol Biol. 2017;1529:265-277. doi: 10.1007/978-1-4939-6637-0_13.
7
An ant colony optimisation algorithm for the 2D and 3D hydrophobic polar protein folding problem.一种用于二维和三维疏水极性蛋白质折叠问题的蚁群优化算法。
BMC Bioinformatics. 2005 Feb 14;6:30. doi: 10.1186/1471-2105-6-30.
8
BWM*: A Novel, Provable, Ensemble-based Dynamic Programming Algorithm for Sparse Approximations of Computational Protein Design.BWM*:一种用于计算蛋白质设计稀疏逼近的新型、可证明的、基于集成的动态规划算法。
J Comput Biol. 2016 Jun;23(6):413-24. doi: 10.1089/cmb.2015.0194. Epub 2016 Jan 8.
9
Protein-fold recognition using an improved single-source K diverse shortest paths algorithm.使用改进的单源K条不同最短路径算法进行蛋白质折叠识别。
Proteins. 2016 Apr;84(4):467-72. doi: 10.1002/prot.24993. Epub 2016 Feb 4.
10
Evaluating the accuracy of protein design using native secondary sub-structures.使用天然二级子结构评估蛋白质设计的准确性。
BMC Bioinformatics. 2016 Sep 5;17(1):353. doi: 10.1186/s12859-016-1199-y.

引用本文的文献

1
Insights into the recognition mechanism of shark-derived single-domain antibodies with high affinity and specificity targeting fluoroquinolones.对具有高亲和力和特异性靶向氟喹诺酮类的鲨鱼源单域抗体识别机制的见解。
Mar Life Sci Technol. 2025 Feb 13;7(2):340-351. doi: 10.1007/s42995-024-00277-3. eCollection 2025 May.