• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

优化蛋白质结构字母序列的方法。

Methods for optimizing the structure alphabet sequences of proteins.

作者信息

Dong Qi-wen, Wang Xiao-long, Lin Lei

机构信息

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.

出版信息

Comput Biol Med. 2007 Nov;37(11):1610-6. doi: 10.1016/j.compbiomed.2007.03.002. Epub 2007 May 10.

DOI:10.1016/j.compbiomed.2007.03.002
PMID:17493604
Abstract

Protein structure prediction based on fragment assemble has made great progress in recent years. Local protein structure prediction is receiving increased attention. One essential step of local protein structure prediction method is that the three-dimensional conformations must be compressed into one-dimensional series of letters of a structural alphabet. The traditional method assigns each structure fragment the structure alphabet that has the best local structure similarity. However, such locally optimal structure alphabet sequence does not guarantee to produce the globally optimal structure. This study presents two efficient methods trying to find the optimal structure alphabet sequence, which can model the native structures as accuracy as possible. First, a 28-letter structure alphabet is derived by clustering fragment in Cartesian space with fragment length of seven residues. The average quantization error of the 28 letters is 0.82 A in term of root mean square deviation. Then, two efficient methods are presented to encode the protein structures into series of structure alphabet letters, that is, the greedy and dynamic programming algorithm. They are tested on PDB database using the structure alphabet developed in Cartesian coordinates space (our structure alphabet) and in torsion angles space (the PB structure alphabet), respectively. The experimental results show that these two methods can find the approximately optimal structure alphabet sequences by searching a small fraction of the modeling space. The traditional local-optimization method achieves 26.27 A root mean square deviations between the reconstructed structures and the native one, while the modeling accuracy is improved to 3.28 A by the greedy algorithm. The results are helpful for local protein structure prediction.

摘要

近年来,基于片段组装的蛋白质结构预测取得了很大进展。局部蛋白质结构预测越来越受到关注。局部蛋白质结构预测方法的一个关键步骤是将三维构象压缩成结构字母表的一维字母序列。传统方法为每个结构片段分配具有最佳局部结构相似性的结构字母。然而,这种局部最优的结构字母序列并不能保证产生全局最优结构。本研究提出了两种有效的方法来寻找最优的结构字母序列,以尽可能准确地模拟天然结构。首先,通过在笛卡尔空间中对长度为七个残基的片段进行聚类,得到了一个28字母的结构字母表。就均方根偏差而言,这28个字母的平均量化误差为0.82埃。然后,提出了两种有效的方法将蛋白质结构编码为结构字母序列,即贪心算法和动态规划算法。分别使用在笛卡尔坐标空间(我们的结构字母表)和扭转角空间(PB结构字母表)中开发的结构字母表在PDB数据库上对它们进行了测试。实验结果表明,这两种方法可以通过搜索一小部分建模空间找到近似最优的结构字母序列。传统的局部优化方法在重建结构和天然结构之间的均方根偏差为26.27埃,而贪心算法将建模精度提高到了3.28埃。这些结果有助于局部蛋白质结构预测。

相似文献

1
Methods for optimizing the structure alphabet sequences of proteins.优化蛋白质结构字母序列的方法。
Comput Biol Med. 2007 Nov;37(11):1610-6. doi: 10.1016/j.compbiomed.2007.03.002. Epub 2007 May 10.
2
A structural alphabet for local protein structures: improved prediction methods.一种用于局部蛋白质结构的结构字母表:改进的预测方法。
Proteins. 2005 Jun 1;59(4):810-27. doi: 10.1002/prot.20458.
3
Combining evolutionary and structural information for local protein structure prediction.结合进化和结构信息进行局部蛋白质结构预测。
Proteins. 2004 Sep 1;56(4):782-94. doi: 10.1002/prot.20158.
4
Prediction of protein local structures and folding fragments based on building-block library.基于构建模块库的蛋白质局部结构和折叠片段预测
Proteins. 2008 Jul;72(1):353-66. doi: 10.1002/prot.21931.
5
Fragment-based local statistical potentials derived by combining an alphabet of protein local structures with secondary structures and solvent accessibilities.通过将蛋白质局部结构字母表与二级结构和溶剂可及性相结合得出的基于片段的局部统计势。
Proteins. 2009 Mar;74(4):820-36. doi: 10.1002/prot.22191.
6
Protein structure mining using a structural alphabet.使用结构字母表进行蛋白质结构挖掘。
Proteins. 2008 May 1;71(2):920-37. doi: 10.1002/prot.21776.
7
Analysis and prediction of protein local structure based on structure alphabets.基于结构字母表的蛋白质局部结构分析与预测。
Proteins. 2008 Jul;72(1):163-72. doi: 10.1002/prot.21904.
8
A hidden markov model derived structural alphabet for proteins.一种用于蛋白质的隐马尔可夫模型衍生结构字母表。
J Mol Biol. 2004 Jun 4;339(3):591-605. doi: 10.1016/j.jmb.2004.04.005.
9
Protein structure prediction based on sequence similarity.基于序列相似性的蛋白质结构预测。
Methods Mol Biol. 2009;569:129-56. doi: 10.1007/978-1-59745-524-4_7.
10
Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space.原子水平蛋白质设计的准确预测及其在扩展近最优序列空间中的应用。
Proteins. 2009 May 15;75(3):682-705. doi: 10.1002/prot.22280.

引用本文的文献

1
PYTHIA: Deep Learning Approach for Local Protein Conformation Prediction.PYTHIA:用于局部蛋白质构象预测的深度学习方法。
Int J Mol Sci. 2021 Aug 17;22(16):8831. doi: 10.3390/ijms22168831.
2
SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information.SAFlex:一种结构字母扩展,用于整合蛋白质结构的灵活性和缺失数据信息。
PLoS One. 2018 Jul 5;13(7):e0198854. doi: 10.1371/journal.pone.0198854. eCollection 2018.
3
PBxplore: a tool to analyze local protein structure and deformability with Protein Blocks.
PBxplore:一种利用蛋白质模块分析局部蛋白质结构和可变形性的工具。
PeerJ. 2017 Nov 20;5:e4013. doi: 10.7717/peerj.4013. eCollection 2017.
4
Characterization and Prediction of Protein Flexibility Based on Structural Alphabets.基于结构字母表的蛋白质柔性表征与预测
Biomed Res Int. 2016;2016:4628025. doi: 10.1155/2016/4628025. Epub 2016 Aug 30.
5
A short survey on protein blocks.关于蛋白质模块的简短调查。
Biophys Rev. 2010 Aug;2(3):137-147. doi: 10.1007/s12551-010-0036-1. Epub 2010 Aug 5.
6
A new prediction strategy for long local protein structures using an original description.一种使用原始描述的长局部蛋白质结构的新预测策略。
Proteins. 2009 Aug 15;76(3):570-87. doi: 10.1002/prot.22370.
7
Protein sequence and structure alignments within one framework.在一个框架内进行蛋白质序列和结构比对。
Algorithms Mol Biol. 2008 Apr 1;3:4. doi: 10.1186/1748-7188-3-4.