• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

穿线算法的多序列信息。

Multiple sequence information for threading algorithms.

作者信息

Defay T R, Cohen F E

机构信息

Graduate Group in Biophysics, University of California, San Francisco 94143-0450, USA.

出版信息

J Mol Biol. 1996 Sep 20;262(2):314-23. doi: 10.1006/jmbi.1996.0515.

DOI:10.1006/jmbi.1996.0515
PMID:8831796
Abstract

Threading algorithms attempt to solve the inverse protein folding problem: given a group of structures and a sequence, identify the structure that is most compatible with this sequence. A recent study of this class of algorithms by S. J. Wodak and colleagues suggests that while threading algorithms are capable of recognizing many folding motifs, their performance in truly blind predictions is disappointing, and the underlying alignments upon which the selections are based are frequently errant. To help overcome this problem we have developed a Test of Optimal Mutagenesis algorithm (TOM) that exploits information inherent in the variation between several homologues in a multiple sequence alignment. This information is used to help select the correct structural motif for the sequence from a database of known structures. A total of 305 high-resolution structures were selected to represent the set of known folds; 56 proteins were chosen that had at least one close structural match in this set. To test TOM, we attempted to determine which of the 305 folds was a match to each of the 56 protein sequences. TOM correctly predicts a close structural match for 45% of these proteins. THREADER, an algorithm chosen as a literature standard, correctly matched 20% of the test set. By comparing the performance of TOM, THREADER, and TOM NOVAR (a version of TOM without variability information), we conclude that the tendency of an amino acid to be buried or exposed is the dominant determinant of the success of threading algorithms. In addition, the structural alignments produced by TOM suggest that the exact alignment of just 30 to 50% of the residues in a sequence with the correct fold is necessary to select it as the highest scoring match in a set of folds.

摘要

穿线法算法试图解决蛋白质折叠逆问题

给定一组结构和一个序列,识别与该序列最匹配的结构。S. J. 沃达克及其同事最近对这类算法进行的一项研究表明,虽然穿线法算法能够识别许多折叠基序,但它们在真正的盲预测中的表现令人失望,而且作为选择基础的潜在比对常常是错误的。为了帮助克服这个问题,我们开发了一种最优诱变测试算法(TOM),该算法利用了多序列比对中几个同源物之间变异所固有的信息。此信息用于帮助从已知结构数据库中为该序列选择正确的结构基序。总共选择了305个高分辨率结构来代表已知折叠集;选择了56种在该集合中至少有一个紧密结构匹配的蛋白质。为了测试TOM,我们试图确定305种折叠中哪一种与56种蛋白质序列中的每一种匹配。TOM正确地为其中45%的蛋白质预测了紧密的结构匹配。THREADER是一种被选为文献标准的算法,它正确匹配了20%的测试集。通过比较TOM、THREADER和TOM NOVAR(一种没有变异信息的TOM版本)的性能,我们得出结论,氨基酸被埋藏或暴露的趋势是穿线法算法成功的主要决定因素。此外,TOM产生的结构比对表明,序列中仅30%至50%的残基与正确折叠的精确比对对于将其选为一组折叠中得分最高的匹配是必要的。

相似文献

1
Multiple sequence information for threading algorithms.穿线算法的多序列信息。
J Mol Biol. 1996 Sep 20;262(2):314-23. doi: 10.1006/jmbi.1996.0515.
2
Protein structure prediction by threading methods: evaluation of current techniques.基于穿线法的蛋白质结构预测:当前技术评估
Proteins. 1995 Nov;23(3):337-55. doi: 10.1002/prot.340230308.
3
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
4
Successful protein fold recognition by optimal sequence threading validated by rigorous blind testing.通过严格的盲测验证的最优序列穿线法实现成功的蛋白质折叠识别。
Proteins. 1995 Nov;23(3):387-97. doi: 10.1002/prot.340230312.
5
Protein threading using PROSPECT: design and evaluation.使用PROSPECT进行蛋白质穿线法:设计与评估
Proteins. 2000 Aug 15;40(3):343-54.
6
Protein fold recognition by mapping predicted secondary structures.通过映射预测的二级结构进行蛋白质折叠识别。
J Mol Biol. 1996 Jun 14;259(3):349-65. doi: 10.1006/jmbi.1996.0325.
7
Threading a database of protein cores.梳理蛋白质核心数据库。
Proteins. 1995 Nov;23(3):356-69. doi: 10.1002/prot.340230309.
8
CAALIGN: a program for pairwise and multiple protein-structure alignment.CAALIGN:一个用于蛋白质结构两两比对和多序列比对的程序。
Acta Crystallogr D Biol Crystallogr. 2007 Apr;63(Pt 4):514-25. doi: 10.1107/S0907444907000844. Epub 2007 Mar 16.
9
MUSTANG: a multiple structural alignment algorithm.MUSTANG:一种多重结构比对算法。
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
10
Filtered neighbors threading.过滤邻居线程化
Proteins. 1999 Nov 15;37(3):346-59.

引用本文的文献

1
Analysis of the "thermodynamic information content" of a Homo sapiens structural database reveals hierarchical thermodynamic organization.对人类结构数据库“热力学信息内容”的分析揭示了层次化的热力学组织。
Protein Sci. 2004 Jul;13(7):1787-801. doi: 10.1110/ps.04706204.
2
An approach to large scale identification of non-obvious structural similarities between proteins.一种大规模识别蛋白质之间非明显结构相似性的方法。
BMC Bioinformatics. 2004 May 17;5:61. doi: 10.1186/1471-2105-5-61.
3
Structural characterization of genomes by large scale sequence-structure threading.
通过大规模序列-结构穿线法对基因组进行结构表征。
BMC Bioinformatics. 2004 Apr 3;5:37. doi: 10.1186/1471-2105-5-37.
4
JEvTrace: refinement and variations of the evolutionary trace in JAVA.JEvTrace:JAVA 中进化追踪的细化与变体
Genome Biol. 2002;3(12):RESEARCH0077. doi: 10.1186/gb-2002-3-12-research0077. Epub 2002 Nov 26.
5
The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.定向原子溶剂化能:一种基于原子的将蛋白质序列分配到已知折叠结构的势能。
Proc Natl Acad Sci U S A. 2002 Dec 10;99(25):16041-6. doi: 10.1073/pnas.252626399. Epub 2002 Dec 2.
6
Feasibility in the inverse protein folding protocol.反向蛋白质折叠方案的可行性。
Protein Sci. 1999 May;8(5):1001-9. doi: 10.1110/ps.8.5.1001.
7
Protein fold recognition without Boltzmann statistics or explicit physical basis.无需玻尔兹曼统计或明确物理基础的蛋白质折叠识别。
Protein Sci. 1998 Jan;7(1):142-9. doi: 10.1002/pro.5560070115.