Identifying sequence-structure pairs undetected by sequence alignments.

Authors

Miyazawa S, Jernigan R L

Affiliations

Faculty of Technology, Gunma University, Kiryu, Gunma 376, Japan and Room B-116, Bldg 12B, MSC 5677, Laboratory of Experimental and Computational Biology, DBS, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5677, USA.

Publication information

Protein Eng. 2000 Jul;13(7):459-75. doi: 10.1093/protein/13.7.459.

DOI: 10.1093/protein/13.7.459
PMID: 10906342
Abstract

We examine how effectively simple potential functions previously developed can identify compatibilities between sequences and structures of proteins for database searches. The potential function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures, all of which were estimated from statistical preferences observed in known protein structures. Each potential energy term was modified to represent compatibilities between sequences and structures for globular proteins. Pairwise contact interactions in a sequence-structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. The results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold, and yield mostly similar alignments for these two types of sequence and structure pairs. Probability alignments consisting of most reliable site pairs only can yield extremely small root mean square deviations, and including less reliable pairs increases the deviations. Also, it is observed that secondary structure potentials are usefully complementary to yield improved alignments with this method. Remarkably, by this method some individual sequence-structure pairs are detected having only 5-20% sequence identity.
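Two ideas in the abstract lend themselves to a small illustration: gap penalties that scale with the number of contacts at a residue position, and probability alignments built by accepting site pairs in descending order of pairwise alignment probability. The following is a minimal sketch, not the authors' implementation. It assumes the pair probabilities are already available as a matrix (the abstract does not say here how they are computed), that a pair is accepted only if it preserves sequential order with every pair accepted so far, and that `penalty_per_contact` and `threshold` are hypothetical parameters introduced for illustration.

```python
from typing import List, Sequence, Tuple


def contact_gap_penalties(
    contact_counts: Sequence[int], penalty_per_contact: float = 0.5
) -> List[float]:
    """Gap penalty at each structure position, proportional to its contact count.

    Buried positions (many contacts) get large penalties, so gaps tend to
    fall on the surface. `penalty_per_contact` is a hypothetical constant.
    """
    return [penalty_per_contact * n for n in contact_counts]


def probability_alignment(
    pair_probs: Sequence[Sequence[float]], threshold: float = 0.0
) -> List[Tuple[int, int]]:
    """Greedy alignment from a matrix of site-pair alignment probabilities.

    pair_probs[i][j] is the assumed probability that sequence site i is
    aligned with structure site j. Pairs are visited from most to least
    probable; a pair is accepted only if it keeps the alignment strictly
    ordered relative to every pair already accepted.
    """
    candidates = sorted(
        (
            (p, i, j)
            for i, row in enumerate(pair_probs)
            for j, p in enumerate(row)
            if p >= threshold
        ),
        key=lambda t: -t[0],
    )
    accepted: List[Tuple[int, int]] = []
    for _, i, j in candidates:
        # (i - a) * (j - b) > 0 means (i, j) lies strictly before or
        # strictly after (a, b) in both the sequence and the structure.
        if all((i - a) * (j - b) > 0 for a, b in accepted):
            accepted.append((i, j))
    return sorted(accepted)


if __name__ == "__main__":
    probs = [
        [0.9, 0.1, 0.0],
        [0.2, 0.3, 0.8],
        [0.1, 0.7, 0.1],
    ]
    print(contact_gap_penalties([2, 8, 5]))
    print(probability_alignment(probs))
    print(probability_alignment(probs, threshold=0.85))
```

Raising `threshold` keeps only the most reliable site pairs, which mirrors the abstract's observation that alignments restricted to reliable pairs give smaller root mean square deviations, while admitting less reliable pairs extends the alignment at the cost of larger deviations.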

Similar articles

1
Identifying sequence-structure pairs undetected by sequence alignments.
Protein Eng. 2000 Jul;13(7):459-75. doi: 10.1093/protein/13.7.459.
2
Protein sequence-structure alignment based on site-alignment probabilities.
Genome Inform Ser Workshop Genome Inform. 2000;11:141-50.
3
Large-scale comparison of protein sequence alignment algorithms with structure alignments.
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
4
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.
Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.
5
A reliable sequence alignment method based on probabilities of residue correspondences.
Protein Eng. 1995 Oct;8(10):999-1009. doi: 10.1093/protein/8.10.999.
6
NdPASA: a novel pairwise protein sequence alignment algorithm that incorporates neighbor-dependent amino acid propensities.
Proteins. 2005 Feb 15;58(3):628-37. doi: 10.1002/prot.20359.
7
Probabilistic description of protein alignments for sequences and structures.
Proteins. 2004 Jul 1;56(1):157-66. doi: 10.1002/prot.20067.
8
Using CLUSTAL for multiple sequence alignments.
Methods Enzymol. 1996;266:383-402. doi: 10.1016/s0076-6879(96)66024-8.
9
Use of residue pairs in protein sequence-sequence and sequence-structure alignments.
Protein Sci. 2000 Aug;9(8):1576-88. doi: 10.1110/ps.9.8.1576.
10
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.

Cited by

1
Profile conditional random fields for modeling protein families with structural information.
Biophysics (Nagoya-shi). 2009 May 30;5:37-44. doi: 10.2142/biophysics.5.37. eCollection 2009.
2
BioShell-Threading: versatile Monte Carlo package for protein 3D threading.
BMC Bioinformatics. 2014 Jan 20;15:22. doi: 10.1186/1471-2105-15-22.
3
From isotropic to anisotropic side chain representations: comparison of three models for residue contact estimation.
PLoS One. 2011 Apr 28;6(4):e19238. doi: 10.1371/journal.pone.0019238.
4
Computational immunology meets bioinformatics: the use of prediction tools for molecular binding in the simulation of the immune system.
PLoS One. 2010 Apr 16;5(4):e9862. doi: 10.1371/journal.pone.0009862.
5
Statistical potential for assessment and prediction of protein structures.
Protein Sci. 2006 Nov;15(11):2507-24. doi: 10.1110/ps.062416606.
6
Predicting binding sites of hydrolase-inhibitor complexes by combining several methods.
BMC Bioinformatics. 2004 Dec 17;5:205. doi: 10.1186/1471-2105-5-205.
7
Statistical potentials for fold assessment.
Protein Sci. 2002 Feb;11(2):430-48. doi: 10.1002/pro.110430.
8
Identification of related proteins with weak sequence identity using secondary structure information.
Protein Sci. 2001 Apr;10(4):788-97. doi: 10.1110/ps.30001.