A reliable sequence alignment method based on probabilities of residue correspondences.

Author

Miyazawa S

Affiliation

Faculty of Technology, Gunma University, Japan.

Publication

Protein Eng. 1995 Oct;8(10):999-1009. doi: 10.1093/protein/8.10.999.

PMID: 8771180

Abstract

The probabilities of all possible residue correspondences in an alignment of two proteins are evaluated by assuming that the statistical weight of each alignment is proportional to the exponential of its total similarity score. Based on these probabilities, a probability alignment that includes the most probable correspondences is proposed. For highly similar sequence pairs, the probability alignments agree with the maximum similarity alignments, that is, the alignments with the maximum similarity score. Significant correspondences in the probability alignments are those whose probabilities are greater than 0.5. The probability alignment method is applied to a few protein pairs, and the results indicate that such highly probable correspondences are likely to be correct correspondences that agree with the structural alignments, and that incorrect correspondences in the maximum similarity alignments are usually insignificant correspondences in the probability alignments. The root mean square deviations in superimposition of corresponding residues tend to be smaller for significant correspondences in the probability alignments than for all correspondences in the maximum similarity alignments, which again indicates that incorrect correspondences in the maximum similarity alignments tend to be insignificant in the probability alignments. This is also confirmed in 109 protein pairs with sequence identities between 35% and 90%. In addition, the probability alignment method may predict correct correspondences better than the maximum similarity alignment method. Probability alignments do depend on the scoring scheme but are less sensitive to the values of parameters such as gap penalties. The probability alignment method is useful for constructing reliable alignments based on the probabilities of correspondences and can be used with any scoring scheme.
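
The idea in the abstract, weighting every alignment by the exponential of its score and then asking how probable each residue pairing is, can be illustrated with a small forward/backward partition-function sketch. This is not the paper's implementation: the match/mismatch scores, the linear gap penalty, the `temperature` scale, and both function names are assumptions chosen for illustration, and adjacent gaps in opposite sequences are counted as distinct alignments for simplicity. Plain floating-point weights are used, so it is only suitable for short sequences.

```python
import math


def correspondence_probabilities(a, b, match=2.0, mismatch=-1.0,
                                 gap=2.0, temperature=1.0):
    """Probability that a[i] is aligned with b[j] over all global alignments.

    Each alignment gets weight exp(score / temperature). The scoring
    scheme here (match/mismatch, linear gap penalty) is a toy assumption.
    """
    n, m = len(a), len(b)
    w_gap = math.exp(-gap / temperature)

    def w_pair(i, j):
        score = match if a[i] == b[j] else mismatch
        return math.exp(score / temperature)

    # fwd[i][j]: summed weight of all alignments of a[:i] with b[:j].
    fwd = [[0.0] * (m + 1) for _ in range(n + 1)]
    fwd[0][0] = 1.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            total = 0.0
            if i > 0 and j > 0:
                total += fwd[i - 1][j - 1] * w_pair(i - 1, j - 1)
            if i > 0:
                total += fwd[i - 1][j] * w_gap
            if j > 0:
                total += fwd[i][j - 1] * w_gap
            fwd[i][j] = total

    # bwd[i][j]: summed weight of all alignments of a[i:] with b[j:].
    bwd = [[0.0] * (m + 1) for _ in range(n + 1)]
    bwd[n][m] = 1.0
    for i in range(n, -1, -1):
        for j in range(m, -1, -1):
            if i == n and j == m:
                continue
            total = 0.0
            if i < n and j < m:
                total += w_pair(i, j) * bwd[i + 1][j + 1]
            if i < n:
                total += w_gap * bwd[i + 1][j]
            if j < m:
                total += w_gap * bwd[i][j + 1]
            bwd[i][j] = total

    z = fwd[n][m]  # partition function: total weight of all alignments
    probs = [[fwd[i][j] * w_pair(i, j) * bwd[i + 1][j + 1] / z
              for j in range(m)] for i in range(n)]
    return probs, z


def significant_correspondences(probs, threshold=0.5):
    """Return (i, j, p) for pairings whose probability exceeds the threshold."""
    return [(i, j, p)
            for i, row in enumerate(probs)
            for j, p in enumerate(row)
            if p > threshold]


if __name__ == "__main__":
    probs, z = correspondence_probabilities("HEAGAWGHEE", "PAWHEAE")
    for i, j, p in significant_correspondences(probs):
        print(f"a[{i}] ~ b[{j}]  p = {p:.3f}")
```

Because each residue is paired with at most one partner in any single alignment, the probabilities in each row and each column sum to at most 1, so at most one pairing per residue can exceed the 0.5 threshold. Raising `temperature` flattens the distribution over alignments, while lowering it concentrates the weight on the maximum similarity alignment.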
