Intermediate sequences increase the detection of homology between sequences.

Authors

Park J, Teichmann S A, Hubbard T, Chothia C

Affiliation

Cambridge Centre for Protein Engineering, UK.

Publication

J Mol Biol. 1997 Oct 17;273(1):349-54. doi: 10.1006/jmbi.1997.1288.

Abstract

Two homologous sequences, which have diverged beyond the point where their homology can be recognised by a simple direct comparison, can be related through a third sequence that is suitably intermediate between the two. High scores for a sequence match between the first and third sequences, and between the second and third sequences, imply that the first and second sequences are related even though their own match score is low. We have tested the usefulness of this idea using a database that contains the sequences of 971 protein domains whose structures are known and whose residue identities with each other are some 40% or less (PDB40D). On the basis of sequence and structural information, 2143 pairs of these sequences are known to have an evolutionary relationship. FASTA, in an all-against-all comparison of the sequences in the database, detected 320 (15%) of these relationships as well as three false positives (i.e. a 1% error rate). Using intermediate sequences found by FASTA matches of PDB40D sequences to those in the large non-redundant OWL database, we could detect 550 evolutionary relationships with an error rate of 1%. This means the intermediate sequence procedure increases the ability to recognise the evolutionary relationships amongst the PDB40D sequences by 70%.
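The linking idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `intermediate_hits`, the input layout, the toy scores and the threshold are all hypothetical, and it assumes a scoring scheme in which a higher score means a better match. Two query sequences are reported as related whenever both match the same database sequence at or above the threshold.

```python
from itertools import combinations


def intermediate_hits(query_to_db, threshold):
    """Return pairs of query sequences linked through a shared intermediate.

    query_to_db maps each query id to a dict of {database id: match score}.
    Assumption: higher scores are better, and `threshold` is the cut-off
    above which a match is treated as significant.
    """
    # Invert the hits: database sequence -> queries that match it well.
    linked = {}
    for query, hits in query_to_db.items():
        for db_id, score in hits.items():
            if score >= threshold:
                linked.setdefault(db_id, set()).add(query)

    # Any two queries sharing an intermediate are inferred to be related.
    pairs = set()
    for queries in linked.values():
        for a, b in combinations(sorted(queries), 2):
            pairs.add((a, b))
    return pairs


# Toy data (invented for illustration): A and B both match X strongly,
# while the matches to Y fall below the threshold.
toy_hits = {
    "A": {"X": 120, "Y": 30},
    "B": {"X": 95},
    "C": {"Y": 40},
}
related = intermediate_hits(toy_hits, threshold=80)
```

In this toy case only the pair A and B is linked, through the intermediate X. A real search would also need a criterion for rejecting false links, which the sketch leaves out.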
