Sequence database search using jumping alignments.

Authors

Spang R, Rehmsmeier M, Stoye J

Affiliation

German Cancer Research Center (DKFZ), Theoretical Bioinformatics, Heidelberg, Germany.

Publication information

Proc Int Conf Intell Syst Mol Biol. 2000;8:367-75.

PMID:10977097
Abstract

We describe a new algorithm for amino acid sequence classification and the detection of remote homologues. The rationale is to exploit both vertical and horizontal information of a multiple alignment in a well balanced manner. This is in contrast to established methods like profiles and hidden Markov models which focus on vertical information as they model the columns of the alignment independently. In our setting, we want to select from a given database of "candidate sequences" those proteins that belong to a given superfamily. In order to do so, each candidate sequence is separately tested against a multiple alignment of the known members of the superfamily by means of a new jumping alignment algorithm. This algorithm is an extension of the Smith-Waterman algorithm and computes a local alignment of a single sequence and a multiple alignment. In contrast to traditional methods, however, this alignment is not based on a summary of the individual columns of the multiple alignment. Rather, the candidate sequence at each position is aligned to one sequence of the multiple alignment, called the "reference sequence". In addition, the reference sequence may change within the alignment, while each such jump is penalized. To evaluate the discriminative quality of the jumping alignment algorithm, we compared it to hidden Markov models on a subset of the SCOP database of protein domains. The discriminative quality was assessed by counting the number of false positives that ranked higher than the first true positive (FP-count). For moderate FP-counts above five, the number of successful searches with our method was considerably higher than with hidden Markov models.
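
The recurrence described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes simple match/mismatch scores in place of a substitution matrix, a linear gap penalty, and it scores a gap character in the current reference row like a gap. The function name `jumping_alignment_score` and all parameter names and default values are hypothetical. Each dynamic-programming cell keeps one score per row of the multiple alignment, and moving from one row to another costs a fixed `jump` penalty.

```python
def jumping_alignment_score(candidate, alignment,
                            match=2, mismatch=-1, gap=2, jump=3):
    """Best local score of `candidate` against a multiple alignment.

    `alignment` is a list of equal-length rows, with '-' as the gap symbol.
    All scoring parameters are illustrative assumptions.
    """
    n, m, rows = len(candidate), len(alignment[0]), len(alignment)
    # H[i][j][k]: best score of a local alignment that ends at
    # candidate[:i] and column j, with row k as the reference sequence.
    H = [[[0] * rows for _ in range(m + 1)] for _ in range(n + 1)]
    best = 0

    def reach(i, j):
        # Score available to each row k at cell (i, j): either stay in
        # row k, or jump in from the best row and pay the jump penalty.
        top = max(H[i][j])
        return [max(h, top - jump) for h in H[i][j]]

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag, up, left = reach(i - 1, j - 1), reach(i - 1, j), reach(i, j - 1)
            for k in range(rows):
                ref = alignment[k][j - 1]
                if ref == "-":
                    sub = -gap  # assumption: residue against a gap costs a gap
                else:
                    sub = match if ref == candidate[i - 1] else mismatch
                # Smith-Waterman style: never drop below zero (local alignment).
                H[i][j][k] = max(0, diag[k] + sub, up[k] - gap, left[k] - gap)
            best = max(best, max(H[i][j]))
    return best


family = ["ACDEFGHIK", "LMNPQRSTV"]
# The candidate follows row 0 for four residues, then row 1 for five.
print(jumping_alignment_score("ACDEQRSTV", family))            # 15
print(jumping_alignment_score("ACDEQRSTV", family, jump=100))  # 10
```

With the default penalty, the best alignment matches all nine residues and pays for one jump (9 × 2 − 3 = 15). With a prohibitive jump penalty, the search falls back to the best single-row local alignment (five matches against row 1, score 10), which illustrates the horizontal information that a column-wise summary would not capture.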

Similar articles

1. Sequence database search using jumping alignments.
Proc Int Conf Intell Syst Mol Biol. 2000;8:367-75.
2. A novel approach to remote homology detection: jumping alignments.
J Comput Biol. 2002;9(5):747-60. doi: 10.1089/106652702761034172.
3. Use of multiple profiles corresponding to a sequence alignment enables effective detection of remote homologues.
Bioinformatics. 2005 Jun 15;21(12):2821-6. doi: 10.1093/bioinformatics/bti432. Epub 2005 Apr 7.
4. Scoring profile-to-profile sequence alignments.
Protein Sci. 2004 Jun;13(6):1612-26. doi: 10.1110/ps.03601504.
5. Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model.
BMC Bioinformatics. 2004 Oct 25;5:157. doi: 10.1186/1471-2105-5-157.
6. A comprehensive system for evaluation of remote sequence similarity detection.
BMC Bioinformatics. 2007 Aug 28;8:314. doi: 10.1186/1471-2105-8-314.
7. Application of protein structure alignments to iterated hidden Markov model protocols for structure prediction.
BMC Bioinformatics. 2006 Sep 14;7:410. doi: 10.1186/1471-2105-7-410.
8. Database search based on Bayesian alignment.
Proc Int Conf Intell Syst Mol Biol. 1999:297-305.
9. HMM-ModE--improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences.
BMC Bioinformatics. 2007 Mar 27;8:104. doi: 10.1186/1471-2105-8-104.
10. An adaptive and iterative algorithm for refining multiple sequence alignment.
Comput Biol Chem. 2004 Apr;28(2):141-8. doi: 10.1016/j.compbiolchem.2004.02.001.

Cited by

1. Conserved Motifs and Domains in Members of ..
Cells. 2022 Jan 11;11(2):230. doi: 10.3390/cells11020230.
2. jpHMM at GOBICS: a web server to detect genomic recombinations in HIV-1.
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W463-5. doi: 10.1093/nar/gkl255.
3. A jumping profile Hidden Markov Model and applications to recombination sites in HIV and HCV genomes.
BMC Bioinformatics. 2006 May 22;7:265. doi: 10.1186/1471-2105-7-265.