Finding flexible patterns in unaligned protein sequences.

Authors

Jonassen I, Collins J F, Higgins D G

Affiliation

Department of Informatics, University of Bergen, HIB, Norway.

Publication

Protein Sci. 1995 Aug;4(8):1587-95. doi: 10.1002/pro.5560040817.

Abstract

We present a new method for identifying conserved patterns in a set of unaligned, related protein sequences. It can discover patterns of a quite general form, allowing both ambiguous positions and variable-length wildcard regions. The user defines a class of patterns (e.g., the degree of ambiguity allowed and the length and number of gaps), and the method is then guaranteed to find the conserved patterns in this class that score highest according to a defined significance measure. Identified patterns may be refined using one of two new algorithms. We also present a new (nonstatistical) significance measure for flexible patterns. The method is shown to recover known motifs for PROSITE families and is also applied to some recently described families from the literature.
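
To make the notion of a flexible pattern concrete, the sketch below shows what a pattern with an ambiguous position and variable-length wildcard regions looks like, and how one might count the sequences it matches. It is not the authors' discovery algorithm or their significance measure. It only translates a PROSITE-style pattern string into a regular expression and checks it against sequences. The pattern, the toy sequences, and the helper names `pattern_to_regex` and `support` are all assumptions made up for illustration.

```python
import re

# One pattern element: a wildcard "x", a single residue, or an ambiguous
# position such as [LIVM], optionally followed by (n) or (n,m).
_ELEMENT = re.compile(r"(x|[A-Z]|\[[A-Z]+\])(?:\((\d+)(?:,(\d+))?\))?")


def pattern_to_regex(pattern: str) -> str:
    """Translate a PROSITE-style pattern into a Python regular expression."""
    parts = []
    for element in pattern.split("-"):
        m = _ELEMENT.fullmatch(element)
        if m is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        symbol, low, high = m.groups()
        regex = "." if symbol == "x" else symbol  # "x" matches any residue
        if low is not None:
            # x(3) -> .{3}; x(2,4) -> .{2,4} (variable-length wildcard region)
            regex += "{" + low + ("," + high if high else "") + "}"
        parts.append(regex)
    return "".join(parts)


def support(pattern: str, sequences: list[str]) -> int:
    """Count how many sequences contain at least one match of the pattern."""
    rx = re.compile(pattern_to_regex(pattern))
    return sum(1 for seq in sequences if rx.search(seq))


# Hypothetical pattern and made-up sequences, for illustration only.
PATTERN = "C-x(2,4)-C-[LIVM]-x(3)-H"
SEQUENCES = [
    "MKCAACLPQRHT",    # two residues between the cysteines: matches
    "GGCDEFGCIAAAHK",  # four residues between the cysteines: matches
    "MKCACLPQRHT",     # only one residue between the cysteines: no match
]

if __name__ == "__main__":
    print(pattern_to_regex(PATTERN))
    print(support(PATTERN, SEQUENCES), "of", len(SEQUENCES), "sequences match")
```

Here `[LIVM]` is an ambiguous position and `x(2,4)` is a variable-length wildcard region. Restricting how many of each a pattern may contain is one way to read the "class of patterns" that the abstract says the user defines.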
