SubSeqer：一种基于图形的方法，用于检测和识别低复杂度序列中的重复元件。

SubSeqer: a graph-based approach for the detection and identification of repetitive elements in low-complexity sequences.

作者信息

He David, Parkinson John

机构信息

Program in Molecular Structure and Function, Hospital for Sick Children, University of Toronto, Toronto, Canada.

出版信息

Bioinformatics. 2008 Apr 1;24(7):1016-7. doi: 10.1093/bioinformatics/btn073. Epub 2008 Feb 26.

DOI:10.1093/bioinformatics/btn073

PMID:18304932

Abstract

Low-complexity, repetitive protein sequences with a limited amino acid palette are abundant in nature, and many of them play an important role in the structure and function of certain types of proteins. However, such repetitive sequences often do not have rigidly defined motifs. Consequently, the identification of these low-complexity repetitive elements has proven challenging for existing pattern-matching algorithms. Here we introduce a new web-tool SubSeqer (http://compsysbio.org/subseqer/) which uses graphical visualization methods borrowed from protein interaction studies to identify and characterize repetitive elements in low-complexity sequences. Given their abundance, we suggest that SubSeqer represents a valuable resource for the study of typically neglected low-complexity sequences.

摘要

低复杂性、具有有限氨基酸组成的重复蛋白质序列在自然界中大量存在，其中许多在某些类型蛋白质的结构和功能中发挥着重要作用。然而，此类重复序列往往没有严格定义的基序。因此，对于现有的模式匹配算法而言，识别这些低复杂性重复元件已被证明具有挑战性。在此，我们引入了一种新的网络工具SubSeqer（http://compsysbio.org/subseqer/），它利用从蛋白质相互作用研究中借鉴的图形可视化方法来识别和表征低复杂性序列中的重复元件。鉴于它们的丰富性，我们认为SubSeqer是研究通常被忽视的低复杂性序列的宝贵资源。

相似文献

SubSeqer: a graph-based approach for the detection and identification of repetitive elements in low-complexity sequences.

Bioinformatics. 2008 Apr 1;24(7):1016-7. doi: 10.1093/bioinformatics/btn073. Epub 2008 Feb 26.

QOMA: quasi-optimal multiple alignment of protein sequences.

Bioinformatics. 2007 Jan 15;23(2):162-8. doi: 10.1093/bioinformatics/btl590. Epub 2006 Nov 22.

The 3of5 web application for complex and comprehensive pattern matching in protein sequences.

BMC Bioinformatics. 2006 Mar 16;7:144. doi: 10.1186/1471-2105-7-144.

SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection.

Bioinformatics. 2008 Mar 15;24(6):783-90. doi: 10.1093/bioinformatics/btn028. Epub 2008 Feb 1.

A Novel algorithm for identifying low-complexity regions in a protein sequence.

Bioinformatics. 2006 Dec 15;22(24):2980-7. doi: 10.1093/bioinformatics/btl495. Epub 2006 Oct 2.

Supervised identification of allergen-representative peptides for in silico detection of potentially allergenic proteins.

Bioinformatics. 2005 Jan 1;21(1):39-50. doi: 10.1093/bioinformatics/bth477. Epub 2004 Aug 19.

Optimizing the size of the sequence profiles to increase the accuracy of protein sequence alignments generated by profile-profile algorithms.

Bioinformatics. 2008 May 1;24(9):1145-53. doi: 10.1093/bioinformatics/btn097. Epub 2008 Mar 12.

Tracking repeats using significance and transitivity.

Bioinformatics. 2004 Aug 4;20 Suppl 1:i311-7. doi: 10.1093/bioinformatics/bth911.

BiasViz: visualization of amino acid biased regions in protein alignments.

Bioinformatics. 2007 Nov 15;23(22):3093-4. doi: 10.1093/bioinformatics/btm489. Epub 2007 Oct 6.

An efficient, versatile and scalable pattern growth approach to mine frequent patterns in unaligned protein sequences.

Bioinformatics. 2007 Mar 15;23(6):687-93. doi: 10.1093/bioinformatics/btl665. Epub 2007 Jan 19.

引用本文的文献

Bioinformatics tools for the sequence complexity estimates.

Biophys Rev. 2023 Sep 15;15(5):1367-1378. doi: 10.1007/s12551-023-01140-y. eCollection 2023 Oct.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

SubSeqer：一种基于图形的方法，用于检测和识别低复杂度序列中的重复元件。

SubSeqer: a graph-based approach for the detection and identification of repetitive elements in low-complexity sequences.

作者信息

He David, Parkinson John

机构信息

Program in Molecular Structure and Function, Hospital for Sick Children, University of Toronto, Toronto, Canada.

出版信息

Bioinformatics. 2008 Apr 1;24(7):1016-7. doi: 10.1093/bioinformatics/btn073. Epub 2008 Feb 26.

DOI:10.1093/bioinformatics/btn073

PMID:18304932

Abstract

摘要

SubSeqer：一种基于图形的方法，用于检测和识别低复杂度序列中的重复元件。

SubSeqer: a graph-based approach for the detection and identification of repetitive elements in low-complexity sequences.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

SubSeqer：一种基于图形的方法，用于检测和识别低复杂度序列中的重复元件。

SubSeqer: a graph-based approach for the detection and identification of repetitive elements in low-complexity sequences.

作者信息

机构信息

出版信息

相似文献

引用本文的文献