• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种识别蛋白质序列中远距离重复序列的方法。

A method to recognize distant repeats in protein sequences.

作者信息

Heringa J, Argos P

机构信息

European Molecular Biology Laboratory, Heidelberg, Germany.

出版信息

Proteins. 1993 Dec;17(4):391-41. doi: 10.1002/prot.340170407.

DOI:10.1002/prot.340170407
PMID:8108381
Abstract

An automated algorithm is presented that delineates protein sequence fragments which display similarity. The method incorporates a selection of a number of local nonoverlapping sequence alignments with the highest similarity scores and a graph-theoretical approach to elucidate the consistent start and end points of the fragments comprising one or more ensembles of related subsequences. The procedure allows the simultaneous identification of different types of repeats within one sequence. A multiple alignment of the resulting fragments is performed and a consensus sequence derived from the ensemble(s). Finally, a profile is constructed from the multiple alignment to detect possible and more distant members within the sequence. The method tolerates mutations in the repeats as well as insertions and deletions. The sequence spans between the various repeats or repeat clusters may be of different lengths. The technique has been applied to a number of proteins where the repeating fragments have been derived from information additional to the protein sequences.

摘要

本文提出了一种自动算法,用于描绘显示相似性的蛋白质序列片段。该方法包括选择一些具有最高相似性得分的局部非重叠序列比对,以及一种图论方法,以阐明构成一个或多个相关子序列集合的片段的一致起点和终点。该程序允许在一个序列中同时识别不同类型的重复序列。对所得片段进行多重比对,并从该集合中导出共有序列。最后,根据多重比对构建一个图谱,以检测序列中可能存在的更远距离的成员。该方法能够容忍重复序列中的突变以及插入和缺失。不同重复序列或重复簇之间的序列跨度可能不同。该技术已应用于许多蛋白质,其中重复片段来自于蛋白质序列之外的信息。

相似文献

1
A method to recognize distant repeats in protein sequences.一种识别蛋白质序列中远距离重复序列的方法。
Proteins. 1993 Dec;17(4):391-41. doi: 10.1002/prot.340170407.
2
Tracking repeats using significance and transitivity.利用显著性和传递性追踪重复序列。
Bioinformatics. 2004 Aug 4;20 Suppl 1:i311-7. doi: 10.1093/bioinformatics/bth911.
3
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
4
Multiple alignment by sequence annealing.通过序列退火进行多序列比对。
Bioinformatics. 2007 Jan 15;23(2):e24-9. doi: 10.1093/bioinformatics/btl311.
5
Protein structure alignment considering phenotypic plasticity.考虑表型可塑性的蛋白质结构比对
Bioinformatics. 2008 Aug 15;24(16):i98-104. doi: 10.1093/bioinformatics/btn271.
6
Multiple sequence alignments.多序列比对
Curr Opin Struct Biol. 2005 Jun;15(3):261-6. doi: 10.1016/j.sbi.2005.04.002.
7
Global multiple-sequence alignment with repeats.含重复序列的全局多序列比对。
Proteins. 2006 Jul 1;64(1):263-74. doi: 10.1002/prot.20957.
8
Incremental window-based protein sequence alignment algorithms.基于窗口递增的蛋白质序列比对算法。
Bioinformatics. 2007 Jan 15;23(2):e17-23. doi: 10.1093/bioinformatics/btl297.
9
Analysis and prediction of functional sub-types from protein sequence alignments.基于蛋白质序列比对的功能亚类型分析与预测。
J Mol Biol. 2000 Oct 13;303(1):61-76. doi: 10.1006/jmbi.2000.4036.
10
Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.通过评估序列-结构适应性预测蛋白质结构。将序列与从三维结构推导的接触谱进行比对。
J Mol Biol. 1993 Aug 5;232(3):805-25. doi: 10.1006/jmbi.1993.1433.

引用本文的文献

1
Application of the MAHDS Method for Multiple Alignment of Highly Diverged Amino Acid Sequences.MAHDS方法在高度分化氨基酸序列多重比对中的应用。
Int J Mol Sci. 2022 Mar 29;23(7):3764. doi: 10.3390/ijms23073764.
2
Identification and Analysis of Long Repeats of Proteins at the Domain Level.在结构域水平上对蛋白质长重复序列的鉴定与分析。
Front Bioeng Biotechnol. 2019 Oct 8;7:250. doi: 10.3389/fbioe.2019.00250. eCollection 2019.
3
Tandem Repeats in Proteins: Prediction Algorithms and Biological Role.蛋白质串联重复:预测算法与生物学作用。
Front Bioeng Biotechnol. 2015 Sep 24;3:143. doi: 10.3389/fbioe.2015.00143. eCollection 2015.
4
Understanding and identifying amino acid repeats.理解和识别氨基酸重复序列。
Brief Bioinform. 2014 Jul;15(4):582-91. doi: 10.1093/bib/bbt003.
5
Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.从头开始检测蛋白质序列中的模糊氨基酸串联重复。
BMC Bioinformatics. 2012 Mar 21;13 Suppl 3(Suppl 3):S8. doi: 10.1186/1471-2105-13-S3-S8.
6
Algorithm to find distant repeats in a single protein sequence.在单个蛋白质序列中查找远距离重复序列的算法。
Bioinformation. 2008;3(1):28-32. doi: 10.6026/97320630003028. Epub 2008 Sep 19.
7
Multiple alignment of protein sequences with repeats and rearrangements.具有重复和重排的蛋白质序列的多序列比对。
Nucleic Acids Res. 2006;34(20):5932-42. doi: 10.1093/nar/gkl511. Epub 2006 Oct 26.
8
HHrep: de novo protein repeat detection and the origin of TIM barrels.HHrep:从头蛋白质重复序列检测与TIM桶的起源
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W137-42. doi: 10.1093/nar/gkl130.
9
Protein structure prediction and analysis using the Robetta server.使用Robetta服务器进行蛋白质结构预测与分析。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W526-31. doi: 10.1093/nar/gkh468.
10
BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.BAliBASE(基准比对数据库):针对重复序列、跨膜序列和环形排列的增强功能。
Nucleic Acids Res. 2001 Jan 1;29(1):323-6. doi: 10.1093/nar/29.1.323.