
A novel randomized iterative strategy for aligning multiple protein sequences.

Authors

Berger M P, Munson P J

Affiliation

Analytical Biostatistics Section, National Institutes of Health, Bethesda, MD 20892.

Publication information

Comput Appl Biosci. 1991 Oct;7(4):479-84. doi: 10.1093/bioinformatics/7.4.479.

DOI: 10.1093/bioinformatics/7.4.479
PMID: 1747779

Abstract

The rigorous alignment of multiple protein sequences becomes impractical even with a modest number of sequences, since computer memory and time requirements increase as the product of the lengths of the sequences. We have devised a strategy to approach such an optimal alignment, which modifies the intensive computer storage and time requirements of dynamic programming. Our algorithm randomly divides a group of unaligned sequences into two subgroups, between which an optimal alignment is then obtained by a Needleman-Wunsch style of algorithm. Our algorithm uses a matrix with dimensions corresponding to the lengths of the two aligned sequence subgroups. The pairwise alignment process is repeated using different random divisions of the whole group into two subgroups. Compared with the rigorous approach of solving the n-dimensional lattice by dynamic programming, our iterative algorithm results in alignments that match or are close to the optimal solution, on a limited set of test problems. We have implemented this algorithm in a computer program that runs on the IBM PC class of machines, together with a user-friendly environment for interactively selecting sequences or groups of sequences to be aligned either simultaneously or progressively.
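
The loop described in the abstract (random split into two subgroups, Needleman-Wunsch between the subgroups, repeat) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' program: the scoring scheme (`pair_score` with match/mismatch/gap values), the sum-of-pairs objective, the trivial left-justified starting alignment, and the rule of keeping a new alignment only when the score does not drop are all choices made here for the example. The function names are hypothetical.

```python
import random
from itertools import combinations

GAP = "-"

def pair_score(x, y, match=1, mismatch=-1, gap=-2):
    """Toy residue score (an assumption; real tools use substitution matrices)."""
    if x == GAP and y == GAP:
        return 0
    if x == GAP or y == GAP:
        return gap
    return match if x == y else mismatch

def sp_score(rows):
    """Sum-of-pairs score of an alignment given as equal-length strings."""
    return sum(pair_score(x, y)
               for r, s in combinations(rows, 2) for x, y in zip(r, s))

def columns(rows):
    """Columns of a subgroup, dropping columns that contain only gaps."""
    return [c for c in zip(*rows) if any(x != GAP for x in c)]

def col_score(ca, cb):
    """Score of placing a column of subgroup A against a column of subgroup B."""
    return sum(pair_score(x, y) for x in ca for y in cb)

def align_groups(rows_a, rows_b):
    """Needleman-Wunsch over the columns of two subgroups (a 2-D matrix)."""
    ca, cb = columns(rows_a), columns(rows_b)
    ga, gb = (GAP,) * len(rows_a), (GAP,) * len(rows_b)
    n, m = len(ca), len(cb)
    s = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        s[i][0] = s[i - 1][0] + col_score(ca[i - 1], gb)
    for j in range(1, m + 1):
        s[0][j] = s[0][j - 1] + col_score(ga, cb[j - 1])
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s[i][j] = max(s[i - 1][j - 1] + col_score(ca[i - 1], cb[j - 1]),
                          s[i - 1][j] + col_score(ca[i - 1], gb),
                          s[i][j - 1] + col_score(ga, cb[j - 1]))
    out_a, out_b, i, j = [], [], n, m
    while i > 0 or j > 0:  # traceback from the bottom-right corner
        if i and j and s[i][j] == s[i - 1][j - 1] + col_score(ca[i - 1], cb[j - 1]):
            out_a.append(ca[i - 1])
            out_b.append(cb[j - 1])
            i, j = i - 1, j - 1
        elif i and s[i][j] == s[i - 1][j] + col_score(ca[i - 1], gb):
            out_a.append(ca[i - 1])
            out_b.append(gb)
            i -= 1
        else:
            out_a.append(ga)
            out_b.append(cb[j - 1])
            j -= 1
    out_a.reverse()
    out_b.reverse()
    new_a = ["".join(c[k] for c in out_a) for k in range(len(rows_a))]
    new_b = ["".join(c[k] for c in out_b) for k in range(len(rows_b))]
    return new_a, new_b

def randomized_iterative_alignment(seqs, iterations=50, seed=0):
    """Repeat random two-way splits, realigning the subgroups each time."""
    rng = random.Random(seed)
    width = max(len(s) for s in seqs)
    rows = [s.ljust(width, GAP) for s in seqs]  # assumed trivial start
    for _ in range(iterations):
        order = list(range(len(rows)))
        rng.shuffle(order)
        cut = rng.randint(1, len(rows) - 1)  # both subgroups non-empty
        idx_a, idx_b = order[:cut], order[cut:]
        new_a, new_b = align_groups([rows[i] for i in idx_a],
                                    [rows[i] for i in idx_b])
        candidate = list(rows)
        for i, r in zip(idx_a + idx_b, new_a + new_b):
            candidate[i] = r
        if sp_score(candidate) >= sp_score(rows):  # assumed acceptance rule
            rows = candidate
    return rows
```

Calling `randomized_iterative_alignment(["HEAGAWGHEE", "PAWHEAE", "HEAGAWGHE", "PAWHEAEE"])` returns four gapped strings of equal length. Each pass fills only a two-dimensional matrix sized by the column counts of the two subgroups, which is the saving over the n-dimensional lattice that the abstract refers to.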

Similar articles

1
A novel randomized iterative strategy for aligning multiple protein sequences.
Comput Appl Biosci. 1991 Oct;7(4):479-84. doi: 10.1093/bioinformatics/7.4.479.
2
A multiple sequence alignment algorithm for homologous proteins using secondary structure information and optionally keying alignments to functionally important sites.
Comput Appl Biosci. 1989 Apr;5(2):141-50. doi: 10.1093/bioinformatics/5.2.141.
3
On global sequence alignment.
Comput Appl Biosci. 1994 Jun;10(3):227-35. doi: 10.1093/bioinformatics/10.3.227.
4
Using CLUSTAL for multiple sequence alignments.
Methods Enzymol. 1996;266:383-402. doi: 10.1016/s0076-6879(96)66024-8.
5
Dynamic programming algorithms for biological sequence comparison.
Methods Enzymol. 1992;210:575-601. doi: 10.1016/0076-6879(92)10029-d.
6
A method for detecting distant evolutionary relationships between protein or nucleic acid sequences in the presence of deletions or insertions.
J Mol Evol. 1978 Jun 20;11(2):143-61. doi: 10.1007/BF01733890.
7
Automatic generation of primary sequence patterns from sets of related protein sequences.
Proc Natl Acad Sci U S A. 1990 Jan;87(1):118-22. doi: 10.1073/pnas.87.1.118.
8
Alignment of protein sequences by their profiles.
Protein Sci. 2004 Apr;13(4):1071-87. doi: 10.1110/ps.03379804.
9
A fast and sensitive multiple sequence alignment algorithm.
Comput Appl Biosci. 1989 Apr;5(2):115-21. doi: 10.1093/bioinformatics/5.2.115.
10
Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments.
J Mol Biol. 1996 Dec 13;264(4):823-38. doi: 10.1006/jmbi.1996.0679.

Cited by

1
Epistasis between promoter activity and coding mutations shapes gene evolvability.
Sci Adv. 2023 Feb 3;9(5):eadd9109. doi: 10.1126/sciadv.add9109.
2
Developments in Algorithms for Sequence Alignment: A Review.
Biomolecules. 2022 Apr 6;12(4):546. doi: 10.3390/biom12040546.
3
Segmental duplications and their variation in a complete human genome.
Science. 2022 Apr;376(6588):eabj6965. doi: 10.1126/science.abj6965. Epub 2022 Apr 1.
4
Neutralizing Monoclonal Antibodies against the Gn and the Gc of the Andes Virus Glycoprotein Spike Complex Protect from Virus Challenge in a Preclinical Hamster Model.
mBio. 2020 Mar 24;11(2):e00028-20. doi: 10.1128/mBio.00028-20.
5
Genome-wide discovery, and computational and transcriptional characterization of an AIG gene family in the freshwater snail Biomphalaria glabrata, a vector for Schistosoma mansoni.
BMC Genomics. 2020 Mar 2;21(1):190. doi: 10.1186/s12864-020-6534-z.
6
Phylogeographic analyses point to long-term survival on the spot in micro-endemic Lycian salamanders.
PLoS One. 2020 Jan 13;15(1):e0226326. doi: 10.1371/journal.pone.0226326. eCollection 2020.
7
The changing views on the evolutionary relationships of extant Salamandridae (Amphibia: Urodela).
PLoS One. 2018 Aug 1;13(8):e0198237. doi: 10.1371/journal.pone.0198237. eCollection 2018.
8
MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization.
Brief Bioinform. 2019 Jul 19;20(4):1160-1166. doi: 10.1093/bib/bbx108.
9
Application of the MAFFT sequence alignment program to large data-reexamination of the usefulness of chained guide trees.
Bioinformatics. 2016 Nov 1;32(21):3246-3251. doi: 10.1093/bioinformatics/btw412. Epub 2016 Jul 4.
10
A simple method to control over-alignment in the MAFFT multiple sequence alignment program.
Bioinformatics. 2016 Jul 1;32(13):1933-42. doi: 10.1093/bioinformatics/btw108. Epub 2016 Feb 26.