DIVAA：多序列比对蛋白质序列中的氨基酸多样性分析

DIVAA: analysis of amino acid diversity in multiple aligned protein sequences.

作者信息

Rodi Diane J, Mandava Suneeta, Makowski Lee

机构信息

Combinatorial Biology Unit, Biosciences Division, Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL 60439, USA.

出版信息

Bioinformatics. 2004 Dec 12;20(18):3481-9. doi: 10.1093/bioinformatics/bth432. Epub 2004 Jul 29.

DOI:10.1093/bioinformatics/bth432

PMID:15284106

Abstract

MOTIVATION

Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the structural and functional constraints at each point. Distribution of information across a motif has been used previously, but the non-intuitive nature of the analysis has limited its impact.

RESULTS

Here, we introduce a quantitative measure of amino acid sequence diversity (DIVAA) that has a simple, intuitive meaning. Diversity, as a measure of sequence conservation or variation, is inextricably linked to the probability of selecting identical pairs from a distribution. We demonstrate its utility through the analysis of four populations: ATP-binding P-loops, hypervariable domains of kappa light chains, signal sequences, and the N- and C- termini of proteins. DIVAA provides a simple means to generate hypotheses concerning the contribution of individual residues to the functional and evolutionary relationships among proteins.

AVAILABILITY

Access to DIVAA software is available at RELIC (http://relic.bio.anl.gov).

摘要

动机

蛋白质的多序列比对是识别保守氨基酸的有效方法，这些保守氨基酸为蛋白质之间的功能关系提供线索。对序列基序中每个位置发现的氨基酸丰度进行定量，可以为理解每个位点的结构和功能限制提供基础。信息在基序中的分布此前已被使用，但分析的非直观性质限制了其影响。

结果

在这里，我们引入了一种氨基酸序列多样性的定量度量（DIVAA），它具有简单、直观的含义。作为序列保守性或变异性的度量，多样性与从分布中选择相同对的概率有着千丝万缕的联系。我们通过对四个群体的分析证明了它的实用性：ATP结合P环、κ轻链的高变区、信号序列以及蛋白质的N端和C端。DIVAA提供了一种简单的方法来生成关于单个残基对蛋白质之间功能和进化关系贡献的假设。

可用性

可在RELIC（http://relic.bio.anl.gov）获取DIVAA软件。

相似文献

DIVAA: analysis of amino acid diversity in multiple aligned protein sequences.

Bioinformatics. 2004 Dec 12;20(18):3481-9. doi: 10.1093/bioinformatics/bth432. Epub 2004 Jul 29.

LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.

Bioinformatics. 2005 Jun 15;21(12):2814-20. doi: 10.1093/bioinformatics/bti442. Epub 2005 Apr 12.

On the quality of tree-based protein classification.

Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.

Tracking repeats using significance and transitivity.

Bioinformatics. 2004 Aug 4;20 Suppl 1:i311-7. doi: 10.1093/bioinformatics/bth911.

Predicting functionally important residues from sequence conservation.

Bioinformatics. 2007 Aug 1;23(15):1875-82. doi: 10.1093/bioinformatics/btm270. Epub 2007 May 22.

ARCS: an aggregated related column scoring scheme for aligned sequences.

Bioinformatics. 2006 Oct 1;22(19):2326-32. doi: 10.1093/bioinformatics/btl398. Epub 2006 Jul 26.

A novel method for estimating ancestral amino acid composition and its application to proteins of the Last Universal Ancestor.

Bioinformatics. 2004 Sep 22;20(14):2251-7. doi: 10.1093/bioinformatics/bth235. Epub 2004 Apr 8.

Prediction of functional specificity determinants from protein sequences using log-likelihood ratios.

Bioinformatics. 2006 Jan 15;22(2):164-71. doi: 10.1093/bioinformatics/bti766. Epub 2005 Nov 8.

Multiple alignment by sequence annealing.

Bioinformatics. 2007 Jan 15;23(2):e24-9. doi: 10.1093/bioinformatics/btl311.

Assessment of the probabilities for evolutionary structural changes in protein folds.

Bioinformatics. 2007 Apr 1;23(7):832-41. doi: 10.1093/bioinformatics/btm022. Epub 2007 Feb 4.

引用本文的文献

Histocompatibility in Botryllus schlosseri and the origins of adaptive immunity.

Immunogenetics. 2025 May 10;77(1):22. doi: 10.1007/s00251-025-01379-7.

Genetic and functional diversity of allorecognition receptors in the urochordate, .

bioRxiv. 2024 Oct 18:2024.10.16.618699. doi: 10.1101/2024.10.16.618699.

Antibody Design for the Quantification of Photosynthetic Proteins and Their Isoforms.

Methods Mol Biol. 2024;2790:405-416. doi: 10.1007/978-1-0716-3790-6_21.

Expansion and collapse of VEGF diversity in major clades of the animal kingdom.

Angiogenesis. 2023 Aug;26(3):437-461. doi: 10.1007/s10456-023-09874-9. Epub 2023 Apr 5.

mtProtEvol: the resource presenting molecular evolution analysis of proteins involved in the function of Vertebrate mitochondria.

BMC Evol Biol. 2019 Feb 26;19(Suppl 1):47. doi: 10.1186/s12862-019-1371-x.

Data on the time of integration of the human mitochondrial pseudogenes (NUMTs) into the nuclear genome.

Data Brief. 2017 May 17;13:536-544. doi: 10.1016/j.dib.2017.05.024. eCollection 2017 Aug.

Use of AAScatterPlot tool for monitoring the evolution of the hemagglutinin cleavage site in H9 avian influenza viruses.

Bioinformatics. 2017 Aug 15;33(16):2431-2435. doi: 10.1093/bioinformatics/btx203.

Molecular evolution and in vitro characterization of Botryllus histocompatibility factor.

Immunogenetics. 2015 Oct;67(10):605-23. doi: 10.1007/s00251-015-0870-1. Epub 2015 Sep 11.

Combining molecular evolution and environmental genomics to unravel adaptive processes of MHC class IIB diversity in European minnows (Phoxinus phoxinus).

Ecol Evol. 2013 Aug;3(8):2568-85. doi: 10.1002/ece3.650. Epub 2013 Jun 28.

Rate and breadth of protein evolution are only weakly correlated.

Biol Direct. 2012 Feb 15;7:8. doi: 10.1186/1745-6150-7-8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

DIVAA：多序列比对蛋白质序列中的氨基酸多样性分析

DIVAA: analysis of amino acid diversity in multiple aligned protein sequences.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献