Ramazzotti Matteo, Degl'Innocenti Donatella, Manao Giampaolo, Ramponi Giampietro
Department of Biochemical Sciences, University of Florence.
Ital J Biochem. 2004 Mar;53(1):16-22.
Amino acid sequence alignment is an extremely useful tool in protein family analysis. Most family characteristics, such as the localization of functional residues, structural constraints and evolutionary relationships may be retrieved through the observation of the conservation pattern highlighted by the alignments. A quantitative score for the conservation in the alignment allows different stages of an alignment to be compared and consequently the alignment information to be efficiently exploited. Many scoring methods have been proposed during the last three decades. Claude Shannon's theory of communication (1948) paved the way for a consistent scoring of protein alignments by considering the residue (or symbol) frequency. A number of modifications have been proposed since that time, but the core statistical approach is still considered one of the best. By combining many database managing tools for treatment of protein sequences, a ClustalW software integration, a flexible symbols treatment and gap normalization functions, Entropy Calculator software has been developed. This new tool provides a global and optimal approach to multiple sequence alignment scoring by offering an easy graphic interface and a series of modification options that help in interpreting alignments and allow conservation pattern inferences to be performed.
氨基酸序列比对是蛋白质家族分析中极为有用的工具。大多数家族特征,如功能残基的定位、结构限制和进化关系,都可以通过观察比对所突出显示的保守模式来获取。比对中保守性的定量评分允许对比对的不同阶段进行比较,从而有效地利用比对信息。在过去三十年中已经提出了许多评分方法。克劳德·香农的通信理论(1948年)通过考虑残基(或符号)频率,为蛋白质比对的一致性评分铺平了道路。自那时以来已经提出了许多改进方法,但核心统计方法仍被认为是最佳方法之一。通过结合许多用于处理蛋白质序列的数据库管理工具、ClustalW软件集成、灵活的符号处理和空位归一化功能,开发了熵计算器软件。这个新工具通过提供一个简单的图形界面和一系列有助于解释比对并允许进行保守模式推断的修改选项,为多序列比对评分提供了一种全局且最优的方法。