Blanchette Mathieu
McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada.
Annu Rev Genomics Hum Genet. 2007;8:193-213. doi: 10.1146/annurev.genom.8.080706.092300.
Multi-sequence alignments of large genomic regions are at the core of many computational genome-annotation approaches aimed at identifying coding regions, RNA genes, regulatory regions, and other functional features. Such alignments also underlie many genome-evolution studies. Here we review recent computational advances in the area of multi-sequence alignment, focusing on methods suitable for aligning whole vertebrate genomes. We introduce the key algorithmic ideas in use today, and identify publicly available resources for computing, accessing, and visualizing genomic alignments. Finally, we describe the latest alignment-based approaches to identify and characterize various types of functional sequences. Key areas of research are identified and directions for future improvements are suggested.