Stone Eric A, Cooper Gregory M, Sidow Arend
Department of Statistics, Stanford University, Stanford, California 94305, USA.
Annu Rev Genomics Hum Genet. 2005;6:143-64. doi: 10.1146/annurev.genom.6.080604.162146.
As whole-genome sequencing efforts extend beyond more traditional model organisms to include a deep diversity of species, comparative genomic analyses will be further empowered to reveal insights into the human genome and its evolution. The discovery and annotation of functional genomic elements is a necessary step toward a detailed understanding of our biology, and sequence comparisons have proven to be an integral tool for that task. This review is structured to broadly reflect the statistical challenges in discriminating these functional elements from the bulk of the genome that has evolved neutrally. Specifically, we review the comparative genomics literature in terms of specificity, sensitivity, and phylogenetic scope, as well as the trade-offs that relate these factors in standard analyses. We consider the impact of an expanding diversity of orthologous sequences on our ability to resolve functional elements. This impact is assessed through both recent comparative analyses of deep alignments and mathematical modeling.
随着全基因组测序工作从更传统的模式生物扩展到涵盖种类繁多的物种,比较基因组分析将更有能力揭示人类基因组及其进化的相关见解。功能基因组元件的发现和注释是详细了解我们生物学特性的必要步骤,而序列比较已被证明是完成这项任务不可或缺的工具。本综述的结构旨在广泛反映在从已中性进化的基因组主体中区分这些功能元件时所面临的统计挑战。具体而言,我们从特异性、敏感性和系统发育范围,以及在标准分析中关联这些因素的权衡等方面来综述比较基因组学文献。我们考虑直系同源序列多样性的增加对我们解析功能元件能力的影响。这种影响通过近期对深度比对的比较分析和数学建模来评估。