Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912;
Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095.
Proc Natl Acad Sci U S A. 2018 Jan 16;115(3):E409-E417. doi: 10.1073/pnas.1707515115. Epub 2018 Jan 4.
There is considerable interest in comparing functional genomic data across species. One goal of such work is to provide an integrated understanding of genome and phenotype evolution. Most comparative functional genomic studies have relied on multiple pairwise comparisons between species, an approach that does not incorporate information about the evolutionary relationships among species. The statistical problems that arise from not considering these relationships can lead pairwise approaches to the wrong conclusions and are a missed opportunity to learn about biology that can only be understood in an explicit phylogenetic context. Here, we examine two recently published studies that compare gene expression across species with pairwise methods, and find reason to question the original conclusions of both. One study interpreted pairwise comparisons of gene expression as support for the ortholog conjecture, the hypothesis that orthologs tend to have more similar attributes (expression in this case) than paralogs. The other study interpreted pairwise comparisons of embryonic gene expression across distantly related animals as evidence for a distinct evolutionary process that gave rise to phyla. In each study, distinct patterns of pairwise similarity among species were originally interpreted as evidence of particular evolutionary processes, but instead, we find that they reflect species relationships. These reanalyses concretely show the inadequacy of pairwise comparisons for analyzing functional genomic data across species. It will be critical to adopt phylogenetic comparative methods in future functional genomic work. Fortunately, phylogenetic comparative biology is also a rapidly advancing field with many methods that can be directly applied to functional genomic data.
人们对跨物种比较功能基因组数据非常感兴趣。此类工作的目标之一是提供对基因组和表型进化的综合理解。大多数比较功能基因组学研究都依赖于物种之间的多次两两比较,这种方法没有纳入物种进化关系的信息。不考虑这些关系会产生统计问题,导致两两方法得出错误的结论,并且错失了以明确的系统发育背景了解生物学的机会。在这里,我们研究了两项最近发表的使用两两方法比较跨物种基因表达的研究,并发现有理由质疑这两项研究的原始结论。一项研究将基因表达的两两比较解释为支持直系同源假说的证据,该假说假设直系同源物往往具有比旁系同源物更相似的属性(在这种情况下是表达)。另一项研究将远缘动物胚胎基因表达的两两比较解释为一个独特的进化过程的证据,该过程导致了门的形成。在每项研究中,物种之间最初被解释为特定进化过程证据的两两相似性模式,但实际上,我们发现它们反映了物种关系。这些重新分析具体地表明,两两比较不适合分析跨物种的功能基因组数据。在未来的功能基因组学工作中采用系统发育比较方法至关重要。幸运的是,系统发育比较生物学也是一个快速发展的领域,有许多方法可以直接应用于功能基因组数据。