Bioinformatics and Genomics Programme. Centre for Genomic Regulation (CRG) and UPF. Doctor Aiguader, 88. 08003 Barcelona, Spain, Universitat Pompeu Fabra (UPF). 08003 Barcelona, Spain, University of Kaiserslautern, Molecular Phylogenetics, Postfach 3049, 67653 Kaiserslautern, Germany and Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluís Companys 23, 08010 Barcelona, Spain.
Nucleic Acids Res. 2014 Apr;42(7):e54. doi: 10.1093/nar/gku071. Epub 2014 Jan 29.
Reconstructing the evolutionary relationships of species is a major goal in biology. Despite the increasing number of completely sequenced genomes, a large number of phylogenetic projects rely on targeted sequencing and analysis of a relatively small sample of marker genes. The selection of these phylogenetic markers should ideally be based on accurate predictions of their combined, rather than individual, potential to accurately resolve the phylogeny of interest. Here we present and validate a new phylogenomics strategy to efficiently select a minimal set of stable markers able to reconstruct the underlying species phylogeny. In contrast to previous approaches, our methodology does not only rely on the ability of individual genes to reconstruct a known phylogeny, but it also explores the combined power of sets of concatenated genes to accurately infer phylogenetic relationships of species not previously analyzed. We applied our approach to two broad sets of cyanobacterial and ascomycetous fungal species, and provide two minimal sets of six and four genes, respectively, necessary to fully resolve the target phylogenies. This approach paves the way for the informed selection of phylogenetic markers in the effort of reconstructing the tree of life.
重建物种的进化关系是生物学的主要目标之一。尽管越来越多的基因组已经完全测序,但许多系统发育项目仍依赖于对相对较小样本的标记基因进行靶向测序和分析。这些系统发育标记的选择理想情况下应基于对其组合而不是个体潜力的准确预测,以准确解决感兴趣的系统发育。在这里,我们提出并验证了一种新的系统发生基因组学策略,以有效地选择一组最小的稳定标记物,能够重建潜在的物种系统发育。与以前的方法不同,我们的方法不仅依赖于单个基因重建已知系统发育的能力,还探索了串联基因集的组合能力,以准确推断以前未分析过的物种的系统发育关系。我们将我们的方法应用于两个广泛的蓝藻和子囊菌门真菌物种集,并分别提供了六和四个基因的最小集合,这是完全解析目标系统发育所必需的。该方法为重建生命之树过程中系统发育标记的选择提供了依据。