Drummond A, Rodrigo A G
School of Biological Sciences, University of Auckland, Auckland, New Zealand.
Mol Biol Evol. 2000 Dec;17(12):1807-15. doi: 10.1093/oxfordjournals.molbev.a026281.
Reconstruction of evolutionary relationships from noncontemporaneous molecular samples provides a new challenge for phylogenetic reconstruction methods. With recent biotechnological advances there has been an increase in molecular sequencing throughput, and the potential to obtain serial samples of sequences from populations, including rapidly evolving pathogens, is fast being realized. A new method called the serial-sample unweighted pair grouping method with arithmetic means (sUPGMA) is presented that reconstructs a genealogy or phylogeny of sequences sampled serially in time using a matrix of pairwise distances. The resulting tree depicts the terminal lineages of each sample ending at a different level consistent with the sample's temporal order. Since sUPGMA is a variant of UPGMA, it will perform best when sequences have evolved at a constant rate (i.e., according to a molecular clock). On simulated data, this new method performs better than standard cluster analysis under a variety of longitudinal sampling strategies. Serial-sample UPGMA is particularly useful for analysis of longitudinal samples of viruses and bacteria, as well as ancient DNA samples, with the minimal requirement that samples of sequences be ordered in time.
从非同期分子样本重建进化关系,给系统发育重建方法带来了新挑战。随着近期生物技术的进步,分子测序通量有所增加,从包括快速进化病原体在内的群体中获取序列连续样本的潜力正迅速得以实现。本文提出了一种名为序列样本算术平均非加权对组法(sUPGMA)的新方法,该方法利用成对距离矩阵重建随时间连续采样的序列的谱系或系统发育。所得树状图描绘了每个样本的终端谱系,这些谱系在与样本时间顺序一致的不同层次结束。由于sUPGMA是UPGMA的一个变体,当序列以恒定速率进化(即根据分子钟进化)时,它的表现最佳。在模拟数据上,这种新方法在各种纵向采样策略下比标准聚类分析表现更好。序列样本UPGMA对于分析病毒和细菌的纵向样本以及古代DNA样本特别有用,对序列样本按时间排序的要求最低。