Roje Dawn M
Department of Ichthyology, American Museum of Natural History, New York, New York, United States of America.
PLoS One. 2014 Feb 18;9(2):e87428. doi: 10.1371/journal.pone.0087428. eCollection 2014.
Nucleotide substitution models used in molecular phylogenetics do not account for nucleotide sequences evolving under selection, yet selection is rarely tested for. If non-neutral markers violate these models (i.e. non-independence of sites), it is expected that their reconstructed topologies be incongruent with those inferred from neutral ones and conclusions made from those phylogenies should be reexamined. Using rhodopsin as a phylogenetic marker has recently been called into question for exactly this reason. Rhodopsin is assumed to have evolved under strong positive selection for organisms that inhabit similar aquatic environments, making it unsuitable for the phylogenetics of aquatic organisms, but it is unclear what the effects of non-neutrality on phylogeny estimation are. To evaluate potential incongruence of neutral versus non-neutral markers, and the notion that rhodopsin should not be used in the molecular phylogenetics of fishes, a molecular dataset of 78 acanthomorph taxa and sequences from four nuclear, protein coding loci (including rhodopsin), were examined. Only one marker was found to be neutral while the remaining tests, for all other loci, rejected the null hypothesis of neutrality. To evaluate the possible effect(s) of positively versus negatively selected sites, the three non-neutral markers were analyzed to determine the presence of positively and negatively selected codons. To determine congruence in topology among ML trees inferred by individual neutral and non-neutral markers, as well as the combined (concatenated) dataset, tree, comparisons of distances among trees and hypothesis (topology) testing were carried out. Results of the tree distance metrics and topology testing support the notion that neutrality alone does not determine congruence in topology, and those data that are inferred to have evolved under selection should not necessarily be excluded. In addition, the number of sites inferred to have evolved under positive selection does not predict congruence with other markers or the topology inferred with the concatenated dataset.
分子系统发育学中使用的核苷酸替换模型并未考虑在选择作用下进化的核苷酸序列,然而很少有人对选择进行检验。如果非中性标记违反了这些模型(即位点的非独立性),那么可以预期它们重建的拓扑结构将与从中性标记推断出的拓扑结构不一致,并且基于这些系统发育得出的结论应该重新审视。正是出于这个原因,最近有人对使用视紫红质作为系统发育标记提出了质疑。对于栖息在相似水生环境中的生物,视紫红质被认为是在强烈的正选择作用下进化的,这使得它不适用于水生生物的系统发育研究,但尚不清楚非中性对系统发育估计的影响是什么。为了评估中性与非中性标记之间潜在的不一致性,以及视紫红质不应被用于鱼类分子系统发育研究的观点,我们研究了一个包含78个棘鳍类分类单元的分子数据集以及来自四个核蛋白编码基因座(包括视紫红质)的序列。仅发现一个标记是中性的,而对所有其他基因座的其余测试都拒绝了中性的零假设。为了评估正选择和负选择位点可能产生的影响,我们对这三个非中性标记进行了分析,以确定正选择和负选择密码子的存在。为了确定由单个中性和非中性标记以及合并(串联)数据集推断出的最大似然树之间在拓扑结构上的一致性,我们进行了树之间距离的比较以及假设(拓扑结构)检验。树距离度量和拓扑结构检验的结果支持这样一种观点,即仅中性并不决定拓扑结构的一致性,那些被推断在选择作用下进化的数据不一定应该被排除。此外,推断在正选择作用下进化的位点数量并不能预测与其他标记的一致性或与串联数据集推断出的拓扑结构的一致性。