Centre for Genomic Regulation, Dr. Aiguader 88, 08003 Barcelona, Spain.
BMC Bioinformatics. 2012 Aug 28;13:216. doi: 10.1186/1471-2105-13-216.
Several software packages can generate DNA multiple sequence alignments (MSAs) evolved under continuous-time Markov processes on phylogenetic trees. However, no method exists for simulating a DNA MSA directly from the transition matrices. Moreover, existing software is restricted to time-reversible models and is not optimized for generating nonhomogeneous data (i.e., data with distinct substitution rates on different lineages).
We present the first package designed to generate MSAs evolving under discrete-time Markov processes on phylogenetic trees, directly from substitution probability matrices. Given an input model and a phylogenetic tree in Newick format (with branch lengths measured as the expected number of substitutions per site), the algorithm produces DNA alignments of the desired length. GenNon-h is publicly available for download.
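The idea of simulating directly from substitution probability matrices can be illustrated with a minimal Python sketch. It is not GenNon-h's implementation: the tree is written as a nested list rather than parsed from Newick, the matrices are supplied by hand rather than derived from a model and branch lengths, and every name (`simulate`, `evolve`, `uniform_change`, `SKEWED`, `TREE`) is hypothetical. A root sequence is drawn from a root distribution, and each edge applies its own 4x4 matrix site by site, so different lineages can carry different, non-reversible processes.

```python
import random

BASES = "ACGT"

def sample(probs, rng):
    """Draw a base index (0..3) according to a probability vector."""
    return rng.choices(range(4), weights=probs, k=1)[0]

def evolve(children, parent_seq, rng, alignment):
    """Push a parent sequence down every edge below it.

    Each child is (name, matrix, grandchildren); matrix[i][j] is the
    probability that base i at the parent becomes base j at the child.
    """
    for name, matrix, grandchildren in children:
        child_seq = [sample(matrix[b], rng) for b in parent_seq]
        if grandchildren:
            evolve(grandchildren, child_seq, rng, alignment)
        else:
            # Leaves are the observed sequences of the alignment.
            alignment[name] = "".join(BASES[b] for b in child_seq)

def simulate(tree, root_dist, length, seed=0):
    """Return {leaf name: sequence} for an alignment of the given length."""
    rng = random.Random(seed)
    root_seq = [sample(root_dist, rng) for _ in range(length)]
    alignment = {}
    evolve(tree, root_seq, rng, alignment)
    return alignment

def uniform_change(p):
    """Illustrative matrix: each base changes to any other with probability p."""
    return [[1 - 3 * p if i == j else p for j in range(4)] for i in range(4)]

# A non-symmetric matrix (rows sum to 1), used on some edges only,
# so the process differs between lineages. Values are made up.
SKEWED = [
    [0.85, 0.05, 0.07, 0.03],
    [0.02, 0.90, 0.03, 0.05],
    [0.10, 0.02, 0.86, 0.02],
    [0.04, 0.06, 0.05, 0.85],
]

# Assumed toy topology: (A, (B, C)) with one matrix per edge.
TREE = [
    ("A", uniform_change(0.02), []),
    ("internal", SKEWED, [
        ("B", uniform_change(0.05), []),
        ("C", SKEWED, []),
    ]),
]

msa = simulate(TREE, root_dist=[0.4, 0.1, 0.1, 0.4], length=20)
for name, seq in sorted(msa.items()):
    print(name, seq)
```

Because sites evolve independently, the alignment length only sets how many times the root-to-leaf sampling is repeated; no insertions or deletions are modeled in this sketch.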
The software presented here is an efficient tool for generating DNA MSAs on a given phylogenetic tree. GenNon-h provides the user with nonstationary or nonhomogeneous phylogenetic data that are well suited for testing complex biological hypotheses and for exploring the limits of reconstruction algorithms and their robustness to such models.