Department of Computer Science, Université de Sherbrooke, 2500 Bd de l'université, Sherbrooke, QC, J1K2R1, Canada.
BMC Bioinformatics. 2024 Jul 11;25(1):235. doi: 10.1186/s12859-024-05853-z.
SimSpliceEvol is a tool for simulating the evolution of eukaryotic gene sequences that integrates exon-intron structure evolution as well as the evolution of the sets of transcripts produced from genes. It takes a guide gene tree as input and generates a gene sequence with its transcripts for each node of the tree, from the root to the leaves. However, the sets of transcripts simulated at different nodes of the guide gene tree lack evolutionary connections. Consequently, SimSpliceEvol is not suitable for evaluating methods for transcript phylogeny inference or gene phylogeny inference that rely on transcript conservation.
Here, we introduce SimSpliceEvol2, which, compared to the first version, incorporates an explicit model of transcript evolution for simulating alternative transcripts along the branches of a guide gene tree, as well as the transcript phylogenies inferred. We offer a comprehensive software with a graphical user interface and an updated version of the web server, ensuring easy and user-friendly access to the tool.
SimSpliceEvol2 generates synthetic datasets that are useful for evaluating methods and tools for spliced RNA sequence analysis, such as spliced alignment methods, methods for identifying conserved transcripts, and transcript phylogeny reconstruction methods. The web server is accessible at https://simspliceevol.cobius.usherbrooke.ca , where you can also download the standalone software. Comprehensive documentation for the software is available at the same address. For developers interested in the source code, which requires the installation of all prerequisites to run, it is provided at https://github.com/UdeS-CoBIUS/SimSpliceEvol .
SimSpliceEvol 是一种用于模拟真核基因序列进化的工具,它集成了外显子-内含子结构进化以及从基因产生的转录本集的进化。它以指导基因树作为输入,并为树的每个节点(从根到叶)生成带有其转录本的基因序列。然而,在指导基因树的不同节点模拟的转录本集缺乏进化联系。因此,SimSpliceEvol 不适合评估依赖转录本保守性的转录物系统发育推断或基因系统发育推断方法。
在这里,我们介绍了 SimSpliceEvol2,与第一版相比,它包含了一种显式的转录本进化模型,用于模拟指导基因树分支上的替代转录本,以及推断的转录本系统发育。我们提供了一个带有图形用户界面的综合软件和更新的网络服务器版本,确保了工具的轻松和用户友好的访问。
SimSpliceEvol2 生成了有用的合成数据集,可用于评估拼接 RNA 序列分析的方法和工具,如拼接对齐方法、识别保守转录本的方法以及转录本系统发育重建方法。网络服务器可在 https://simspliceevol.cobius.usherbrooke.ca 访问,您也可以在那里下载独立软件。软件的综合文档可在同一地址获得。对于有兴趣使用需要安装所有先决条件才能运行的源代码的开发人员,它可在 https://github.com/UdeS-CoBIUS/SimSpliceEvol 获得。