NU-IN：用于EvolSimulator基因组模拟平台的核苷酸进化与输入模块。

NU-IN: Nucleotide evolution and input module for the EvolSimulator genome simulation platform.

作者信息

Dlugosch Katrina M, Barker Michael S, Rieseberg Loren H

机构信息

Department of Botany, University of British Columbia, Vancouver, BC V6T1Z4, Canada.

出版信息

BMC Res Notes. 2010 Aug 2;3:217. doi: 10.1186/1756-0500-3-217.

DOI:10.1186/1756-0500-3-217

PMID:20678216

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3161368/

Abstract

BACKGROUND

There is increasing demand to test hypotheses that contrast the evolution of genes and gene families among genomes, using simulations that work across these levels of organization. The EvolSimulator program was developed recently to provide a highly flexible platform for forward simulations of amino acid evolution in multiple related lineages of haploid genomes, permitting copy number variation and lateral gene transfer. Synonymous nucleotide evolution is not currently supported, however, and would be highly advantageous for comparisons to full genome, transcriptome, and single nucleotide polymorphism (SNP) datasets. In addition, EvolSimulator creates new genomes for each simulation, and does not allow the input of user-specified sequences and gene family information, limiting the incorporation of further biological realism and/or user manipulations of the data.

FINDINGS

We present modified C++ source code for the EvolSimulator platform, which we provide as the extension module NU-IN. With NU-IN, synonymous and non-synonymous nucleotide evolution is fully implemented, and the user has the ability to use real or previously-simulated sequence data to initiate a simulation of one or more lineages. Gene family membership can be optionally specified, as well as gene retention probabilities that model biased gene retention. We provide PERL scripts to assist the user in deriving this information from previous simulations. We demonstrate the features of NU-IN by simulating genome duplication (polyploidy) in the presence of ongoing copy number variation in an evolving lineage. This example is initiated with real genomic data, and produces output that we analyse directly with existing bioinformatic pipelines.

CONCLUSIONS

The NU-IN extension module is a publicly available open source software (GNU GPLv3 license) extension to EvolSimulator. With the NU-IN module, users are now able to simulate both drift and selection at the nucleotide, amino acid, copy number, and gene family levels across sets of related genomes, for user-specified starting sequences and associated parameters. These features can be used to generate simulated genomic datasets under an extremely broad array of conditions, and with a high degree of biological realism.

摘要

背景

利用跨越这些组织层次的模拟来检验对比基因组间基因和基因家族进化的假设的需求日益增加。EvolSimulator程序最近被开发出来，为单倍体基因组多个相关谱系中氨基酸进化的正向模拟提供了一个高度灵活的平台，允许拷贝数变异和横向基因转移。然而，目前该程序不支持同义核苷酸进化，而这对于与全基因组、转录组和单核苷酸多态性（SNP）数据集进行比较将非常有利。此外，EvolSimulator为每次模拟创建新的基因组，不允许输入用户指定的序列和基因家族信息，限制了进一步纳入生物学真实性和/或用户对数据的操作。

研究结果

我们展示了EvolSimulator平台的修改后的C++源代码，作为扩展模块NU-IN提供。通过NU-IN，完全实现了同义核苷酸和非同义核苷酸进化，用户能够使用真实的或先前模拟的序列数据来启动一个或多个谱系的模拟。可以选择指定基因家族成员身份，以及模拟偏向性基因保留的基因保留概率。我们提供了PERL脚本，以帮助用户从先前的模拟中获取此信息。我们通过在一个进化谱系中存在持续拷贝数变异的情况下模拟基因组加倍（多倍体）来展示NU-IN的功能。这个例子以真实的基因组数据开始，并产生我们直接用现有生物信息学管道分析的输出。

结论

NU-IN扩展模块是EvolSimulator的一个公开可用的开源软件（GNU GPLv3许可）扩展。通过NU-IN模块，用户现在能够针对用户指定的起始序列和相关参数，在相关基因组集合中模拟核苷酸、氨基酸、拷贝数和基因家族水平上的漂变和选择。这些功能可用于在极其广泛的条件下生成具有高度生物学真实性的模拟基因组数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e042/3161368/ff772f9e05f5/1756-0500-3-217-1.jpg

相似文献

NU-IN: Nucleotide evolution and input module for the EvolSimulator genome simulation platform.

BMC Res Notes. 2010 Aug 2;3:217. doi: 10.1186/1756-0500-3-217.

A simulation test bed for hypotheses of genome evolution.

Bioinformatics. 2007 Apr 1;23(7):825-31. doi: 10.1093/bioinformatics/btm024. Epub 2007 Jan 31.

Pyvolve: A Flexible Python Module for Simulating Sequences along Phylogenies.

PLoS One. 2015 Sep 23;10(9):e0139047. doi: 10.1371/journal.pone.0139047. eCollection 2015.

SodaPop: a forward simulation suite for the evolutionary dynamics of asexual populations on protein fitness landscapes.

Bioinformatics. 2019 Oct 15;35(20):4053-4062. doi: 10.1093/bioinformatics/btz175.

HexSE: Simulating evolution in overlapping reading frames.

Virus Evol. 2023 Feb 23;9(1):vead009. doi: 10.1093/ve/vead009. eCollection 2023.

The C- and G-value paradox with polyploidy, repeatomes, introns, phenomes and cell economy.

Genes Genomics. 2020 Jul;42(7):699-714. doi: 10.1007/s13258-020-00941-9. Epub 2020 May 22.

simuG: a general-purpose genome simulator.

Bioinformatics. 2019 Nov 1;35(21):4442-4444. doi: 10.1093/bioinformatics/btz424.

SimPhy: Phylogenomic Simulation of Gene, Locus, and Species Trees.

Syst Biol. 2016 Mar;65(2):334-44. doi: 10.1093/sysbio/syv082. Epub 2015 Nov 1.

SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data.

Bioinformatics. 2015 Nov 15;31(22):3709-11. doi: 10.1093/bioinformatics/btv449. Epub 2015 Jul 29.

引用本文的文献

EvoPipes.net: Bioinformatic Tools for Ecological and Evolutionary Genomics.

Evol Bioinform Online. 2010 Oct 20;6:143-9. doi: 10.4137/EBO.S5861.

本文引用的文献

Paleopolyploidy in the Brassicales: analyses of the Cleome transcriptome elucidate the history of genome duplications in Arabidopsis and other Brassicales.

Genome Biol Evol. 2009 Oct 5;1:391-9. doi: 10.1093/gbe/evp040.

Origins and functional impact of copy number variation in the human genome.

Nature. 2010 Apr 1;464(7289):704-12. doi: 10.1038/nature08516. Epub 2009 Oct 7.

Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition.

Annu Rev Plant Biol. 2009;60:433-53. doi: 10.1146/annurev.arplant.043008.092122.

The impact of reticulate evolution on genome phylogeny.

Syst Biol. 2008 Dec;57(6):844-56. doi: 10.1080/10635150802559265.

Multiple paleopolyploidizations during the evolution of the Compositae reveal parallel patterns of duplicate gene retention after millions of years.

Mol Biol Evol. 2008 Nov;25(11):2445-55. doi: 10.1093/molbev/msn187. Epub 2008 Aug 26.

Positive selection and expression divergence following gene duplication in the sunflower CYCLOIDEA gene family.

Mol Biol Evol. 2008 Jul;25(7):1260-73. doi: 10.1093/molbev/msn001. Epub 2008 Apr 3.

Accelerated rate of gene gain and loss in primates.

Genetics. 2007 Nov;177(3):1941-9. doi: 10.1534/genetics.107.080077. Epub 2007 Oct 18.

Autoimmune response as a mechanism for a Dobzhansky-Muller-type incompatibility syndrome in plants.

PLoS Biol. 2007 Sep;5(9):e236. doi: 10.1371/journal.pbio.0050236.

A simulation test bed for hypotheses of genome evolution.

Bioinformatics. 2007 Apr 1;23(7):825-31. doi: 10.1093/bioinformatics/btm024. Epub 2007 Jan 31.

Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes.

Plant Cell. 2004 Jul;16(7):1667-78. doi: 10.1105/tpc.021345. Epub 2004 Jun 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

NU-IN：用于EvolSimulator基因组模拟平台的核苷酸进化与输入模块。

NU-IN: Nucleotide evolution and input module for the EvolSimulator genome simulation platform.

作者信息

Dlugosch Katrina M, Barker Michael S, Rieseberg Loren H

机构信息

Department of Botany, University of British Columbia, Vancouver, BC V6T1Z4, Canada.

出版信息

BMC Res Notes. 2010 Aug 2;3:217. doi: 10.1186/1756-0500-3-217.

DOI:10.1186/1756-0500-3-217

PMID:20678216

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3161368/

Abstract

BACKGROUND

FINDINGS

CONCLUSIONS

摘要

NU-IN：用于EvolSimulator基因组模拟平台的核苷酸进化与输入模块。

NU-IN: Nucleotide evolution and input module for the EvolSimulator genome simulation platform.

作者信息

机构信息

出版信息

BACKGROUND

FINDINGS

CONCLUSIONS

背景

研究结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

NU-IN：用于EvolSimulator基因组模拟平台的核苷酸进化与输入模块。

NU-IN: Nucleotide evolution and input module for the EvolSimulator genome simulation platform.

作者信息

机构信息

出版信息

BACKGROUND

FINDINGS

CONCLUSIONS

背景

研究结果

结论

相似文献

引用本文的文献

本文引用的文献