Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, 05508-090, SP, Brazil.
Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE, USA.
Genome Biol Evol. 2024 Oct 9;16(10). doi: 10.1093/gbe/evae212.
Molecular evolution analysis typically involves identifying selection pressure and reconstructing evolutionary trends. This process usually requires access to specific data related to a target gene or gene family within a particular group of organisms. While recent advancements in high-throughput sequencing techniques have resulted in the rapid accumulation of extensive genomics and transcriptomics data and the creation of new databases in public repositories, extracting valuable insights from such vast data sets remains a significant challenge for researchers. Here, we elucidated the evolutionary history of THI1, a gene responsible for encoding thiamine thiazole synthase. The thiazole ring is a precursor for vitamin B1 and a crucial cofactor in primary metabolic pathways. A thorough search of complete genomes available within public repositories reveals 702 THI1 homologs of Archaea and Eukarya. Throughout its diversification, the plant lineage has preserved the THI1 gene by incorporating the N-terminus and targeting the chloroplasts. Likewise, evolutionary pressures and lifestyle appear to be associated with retention of TPP riboswitch sites and consequent dual posttranscriptional regulation of the de novo biosynthesis pathway in basal groups. Multicopy retention of THI1 is not a typical plant pattern, even after successive genome duplications. Examining cis-regulatory sites in plants uncovers two shared motifs across all plant lineages. A data mining of 484 transcriptome data sets supports the THI1 homolog expression under a light/dark cycle response and a tissue-specific pattern. Finally, the work presented brings a new look at public repositories as an opportunity to explore evolutionary trends to THI1.
分子进化分析通常涉及识别选择压力和重建进化趋势。这个过程通常需要访问特定的目标基因或特定生物群体内的基因家族的相关数据。虽然高通量测序技术的最新进展导致了广泛的基因组学和转录组学数据的快速积累,并在公共存储库中创建了新的数据库,但从如此庞大的数据集中提取有价值的见解仍然是研究人员面临的重大挑战。在这里,我们阐明了 THI1 的进化历史,THI1 基因负责编码硫胺素噻唑合酶。噻唑环是维生素 B1 的前体,也是主要代谢途径中的关键辅酶。在公共存储库中对完整基因组进行全面搜索,揭示了古菌和真核生物中存在 702 个 THI1 同源物。在其多样化过程中,植物谱系通过整合 N 端并靶向叶绿体来保留 THI1 基因。同样,进化压力和生活方式似乎与 TPP 核糖开关位点的保留以及随后对基础群中新合成途径的双重转录后调控有关。THI1 的多拷贝保留不是典型的植物模式,即使在连续的基因组加倍后也是如此。对植物顺式调控位点的研究揭示了所有植物谱系中存在两个共同的基序。对 484 个转录组数据集的挖掘支持了 THI1 同源物在光/暗循环反应和组织特异性模式下的表达。最后,所呈现的工作为探索公共存储库作为探索 THI1 进化趋势的机会提供了新的视角。