Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic.
Department of Physics, Faculty of Science, University of Ostrava, Ostrava, Czech Republic.
BMC Res Notes. 2023 Jun 20;16(1):109. doi: 10.1186/s13104-023-06386-z.
Chalcone synthase (CHS) catalyzes the initial step of the flavonoid biosynthesis. The CHS encoding gene is well studied in numerous plant species. Rapidly growing sequence databases contain hundreds of CHS entries that are the result of automatic annotation. In this study, we evaluated apparent multiplication of CHS domains in CHS gene models of four plant species.
CHS genes with an apparent triplication of the CHS domain encoding part were discovered through database searches. Such genes were found in Macadamia integrifolia, Musa balbisiana, Musa troglodytarum, and Nymphaea colorata. A manual inspection of the CHS gene models in these four species with massive RNA-seq data suggests that these gene models are the result of artificial fusions in the annotation process. While there are hundreds of seemingly correct CHS records in the databases, it is not clear why these annotation artifacts appeared.
查尔酮合酶(CHS)催化类黄酮生物合成的初始步骤。CHS 编码基因在许多植物物种中得到了很好的研究。快速增长的序列数据库包含数百个 CHS 条目,这些都是自动注释的结果。在这项研究中,我们评估了四个植物物种的 CHS 基因模型中 CHS 结构域的明显倍增。
通过数据库搜索发现了 CHS 结构域编码部分明显三倍的 CHS 基因。在 Macadamia integrifolia、Musa balbisiana、Musa troglodytarum 和 Nymphaea colorata 中发现了这样的基因。对这四个物种的 CHS 基因模型进行大量 RNA-seq 数据的手动检查表明,这些基因模型是注释过程中人工融合的结果。尽管数据库中有数百个看似正确的 CHS 记录,但尚不清楚为什么会出现这些注释伪像。