Progress, challenge and prospect of plant plastome annotation.

Author information

Qu Xiao-Jian, Zou Dan, Zhang Rui-Yu, Stull Gregory W, Yi Ting-Shuang

Affiliations

Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji'nan, Shandong, China.

Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China.

Publication information

Front Plant Sci. 2023 May 30;14:1166140. doi: 10.3389/fpls.2023.1166140. eCollection 2023.

Abstract

The plastome (plastid genome) is an indispensable source of molecular data for studying plant phylogeny and evolution. Although the plastome is much smaller than the nuclear genome, and multiple annotation tools have been developed specifically for it, accurate plastome annotation remains challenging. Different plastome annotation tools apply different principles and workflows, and annotation errors occur frequently both in published plastomes and in those released in GenBank. It is therefore timely to compare the available annotation tools and to establish standards for plastome annotation. In this review, we survey the basic characteristics of plastomes, trends in the publication of new plastomes, the annotation principles and applications of the major plastome annotation tools, and common errors in plastome annotation. We propose possible methods for identifying pseudogenes and genes subject to RNA editing that jointly consider sequence similarity, custom algorithms, and conserved domains or protein structure. We also argue for establishing a database of reference plastomes with standardized annotations, and put forward a set of quantitative standards that the scientific community can use to evaluate plastome annotation quality. In addition, we discuss how to generate standardized GenBank annotation flatfiles for submission and downstream analysis. Finally, we discuss prospects for future plastome annotation technologies that integrate plastome annotation approaches with the diverse evidence and algorithms used by nuclear genome annotation tools. This review will help researchers use the available tools more efficiently to achieve high-quality plastome annotation, and will promote the standardization of plastome annotation.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/10266425/80c014fea2a6/fpls-14-1166140-g001.jpg
