Plastaumatic: Automating plastome assembly and annotation.

Authors

Chen Wenyi, Achakkagari Sai Reddy, Strömvik Martina

Affiliation

Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, QC, Canada.

Publication information

Front Plant Sci. 2022 Nov 3;13:1011948. doi: 10.3389/fpls.2022.1011948. eCollection 2022.

DOI: 10.3389/fpls.2022.1011948

PMID: 36407635

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9669643/

Abstract

Plastome sequence data are most often extracted from plant whole-genome sequencing data and need to be assembled and annotated separately from the nuclear genome sequence. In projects comprising multiple genomes, processing the plastomes individually is labour-intensive, as it requires many steps and software tools. This study developed Plastaumatic, an automated pipeline for both assembly and annotation of plastomes, intended to let the researcher load whole-genome sequence data with minimal manual input and, as a result, a faster runtime. The main structure of the current automated pipeline includes trimming of adaptor and low-quality sequences using [tool name missing in source], plastome assembly using [tool name missing in source], standardization and quality checking of the assembled genomes through a custom script utilizing [tool names missing in source], annotation of the assembled genomes using [tool name missing in source], and finally generating the required files for NCBI GenBank submissions. The pipeline is demonstrated with 12 potato accessions and three soybean accessions.
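The stage order described in the abstract can be illustrated with a minimal Python sketch. This is not the Plastaumatic implementation: the stage names are taken from the abstract, but `Sample`, `make_stage`, `run_pipeline`, and the sample names are hypothetical, and each stage is a placeholder that only records that it ran instead of calling an external tool.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Stage order as listed in the abstract. The names are illustrative labels,
# not the commands or rule names used by the real pipeline.
STAGES = [
    "trim",
    "assemble",
    "standardize_and_check",
    "annotate",
    "prepare_genbank_files",
]


@dataclass
class Sample:
    """One accession moving through the pipeline (hypothetical structure)."""
    name: str
    completed: List[str] = field(default_factory=list)


def make_stage(stage_name: str) -> Callable[[Sample], Sample]:
    """Build a placeholder stage that only records its own name."""
    def run(sample: Sample) -> Sample:
        # A real stage would invoke the corresponding external tool here.
        sample.completed.append(stage_name)
        return sample
    return run


PIPELINE = [make_stage(name) for name in STAGES]


def run_pipeline(sample_names: List[str]) -> List[Sample]:
    """Run every stage, in order, for each sample without manual steps."""
    results = []
    for name in sample_names:
        sample = Sample(name)
        for stage in PIPELINE:
            sample = stage(sample)
        results.append(sample)
    return results


if __name__ == "__main__":
    for s in run_pipeline(["potato_01", "soybean_01"]):
        print(s.name, "->", ", ".join(s.completed))
```

Keeping the stages in one ordered list means a batch of accessions can be processed with a single call, which is the kind of reduction in manual input the abstract describes.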

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/860f/9669643/896d30e3210e/fpls-13-1011948-g001.jpg
