• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TIPPo:一个用于利用高保真数据从头组装细胞器基因组的用户友好型工具。

TIPPo: A User-Friendly Tool for De Novo Assembly of Organellar Genomes with High-Fidelity Data.

作者信息

Xian Wenfei, Bezrukov Ilja, Bao Zhigui, Vorbrugg Sebastian, Gautam Anupam, Weigel Detlef

机构信息

Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany.

Algorithms in Bioinformatics, Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany.

出版信息

Mol Biol Evol. 2025 Jan 6;42(1). doi: 10.1093/molbev/msae247.

DOI:10.1093/molbev/msae247
PMID:39800935
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11725521/
Abstract

Plant cells have two major organelles with their own genomes: chloroplasts and mitochondria. While chloroplast genomes tend to be structurally conserved, the mitochondrial genomes of plants, which are much larger than those of animals, are characterized by complex structural variation. We introduce TIPPo, a user-friendly, reference-free assembly tool that uses PacBio high-fidelity long-read data and that does not rely on genomes from related species or nuclear genome information for the assembly of organellar genomes. TIPPo employs a deep learning model for initial read classification and leverages k-mer counting for further refinement, significantly reducing the impact of nuclear insertions of organellar DNA on the assembly process. We used TIPPo to completely assemble a set of 54 complete chloroplast genomes. No other tool was able to completely assemble this set. TIPPo is comparable with PMAT in assembling mitochondrial genomes from most species but does achieve even higher completeness for several species. We also used the assembled organelle genomes to identify instances of nuclear plastid DNA (NUPTs) and nuclear mitochondrial DNA (NUMTs) insertions. The cumulative length of NUPTs/NUMTs positively correlates with the size of the nuclear genome, suggesting that insertions occur stochastically. NUPTs/NUMTs show predominantly C:G to T:A changes, with the mutated cytosines typically found in CG and CHG contexts, suggesting that degradation of NUPT and NUMT sequences is driven by the known elevated mutation rate of methylated cytosines. Small interfering RNA loci are enriched in NUPTs and NUMTs, consistent with the RdDM pathway mediating DNA methylation in these sequences.

摘要

植物细胞有两种拥有自身基因组的主要细胞器

叶绿体和线粒体。虽然叶绿体基因组在结构上往往较为保守,但植物的线粒体基因组比动物的线粒体基因组大得多,其特点是结构复杂多变。我们引入了TIPPo,这是一种用户友好、无需参考序列的组装工具,它使用PacBio高保真长读长数据,并且在组装细胞器基因组时不依赖相关物种的基因组或核基因组信息。TIPPo采用深度学习模型进行初始读段分类,并利用k-mer计数进行进一步优化,显著降低了细胞器DNA的核插入对组装过程的影响。我们使用TIPPo完全组装了一组54个完整的叶绿体基因组。没有其他工具能够完全组装这一组基因组。在从大多数物种组装线粒体基因组方面,TIPPo与PMAT相当,但对于几个物种,TIPPo确实实现了更高的完整性。我们还使用组装好的细胞器基因组来识别核质体DNA(NUPTs)和核线粒体DNA(NUMTs)插入的实例。NUPTs/NUMTs的累积长度与核基因组大小呈正相关,这表明插入是随机发生的。NUPTs/NUMTs主要显示C:G到T:A的变化,突变的胞嘧啶通常出现在CG和CHG环境中,这表明NUPT和NUMT序列的降解是由已知的甲基化胞嘧啶升高的突变率驱动的。小干扰RNA位点在NUPTs和NUMTs中富集,这与RNA介导的DNA甲基化途径在这些序列中起作用一致。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/58e2fbd0024c/msae247f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/68696ed63034/msae247f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/85ca95c10dc6/msae247f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/f554b5e0b9f0/msae247f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/3cadc8263273/msae247f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/c3d604e45c9e/msae247f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/4a099449c641/msae247f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/58e2fbd0024c/msae247f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/68696ed63034/msae247f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/85ca95c10dc6/msae247f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/f554b5e0b9f0/msae247f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/3cadc8263273/msae247f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/c3d604e45c9e/msae247f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/4a099449c641/msae247f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd4/11725521/58e2fbd0024c/msae247f7.jpg

相似文献

1
TIPPo: A User-Friendly Tool for De Novo Assembly of Organellar Genomes with High-Fidelity Data.TIPPo:一个用于利用高保真数据从头组装细胞器基因组的用户友好型工具。
Mol Biol Evol. 2025 Jan 6;42(1). doi: 10.1093/molbev/msae247.
2
Intergenomic gene transfer in diploid and allopolyploid Gossypium.二倍体和异源多倍体棉属中的种间基因转移。
BMC Plant Biol. 2019 Nov 12;19(1):492. doi: 10.1186/s12870-019-2041-2.
3
Recent and frequent insertions of chloroplast DNA into maize nuclear chromosomes.叶绿体DNA近期频繁插入玉米核染色体。
Cytogenet Genome Res. 2010 Jul;129(1-3):17-23. doi: 10.1159/000312724. Epub 2010 Jul 13.
4
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.量化基因组进化和人类健康中独立细胞器DNA插入的数量
Genome Biol Evol. 2017 May 1;9(5):1190-1203. doi: 10.1093/gbe/evx078.
5
Analysis of nuclear mitochondrial DNAs and factors affecting patterns of integration in plant species.植物物种中核线粒体DNA分析及影响整合模式的因素
Genes Genet Syst. 2017 Sep 12;92(1):27-33. doi: 10.1266/ggs.16-00039. Epub 2017 Feb 22.
6
Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data.细胞器_PBA,一种用于从PacBio DNA测序数据组装叶绿体和线粒体基因组的流程。
BMC Genomics. 2017 Jan 7;18(1):49. doi: 10.1186/s12864-016-3412-9.
7
Nuclear Integrants of Organellar DNA Contribute to Genome Structure and Evolution in Plants.细胞器 DNA 的核整合体影响植物的基因组结构和进化。
Int J Mol Sci. 2020 Jan 21;21(3):707. doi: 10.3390/ijms21030707.
8
Analysis of plastid and mitochondrial DNA insertions in the nucleus (NUPTs and NUMTs) of six plant species: size, relative age and chromosomal localization.分析 6 种植物核基因组中的质体和线粒体 DNA 插入序列(NUPTs 和 NUMTs):大小、相对年龄和染色体定位。
Heredity (Edinb). 2013 Oct;111(4):314-20. doi: 10.1038/hdy.2013.51. Epub 2013 May 29.
9
Mosaic mitochondrial-plastid insertions into the nuclear genome show evidence of both non-homologous end joining and homologous recombination.线粒体-质体嵌合体插入核基因组显示出非同源末端连接和同源重组的证据。
BMC Evol Biol. 2018 Nov 3;18(1):162. doi: 10.1186/s12862-018-1279-x.
10
MitoHiFi: a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads.MitoHiFi:一个从 PacBio 高保真reads 组装线粒体基因组的 Python 分析流程
BMC Bioinformatics. 2023 Jul 18;24(1):288. doi: 10.1186/s12859-023-05385-y.

引用本文的文献

1
PMAT2: An efficient graphical assembly toolkit for comprehensive organellar genomes.PMAT2:用于完整细胞器基因组的高效图形化组装工具包。
Imeta. 2025 Jul 1;4(4):e70064. doi: 10.1002/imt2.70064. eCollection 2025 Aug.
2
The evolutionary dynamics of organellar pan-genomes in Arabidopsis thaliana.拟南芥细胞器泛基因组的进化动力学
Genome Biol. 2025 Aug 11;26(1):240. doi: 10.1186/s13059-025-03717-0.
3
Oatk: a de novo assembly tool for complex plant organelle genomes.Oatk:一种用于复杂植物细胞器基因组的从头组装工具。

本文引用的文献

1
PMAT: an efficient plant mitogenome assembly toolkit using low-coverage HiFi sequencing data.PMAT:一种利用低覆盖度HiFi测序数据的高效植物线粒体基因组组装工具包。
Hortic Res. 2024 Jan 23;11(3):uhae023. doi: 10.1093/hr/uhae023. eCollection 2024 Mar.
2
CLAW: An automated Snakemake workflow for the assembly of chloroplast genomes from long-read data.CLAW:一个用于从长读数据组装叶绿体基因组的自动化 Snakemake 工作流程。
PLoS Comput Biol. 2024 Feb 9;20(2):e1011870. doi: 10.1371/journal.pcbi.1011870. eCollection 2024 Feb.
3
Methods for assembling complex mitochondrial genomes in land plants.
Genome Biol. 2025 Aug 7;26(1):235. doi: 10.1186/s13059-025-03676-6.
4
PlastidHub: An integrated analysis platform for plastid phylogenomics and comparative genomics.质体中心:一个用于质体系统发育基因组学和比较基因组学的综合分析平台。
Plant Divers. 2025 May 22;47(4):544-560. doi: 10.1016/j.pld.2025.05.005. eCollection 2025 Jul.
5
Minimizing detection bias of somatic mutations in a highly heterozygous oak genome.最小化高度杂合的橡树基因组中体细胞突变的检测偏差。
G3 (Bethesda). 2025 Aug 6;15(8). doi: 10.1093/g3journal/jkaf143.
组装陆地植物复杂线粒体基因组的方法。
J Exp Bot. 2024 Sep 11;75(17):5169-5174. doi: 10.1093/jxb/erae034.
4
Plant organellar genomes: much done, much more to do.植物细胞器基因组:完成了很多,还有更多要做。
Trends Plant Sci. 2024 Jul;29(7):754-769. doi: 10.1016/j.tplants.2023.12.014. Epub 2024 Jan 13.
5
A complete assembly of the rice Nipponbare reference genome.日本晴水稻参考基因组的完整组装。
Mol Plant. 2023 Aug 7;16(8):1232-1236. doi: 10.1016/j.molp.2023.08.003.
6
MitoHiFi: a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads.MitoHiFi:一个从 PacBio 高保真reads 组装线粒体基因组的 Python 分析流程
BMC Bioinformatics. 2023 Jul 18;24(1):288. doi: 10.1186/s12859-023-05385-y.
7
Cycles of satellite and transposon evolution in Arabidopsis centromeres.拟南芥着丝粒卫星和转座子的演化循环。
Nature. 2023 Jun;618(7965):557-565. doi: 10.1038/s41586-023-06062-z. Epub 2023 May 17.
8
Plastid Genome Assembly Using Long-read data.利用长读长数据进行质体基因组组装。
Mol Ecol Resour. 2023 Aug;23(6):1442-1457. doi: 10.1111/1755-0998.13787. Epub 2023 Apr 2.
9
Master graph: an essential integrated assembly model for the plant mitogenome based on a graph-based framework.主图:基于图框架的植物线粒体基因组的基本整合组装模型。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac522.
10
Pushing the limits of HiFi assemblies reveals centromere diversity between two Arabidopsis thaliana genomes.推高 HiFi 组装的极限揭示了两个拟南芥基因组之间着丝粒的多样性。
Nucleic Acids Res. 2022 Nov 28;50(21):12309-12327. doi: 10.1093/nar/gkac1115.