Chlomito: a novel tool for precise elimination of organelle genome contamination from nuclear genome assembly.

Authors

Song Wei, Li Chong, Lu Yanming, Shen Dawei, Jia Yunxiao, Huo Yixin, Piao Weilan, Jin Hua

Affiliations

Laboratory of Genetics and Disorders, Key Laboratory of Molecular Medicine and Biotherapy, Aerospace Center Hospital, School of Life Science, Beijing Institute of Technology, Beijing, China.

Research Institute for Science and Technology, Beijing Institute of Technology, Beijing, China.

Publication information

Front Plant Sci. 2024 Aug 27;15:1430443. doi: 10.3389/fpls.2024.1430443. eCollection 2024.

DOI:10.3389/fpls.2024.1430443
PMID:39258299
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11385003/
Abstract

INTRODUCTION

Accurate reference genomes are fundamental to understanding biological evolution, biodiversity, hereditary phenomena and diseases. However, assembled nuclear chromosomes are often contaminated by organelle genome sequences, which can mislead bioinformatic analysis and the interpretation of genomic and transcriptomic data.

METHODS

To address this issue, we developed a tool named Chlomito for the precise identification and elimination of organelle genome contamination from nuclear genome assemblies. Unlike conventional approaches, Chlomito uses two new metrics, the alignment length coverage ratio (ALCR) and the sequencing depth ratio (SDR), to distinguish true organelle genome sequences from organelle-derived sequences that were transferred into the nuclear genome via horizontal gene transfer (HGT).
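
The abstract names the two metrics but does not give their formulas or cutoffs, so the following is only a minimal sketch of how such a filter could work. It assumes that ALCR is the fraction of a contig's length that aligns to an organelle reference and that SDR is the contig's mean read depth divided by a baseline nuclear depth. The `Contig` fields, the function names, and the thresholds `min_alcr` and `min_sdr` are all hypothetical and are not taken from Chlomito itself.

```python
from dataclasses import dataclass


@dataclass
class Contig:
    name: str
    length: int        # contig length in bp
    aligned_bp: int    # bp aligned to the organelle reference (assumed non-overlapping)
    mean_depth: float  # mean read depth across the contig


def alcr(contig: Contig) -> float:
    """Assumed definition: aligned length divided by contig length."""
    if contig.length <= 0:
        raise ValueError("contig length must be positive")
    return contig.aligned_bp / contig.length


def sdr(contig: Contig, nuclear_depth: float) -> float:
    """Assumed definition: contig depth divided by baseline nuclear depth."""
    if nuclear_depth <= 0:
        raise ValueError("nuclear depth must be positive")
    return contig.mean_depth / nuclear_depth


def is_organelle(contig: Contig, nuclear_depth: float,
                 min_alcr: float = 0.5, min_sdr: float = 5.0) -> bool:
    """Flag a contig only when BOTH metrics are high (hypothetical cutoffs)."""
    return (alcr(contig) >= min_alcr
            and sdr(contig, nuclear_depth) >= min_sdr)


# Illustrative, made-up numbers (not data from the paper).
NUCLEAR_DEPTH = 30.0
contigs = [
    # Almost fully aligned and very deep: behaves like a true organelle contig.
    Contig("ctg_chloroplast", 150_000, 148_000, 2400.0),
    # Short organelle-like insert inside a long contig at nuclear depth.
    Contig("ctg_nuclear_insert", 2_000_000, 12_000, 31.0),
    # No organelle similarity at all.
    Contig("ctg_nuclear", 5_000_000, 0, 29.0),
]

flagged = [c.name for c in contigs if is_organelle(c, NUCLEAR_DEPTH)]
print(flagged)
```

Requiring both conditions is the point of the design: a nuclear contig carrying a transferred organelle fragment matches the reference over only a small share of its length and is sequenced at ordinary nuclear depth, so it fails both tests and is kept in the assembly.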

RESULTS

The accuracy of Chlomito was tested using sequencing data from plum, mango and [species name]. The results confirmed that Chlomito accurately detects contigs originating from organelle genomes, and that the identified contigs cover most regions of the organelle reference genomes, demonstrating the efficiency and precision of Chlomito. For user convenience, we also packaged the method as a Docker image, which simplifies the data-processing workflow.

DISCUSSION

Overall, Chlomito provides an efficient, accurate and convenient method for identifying and removing contigs derived from organelle genomes in genomic assembly data, contributing to the improvement of genome assembly quality.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88f8/11385003/9a2bfa1541cf/fpls-15-1430443-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88f8/11385003/48bb265f937b/fpls-15-1430443-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88f8/11385003/136f31160efc/fpls-15-1430443-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88f8/11385003/844a2adc6fbe/fpls-15-1430443-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88f8/11385003/855baf5f8246/fpls-15-1430443-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88f8/11385003/cc1825e35039/fpls-15-1430443-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88f8/11385003/28acc7d4007a/fpls-15-1430443-g007.jpg
