• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

《Unicycler 混合组装中避免质粒内重复序列崩溃和假基因丢失的实用指南和 Galaxy 工作流程》

A practical guide and Galaxy workflow to avoid inter-plasmidic repeat collapse and false gene loss in Unicycler's hybrid assemblies.

机构信息

Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Biological Control, Schwabenheimer Str. 101, 69221 Dossenheim, Germany.

出版信息

Microb Genom. 2024 Jan;10(1). doi: 10.1099/mgen.0.001173.

DOI:10.1099/mgen.0.001173
PMID:38197876
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10868617/
Abstract

Generating complete, high-quality genome assemblies is key for any downstream analysis, such as comparative genomics. For bacterial genome assembly, various algorithms and fully automated pipelines exist, which are free-of-charge and easily accessible. However, these assembly tools often cannot unambiguously resolve a bacterial genome, for example due to the presence of sequence repeat structures on the chromosome or on plasmids. Then, a more sophisticated approach and/or manual curation is needed. Such modifications can be challenging, especially for non-bioinformaticians, because they are generally not considered as a straightforward process. In this study, we propose a standardized approach for manual genome completion focusing on the popular hybrid assembly pipeline Unicycler. The provided Galaxy workflow addresses two weaknesses in Unicycler's hybrid assemblies: (i) collapse of inter-plasmidic repeats and (ii) false loss of single-copy sequences. To demonstrate and validate how to detect and resolve these assembly errors, we use two genomes from the group. By applying the proposed pipeline following an automated assembly, the genome sequence quality can be significantly improved.

摘要

生成完整、高质量的基因组组装对于任何下游分析(如比较基因组学)都是关键。对于细菌基因组组装,存在各种算法和全自动化的管道,它们是免费的,并且易于访问。然而,这些组装工具往往无法明确解析细菌基因组,例如由于染色体或质粒上存在序列重复结构。然后,需要更复杂的方法和/或手动编辑。这种修改可能具有挑战性,特别是对于非生物信息学家而言,因为它们通常不被认为是一个简单的过程。在这项研究中,我们针对流行的混合组装管道 Unicycler 提出了一种标准化的手动基因组完成方法。提供的 Galaxy 工作流程解决了 Unicycler 混合组装中的两个弱点:(i)质粒间重复结构的折叠和(ii)单拷贝序列的错误丢失。为了演示和验证如何检测和解决这些组装错误,我们使用了来自 组的两个基因组。通过在自动化组装后应用提出的管道,可以显著提高基因组序列质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/bec410087c0d/mgen-10-1173-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/38d7728fa601/mgen-10-1173-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/b7f84aee5caf/mgen-10-1173-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/ca57a72abd36/mgen-10-1173-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/26a0a6df007a/mgen-10-1173-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/974277ee9856/mgen-10-1173-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/55054a8ac4fb/mgen-10-1173-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/144edae9f5a0/mgen-10-1173-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/bec410087c0d/mgen-10-1173-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/38d7728fa601/mgen-10-1173-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/b7f84aee5caf/mgen-10-1173-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/ca57a72abd36/mgen-10-1173-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/26a0a6df007a/mgen-10-1173-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/974277ee9856/mgen-10-1173-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/55054a8ac4fb/mgen-10-1173-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/144edae9f5a0/mgen-10-1173-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c780/10868617/bec410087c0d/mgen-10-1173-g008.jpg

相似文献

1
A practical guide and Galaxy workflow to avoid inter-plasmidic repeat collapse and false gene loss in Unicycler's hybrid assemblies.《Unicycler 混合组装中避免质粒内重复序列崩溃和假基因丢失的实用指南和 Galaxy 工作流程》
Microb Genom. 2024 Jan;10(1). doi: 10.1099/mgen.0.001173.
2
Benchmarking hybrid assembly approaches for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.使用 Illumina 和 Oxford Nanopore 测序对细菌病原体进行基因组分析的混合组装方法的基准测试。
BMC Genomics. 2020 Sep 14;21(1):631. doi: 10.1186/s12864-020-07041-8.
3
Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.通过合成基因组文库的长读长测序完成细菌基因组草图
BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6.
4
Complete hybrid genome assembly of clinical multidrug-resistant isolates enables comprehensive identification of antimicrobial-resistance genes and plasmids.完成临床多重耐药株的全混合基因组组装,可全面鉴定抗生素耐药基因和质粒。
Microb Genom. 2019 Nov;5(11). doi: 10.1099/mgen.0.000312.
5
Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes.比较长读长测序技术在复杂细菌基因组混合组装中的应用。
Microb Genom. 2019 Sep;5(9). doi: 10.1099/mgen.0.000294. Epub 2019 Aug 30.
6
A hybrid and poly-polish workflow for the complete and accurate assembly of phage genomes: a case study of ten przondoviruses.一种用于噬菌体基因组完整准确组装的混合与多轮抛光工作流程:以十种普宗多病毒为例
Microb Genom. 2023 Jul;9(7). doi: 10.1099/mgen.0.001065.
7
Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies.利用短读长读测序技术进行尿液细菌的从头杂交基因组组装,生成完整基因组。
J Vis Exp. 2021 Aug 20(174). doi: 10.3791/62872.
8
Significantly improving the quality of genome assemblies through curation.通过编辑显著提高基因组组装的质量。
Gigascience. 2021 Jan 9;10(1). doi: 10.1093/gigascience/giaa153.
9
Genomic analyses of multidrug-resistant Salmonella Indiana, Typhimurium, and Enteritidis isolates using MinION and MiSeq sequencing technologies.使用 MinION 和 MiSeq 测序技术对多重耐药性印第安纳沙门氏菌、鼠伤寒沙门氏菌和肠炎沙门氏菌分离株进行基因组分析。
PLoS One. 2020 Jul 2;15(7):e0235641. doi: 10.1371/journal.pone.0235641. eCollection 2020.
10
Assessing the benefits of using mate-pairs to resolve repeats in de novo short-read prokaryotic assemblies.评估使用 Mate-Pairs 解决从头组装的短读 prokaryotic 重复的好处。
BMC Bioinformatics. 2011 Apr 13;12:95. doi: 10.1186/1471-2105-12-95.

引用本文的文献

1
Mitochondrial genome complexity in Erodium stephanianum (Geraniaceae): nanopore sequencing reveals chloroplast gene transfer and DNA rearrangements.牻牛儿苗(牻牛儿苗科)线粒体基因组的复杂性:纳米孔测序揭示叶绿体基因转移和DNA重排
Front Genet. 2025 Jul 15;16:1641368. doi: 10.3389/fgene.2025.1641368. eCollection 2025.
2
Contributions of Long-Read Sequencing for the Detection of Antimicrobial Resistance.长读长测序在抗菌药物耐药性检测中的贡献
Pathogens. 2024 Aug 28;13(9):730. doi: 10.3390/pathogens13090730.
3
Whole-genome sequence of , isolate recovered from the gut of feeding on cabbage.

本文引用的文献

1
Time- and memory-efficient genome assembly with Raven.使用Raven进行高效省时的基因组组装。
Nat Comput Sci. 2021 May;1(5):332-336. doi: 10.1038/s43588-021-00073-4. Epub 2021 May 20.
2
Long read genome assemblers struggle with small plasmids.长读基因组组装器难以处理小型质粒。
Microb Genom. 2023 May;9(5). doi: 10.1099/mgen.0.001024.
3
Elucidating the genomic history of commercially used subsp. strain NB176.阐明商业上使用的 subsp. 菌株 NB176 的基因组历史。
从以卷心菜为食的[具体生物名称]肠道中分离出的[具体菌株名称]的全基因组序列。 需注意,你提供的原文中部分关键信息缺失,我根据格式进行了补充说明以便更完整地呈现句子意思,你可根据实际情况进行调整。
Microbiol Resour Announc. 2024 Aug 13;13(8):e0033024. doi: 10.1128/mra.00330-24. Epub 2024 Jul 22.
Front Cell Infect Microbiol. 2023 Mar 20;13:1129177. doi: 10.3389/fcimb.2023.1129177. eCollection 2023.
4
Comparing Long-Read Assemblers to Explore the Potential of a Sustainable Low-Cost, Low-Infrastructure Approach to Sequence Antimicrobial Resistant Bacteria With Oxford Nanopore Sequencing.比较长读长测序组装工具,以探索一种可持续的低成本、低基础设施方法利用牛津纳米孔测序对耐抗菌细菌进行测序的潜力。
Front Microbiol. 2022 Mar 3;13:796465. doi: 10.3389/fmicb.2022.796465. eCollection 2022.
5
Nanopore sequencing technology, bioinformatics and applications.纳米孔测序技术、生物信息学及其应用。
Nat Biotechnol. 2021 Nov;39(11):1348-1365. doi: 10.1038/s41587-021-01108-x. Epub 2021 Nov 8.
6
A comprehensive evaluation of long read error correction methods.长读错误纠正方法的综合评价。
BMC Genomics. 2020 Dec 21;21(Suppl 6):889. doi: 10.1186/s12864-020-07227-0.
7
Large-scale network analysis captures biological features of bacterial plasmids.大规模网络分析捕获了细菌质粒的生物学特征。
Nat Commun. 2020 May 15;11(1):2452. doi: 10.1038/s41467-020-16282-w.
8
Opportunities and challenges in long-read sequencing data analysis.长读测序数据分析中的机遇与挑战。
Genome Biol. 2020 Feb 7;21(1):30. doi: 10.1186/s13059-020-1935-5.
9
Benchmarking of long-read assemblers for prokaryote whole genome sequencing.原核生物全基因组测序的长读长组装器基准测试。
F1000Res. 2019 Dec 23;8:2138. doi: 10.12688/f1000research.21782.4. eCollection 2019.
10
Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases.串联重复导致序列组装错误,并对基因组和蛋白质数据库提出了多层次的挑战。
Nucleic Acids Res. 2019 Dec 2;47(21):10994-11006. doi: 10.1093/nar/gkz841.