• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用CycloneSEQ高效构建完整基因组以填补细菌草图组装中的缺口。

Efficiently constructing complete genomes with CycloneSEQ to fill gaps in bacterial draft assemblies.

作者信息

Liang Hewei, Zou Yuanqiang, Wang Mengmeng, Hu Tongyuan, Wang Haoyu, He Wenxin, Ju Yanmei, Guo Ruijin, Chen Junyi, Guo Fei, Zeng Tao, Dong Yuliang, Zhang Yuning, Wang Bo, Liu Chuanyu, Jin Xin, Zhang Wenwei, Xu Xun, Xiao Liang

机构信息

BGI Research, Shenzhen 518083, China.

BGI Research, Wuhan 430074, China.

出版信息

GigaByte. 2025 Apr 25;2025:gigabyte154. doi: 10.46471/gigabyte.154. eCollection 2025.

DOI:10.46471/gigabyte.154
PMID:40329937
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12051259/
Abstract

Current microbial sequencing relies on short-read platforms like Illumina and DNBSEQ, which are cost-effective and accurate but often produce fragmented draft genomes. Here, we used CycloneSEQ for long-read sequencing of ATCC BAA-835, producing long-reads with an average length of 11.6 kbp and an average quality score of 14.4. Hybrid assembly with short-reads data resulted in an error rate of only 0.04 mismatches and 0.08 indels per 100 kbp compared to the reference genome. This method, validated across nine species, successfully assembled complete circular genomes. Hybrid assembly significantly enhances genome completeness by using long-reads to fill gaps and accurately assembling multi-copy rRNA genes, unlike short-reads alone. Data subsampling showed that combining over 500 Mbp of short-read data with 100 Mbp of long-read data yields high-quality circular assemblies. CycloneSEQ long-reads improves the assembly of circular complete genomes from mixed microbial communities; however, its base quality needs improving. Integrating DNBSEQ short-reads improved accuracy, resulting in complete and accurate assemblies.

摘要

当前的微生物测序依赖于Illumina和DNBSEQ等短读长平台,这些平台具有成本效益且准确,但通常会产生碎片化的基因组草图。在这里,我们使用CycloneSEQ对ATCC BAA-835进行长读长测序,产生的长读长平均长度为11.6 kbp,平均质量分数为14.4。与短读长数据进行混合组装,与参考基因组相比,每100 kbp的错误率仅为0.04个错配和0.08个插入缺失。该方法在九个物种中得到验证,成功组装出完整的环状基因组。与仅使用短读长不同,混合组装通过使用长读长填补缺口并准确组装多拷贝rRNA基因,显著提高了基因组的完整性。数据二次抽样表明,将超过500 Mbp的短读长数据与100 Mbp的长读长数据相结合,可产生高质量的环状组装。CycloneSEQ长读长改进了来自混合微生物群落的环状完整基因组的组装;然而,其碱基质量需要改进。整合DNBSEQ短读长提高了准确性,从而实现了完整且准确的组装。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/7d487a005a3b/gigabyte-2025-154-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/2d54eeda790d/gigabyte-2025-154-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/76eabff35f2f/gigabyte-2025-154-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/f03a9a2bf21e/gigabyte-2025-154-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/065ef02c7018/gigabyte-2025-154-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/787712c3b05f/gigabyte-2025-154-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/a6b18c4d394f/gigabyte-2025-154-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/1f0ab516ebf2/gigabyte-2025-154-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/82436ad6cddd/gigabyte-2025-154-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/4c4853aad804/gigabyte-2025-154-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/8e2a49446093/gigabyte-2025-154-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/b350344fdd82/gigabyte-2025-154-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/7d487a005a3b/gigabyte-2025-154-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/2d54eeda790d/gigabyte-2025-154-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/76eabff35f2f/gigabyte-2025-154-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/f03a9a2bf21e/gigabyte-2025-154-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/065ef02c7018/gigabyte-2025-154-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/787712c3b05f/gigabyte-2025-154-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/a6b18c4d394f/gigabyte-2025-154-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/1f0ab516ebf2/gigabyte-2025-154-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/82436ad6cddd/gigabyte-2025-154-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/4c4853aad804/gigabyte-2025-154-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/8e2a49446093/gigabyte-2025-154-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/b350344fdd82/gigabyte-2025-154-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f4/12051259/7d487a005a3b/gigabyte-2025-154-g012.jpg

相似文献

1
Efficiently constructing complete genomes with CycloneSEQ to fill gaps in bacterial draft assemblies.使用CycloneSEQ高效构建完整基因组以填补细菌草图组装中的缺口。
GigaByte. 2025 Apr 25;2025:gigabyte154. doi: 10.46471/gigabyte.154. eCollection 2025.
2
Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.通过合成基因组文库的长读长测序完成细菌基因组草图
BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6.
3
Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.用 Illumina 短读序列对牛津纳米孔长读序列组装的细菌病原体进行打磨,以改进基因组分析。
Genomics. 2021 May;113(3):1366-1377. doi: 10.1016/j.ygeno.2021.03.018. Epub 2021 Mar 11.
4
Evaluating the accuracy of Listeria monocytogenes assemblies from quasimetagenomic samples using long and short reads.评估使用长读长和短读长的准宏基因组样本中单核细胞增生李斯特菌组装的准确性。
BMC Genomics. 2021 May 26;22(1):389. doi: 10.1186/s12864-021-07702-2.
5
A comprehensive investigation of metagenome assembly by linked-read sequencing.基于链接读取测序的宏基因组组装综合研究。
Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.
6
Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes.比较长读长测序技术在复杂细菌基因组混合组装中的应用。
Microb Genom. 2019 Sep;5(9). doi: 10.1099/mgen.0.000294. Epub 2019 Aug 30.
7
Assembly of Mitochondrial Genomes Using Nanopore Long-Read Technology in Three Sea Chubs (Teleostei: Kyphosidae).利用纳米孔长读长技术组装三种海鲷(硬骨鱼纲:舵鱼科)的线粒体基因组
Mol Ecol Resour. 2025 Jan;25(1):e14034. doi: 10.1111/1755-0998.14034. Epub 2024 Oct 15.
8
Assembly of chloroplast genomes with long- and short-read data: a comparison of approaches using Eucalyptus pauciflora as a test case.利用长读长和短读数据组装叶绿体基因组:以白千层作为测试案例的方法比较。
BMC Genomics. 2018 Dec 29;19(1):977. doi: 10.1186/s12864-018-5348-8.
9
Are we there yet? Benchmarking low-coverage nanopore long-read sequencing for the assembling of mitochondrial genomes using the vulnerable silky shark Carcharhinus falciformis.我们到了吗?使用脆弱的灰鲭鲨(Carcharhinus falciformis)对低覆盖度纳米孔长读测序进行线粒体基因组组装的基准测试。
BMC Genomics. 2022 Apr 22;23(1):320. doi: 10.1186/s12864-022-08482-z.
10
Evaluation and Validation of Assembling Corrected PacBio Long Reads for Microbial Genome Completion via Hybrid Approaches.通过混合方法组装校正后的PacBio长读段以完成微生物基因组的评估与验证
PLoS One. 2015 Dec 7;10(12):e0144305. doi: 10.1371/journal.pone.0144305. eCollection 2015.

引用本文的文献

1
Telomere-to-telomere African wild rice (Oryza longistaminata) reference genome reveals segmental and structural variation.端粒到端粒的非洲野生稻(长雄蕊野生稻)参考基因组揭示了片段和结构变异。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf074.

本文引用的文献

1
Sequencing-based analysis of microbiomes.基于测序的微生物组分析。
Nat Rev Genet. 2024 Dec;25(12):829-845. doi: 10.1038/s41576-024-00746-6. Epub 2024 Jun 25.
2
A treasure trove of 1034 actinomycete genomes.1034 株放线菌基因组宝库。
Nucleic Acids Res. 2024 Jul 22;52(13):7487-7503. doi: 10.1093/nar/gkae523.
3
Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp.使用fastp进行超快速单通道FASTQ数据预处理、质量控制和重复数据删除。
Imeta. 2023 May 8;2(2):e107. doi: 10.1002/imt2.107. eCollection 2023 May.
4
The Helicobacter pylori Genome Project: insights into H. pylori population structure from analysis of a worldwide collection of complete genomes.幽门螺杆菌基因组项目:从全球完整基因组集合分析中对 H. pylori 种群结构的深入了解。
Nat Commun. 2023 Dec 11;14(1):8184. doi: 10.1038/s41467-023-43562-y.
5
CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning.CheckM2:一种使用机器学习快速、可扩展且准确评估微生物基因组质量的工具。
Nat Methods. 2023 Aug;20(8):1203-1212. doi: 10.1038/s41592-023-01940-w. Epub 2023 Jul 27.
6
A catalog of bacterial reference genomes from cultivated human oral bacteria.从培养的人类口腔细菌中获得的细菌参考基因组目录。
NPJ Biofilms Microbiomes. 2023 Jul 3;9(1):45. doi: 10.1038/s41522-023-00414-3.
7
NCTC3000: a century of bacterial strain collecting leads to a rich genomic data resource.NCTC3000:百年菌株收集成就丰富的基因组学数据资源。
Microb Genom. 2023 May;9(5). doi: 10.1099/mgen.0.000976.
8
The genomic landscape of reference genomes of cultivated human gut bacteria.人类肠道共生菌培养参考基因组的基因组景观。
Nat Commun. 2023 Mar 25;14(1):1663. doi: 10.1038/s41467-023-37396-x.
9
Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets.评价长读 shotgun 宏基因组测序数据集的分类和分析方法。
BMC Bioinformatics. 2022 Dec 13;23(1):541. doi: 10.1186/s12859-022-05103-0.
10
Short- and long-read metagenomics expand individualized structural variations in gut microbiomes.短读和长读宏基因组学扩展了肠道微生物组的个体化结构变异。
Nat Commun. 2022 Jun 8;13(1):3175. doi: 10.1038/s41467-022-30857-9.