• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 Meta-NanoSim 对宏基因组纳米孔测序数据进行特征描述和模拟。

Characterization and simulation of metagenomic nanopore sequencing data with Meta-NanoSim.

机构信息

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada.

Bioinformatics Graduate Program, University of British Columbia, Genome Sciences Centre, BCCA 100-570 West 7th Avenue, Vancouver, BC, V5Z 4S6, Canada.

出版信息

Gigascience. 2023 Mar 20;12. doi: 10.1093/gigascience/giad013.

DOI:10.1093/gigascience/giad013
PMID:36939007
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10025935/
Abstract

BACKGROUND

Nanopore sequencing is crucial to metagenomic studies as its kilobase-long reads can contribute to resolving genomic structural differences among microbes. However, sequencing platform-specific challenges, including high base-call error rate, nonuniform read lengths, and the presence of chimeric artifacts, necessitate specifically designed analytical algorithms. The use of simulated datasets with characteristics that are true to the sequencing platform under evaluation is a cost-effective way to assess the performance of bioinformatics tools with the ground truth in a controlled environment.

RESULTS

Here, we present Meta-NanoSim, a fast and versatile utility that characterizes and simulates the unique properties of nanopore metagenomic reads. It improves upon state-of-the-art methods on microbial abundance estimation through a base-level quantification algorithm. Meta-NanoSim can simulate complex microbial communities composed of both linear and circular genomes and can stream reference genomes from online servers directly. Simulated datasets showed high congruence with experimental data in terms of read length, error profiles, and abundance levels. We demonstrate that Meta-NanoSim simulated data can facilitate the development of metagenomic algorithms and guide experimental design through a metagenome assembly benchmarking task.

CONCLUSIONS

The Meta-NanoSim characterization module investigates read features, including chimeric information and abundance levels, while the simulation module simulates large and complex multisample microbial communities with different abundance profiles. All trained models and the software are freely accessible at GitHub: https://github.com/bcgsc/NanoSim.

摘要

背景

纳米孔测序对于宏基因组研究至关重要,因为其长达千碱基的读长有助于解决微生物之间的基因组结构差异。然而,测序平台特有的挑战,包括高碱基调用错误率、不均匀的读长和嵌合伪影的存在,需要专门设计分析算法。使用具有评估测序平台特征的模拟数据集是在受控环境中使用真实数据评估生物信息学工具性能的一种经济有效的方法。

结果

在这里,我们提出了 Meta-NanoSim,这是一种快速而通用的实用程序,用于描述和模拟纳米孔宏基因组读长的独特特性。它通过一种基于碱基的定量算法改进了微生物丰度估计的最先进方法。Meta-NanoSim 可以模拟由线性和圆形基因组组成的复杂微生物群落,并可以直接从在线服务器流式传输参考基因组。模拟数据集在读长、错误分布和丰度水平方面与实验数据高度一致。我们通过宏基因组组装基准测试任务证明,Meta-NanoSim 模拟数据可以促进宏基因组算法的开发并指导实验设计。

结论

Meta-NanoSim 的特征描述模块研究了读长的特征,包括嵌合信息和丰度水平,而模拟模块则模拟了具有不同丰度分布的大型复杂多样本微生物群落。所有训练的模型和软件都可以在 GitHub 上免费获取:https://github.com/bcgsc/NanoSim。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/df973cb1458c/giad013fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/1d2761cadae7/giad013fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/df0a8725af77/giad013fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/7cabeea51fb0/giad013fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/6f56093769f3/giad013fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/df973cb1458c/giad013fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/1d2761cadae7/giad013fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/df0a8725af77/giad013fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/7cabeea51fb0/giad013fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/6f56093769f3/giad013fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8976/10025935/df973cb1458c/giad013fig5.jpg

相似文献

1
Characterization and simulation of metagenomic nanopore sequencing data with Meta-NanoSim.利用 Meta-NanoSim 对宏基因组纳米孔测序数据进行特征描述和模拟。
Gigascience. 2023 Mar 20;12. doi: 10.1093/gigascience/giad013.
2
NanoSim: nanopore sequence read simulator based on statistical characterization.NanoSim:基于统计特征的纳米孔序列读取模拟器。
Gigascience. 2017 Apr 1;6(4):1-6. doi: 10.1093/gigascience/gix010.
3
Trans-NanoSim characterizes and simulates nanopore RNA-sequencing data.跨纳米模拟技术对纳米孔 RNA 测序数据进行了特征描述和模拟。
Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa061.
4
Advancing metagenome-assembled genome-based pathogen identification: unraveling the power of long-read assembly algorithms in Oxford Nanopore sequencing.推进宏基因组组装基因组为基础的病原体鉴定:揭示长读长组装算法在牛津纳米孔测序中的强大功能。
Microbiol Spectr. 2024 Jun 4;12(6):e0011724. doi: 10.1128/spectrum.00117-24. Epub 2024 Apr 30.
5
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
6
Assembly methods for nanopore-based metagenomic sequencing: a comparative study.基于纳米孔的宏基因组测序的组装方法:一项比较研究。
Sci Rep. 2020 Aug 12;10(1):13588. doi: 10.1038/s41598-020-70491-3.
7
cgMSI: pathogen detection within species from nanopore metagenomic sequencing data.cgMSI:从纳米孔宏基因组测序数据中检测种内病原体。
BMC Bioinformatics. 2023 Oct 12;24(1):387. doi: 10.1186/s12859-023-05512-9.
8
Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets.评价长读 shotgun 宏基因组测序数据集的分类和分析方法。
BMC Bioinformatics. 2022 Dec 13;23(1):541. doi: 10.1186/s12859-022-05103-0.
9
CAMISIM: simulating metagenomes and microbial communities.CAMISIM:模拟宏基因组和微生物群落。
Microbiome. 2019 Feb 8;7(1):17. doi: 10.1186/s40168-019-0633-6.
10
Ultra-deep, long-read nanopore sequencing of mock microbial community standards.超深度、长读长纳米孔测序模拟微生物群落标准品。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz043.

引用本文的文献

1
Detecting Foldback Artifacts in Long Reads.检测长读段中的回环伪影。
bioRxiv. 2025 Jul 18:2025.07.15.664946. doi: 10.1101/2025.07.15.664946.
2
Species-resolved profiling of antibiotic resistance genes in complex metagenomes through long-read overlapping with Argo.通过与Argo的长读长重叠对复杂宏基因组中的抗生素抗性基因进行物种解析分析。
Nat Commun. 2025 Feb 18;16(1):1744. doi: 10.1038/s41467-025-57088-y.
3
MeStanG-Resource for High-Throughput Sequencing Standard Data Sets Generation for Bioinformatic Methods Evaluation and Validation.

本文引用的文献

1
Nanopore adaptive sampling: a tool for enrichment of low abundance species in metagenomic samples.纳米孔自适应采样:一种用于宏基因组样本中低丰度物种富集的工具。
Genome Biol. 2022 Jan 24;23(1):11. doi: 10.1186/s13059-021-02582-x.
2
Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes.比较长读测序技术在细菌和果蝇基因组分析中的应用。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab083.
3
Alvis: a tool for contig and read ALignment VISualisation and chimera detection.Alvis:用于重叠群和读取序列比对可视化和嵌合体检测的工具。
MeStanG - 用于生物信息学方法评估和验证的高通量测序标准数据集生成资源。
Biology (Basel). 2025 Jan 14;14(1):69. doi: 10.3390/biology14010069.
4
Melon: metagenomic long-read-based taxonomic identification and quantification using marker genes.甜瓜:基于宏基因组长读长的标记基因进行分类鉴定和定量。
Genome Biol. 2024 Aug 19;25(1):226. doi: 10.1186/s13059-024-03363-y.
5
Easing genomic surveillance: A comprehensive performance evaluation of long-read assemblers across multi-strain mixture data of HIV-1 and Other pathogenic viruses for constructing a user-friendly bioinformatic pipeline.简化基因组监测:针对 HIV-1 和其他病原性病毒的多菌株混合数据,对长读长组装器进行全面性能评估,以构建用户友好的生物信息学管道。
F1000Res. 2024 May 31;13:556. doi: 10.12688/f1000research.149577.1. eCollection 2024.
6
MCSS: microbial community simulator based on structure.MCSS:基于结构的微生物群落模拟器
Front Microbiol. 2024 Mar 7;15:1358257. doi: 10.3389/fmicb.2024.1358257. eCollection 2024.
7
TKSM: highly modular, user-customizable, and scalable transcriptomic sequencing long-read simulator.TKSM:高度模块化、用户可自定义和可扩展的转录组测序长读模拟程序。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae051.
BMC Bioinformatics. 2021 Mar 16;22(1):124. doi: 10.1186/s12859-021-04056-0.
4
Ensembl 2021.Ensembl 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D884-D891. doi: 10.1093/nar/gkaa942.
5
metaFlye: scalable long-read metagenome assembly using repeat graphs.metaFlye:使用重复图进行可扩展的长读长宏基因组组装。
Nat Methods. 2020 Nov;17(11):1103-1110. doi: 10.1038/s41592-020-00971-x. Epub 2020 Oct 5.
6
Trans-NanoSim characterizes and simulates nanopore RNA-sequencing data.跨纳米模拟技术对纳米孔 RNA 测序数据进行了特征描述和模拟。
Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa061.
7
yacrd and fpa: upstream tools for long-read genome assembly.YACRD 和 FPA:用于长读长基因组组装的上游工具。
Bioinformatics. 2020 Jun 1;36(12):3894-3896. doi: 10.1093/bioinformatics/btaa262.
8
A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster.一个涉及 2019 年新型冠状病毒的家庭聚集性肺炎病例,提示存在人际传播:一项家庭聚集性研究。
Lancet. 2020 Feb 15;395(10223):514-523. doi: 10.1016/S0140-6736(20)30154-9. Epub 2020 Jan 24.
9
Strain-level metagenomic assignment and compositional estimation for long reads with MetaMaps.使用 MetaMaps 对长读进行菌株水平宏基因组分配和组成估计。
Nat Commun. 2019 Jul 11;10(1):3066. doi: 10.1038/s41467-019-10934-2.
10
Nanopore metagenomics enables rapid clinical diagnosis of bacterial lower respiratory infection.纳米孔宏基因组学可快速临床诊断细菌性下呼吸道感染。
Nat Biotechnol. 2019 Jul;37(7):783-792. doi: 10.1038/s41587-019-0156-5. Epub 2019 Jun 24.