• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MeStanG - 用于生物信息学方法评估和验证的高通量测序标准数据集生成资源。

MeStanG-Resource for High-Throughput Sequencing Standard Data Sets Generation for Bioinformatic Methods Evaluation and Validation.

作者信息

Ramos Lopez Daniel, Flores Francisco J, Espindola Andres S

机构信息

Institute for Biosecurity and Microbial Forensics (IBMF), Oklahoma State University, Stillwater, OK 74078, USA.

Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA.

出版信息

Biology (Basel). 2025 Jan 14;14(1):69. doi: 10.3390/biology14010069.

DOI:10.3390/biology14010069
PMID:39857299
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11762867/
Abstract

Metagenomics analysis has enabled the measurement of the microbiome diversity in environmental samples without prior targeted enrichment. Functional and phylogenetic studies based on microbial diversity retrieved using HTS platforms have advanced from detecting known organisms and discovering unknown species to applications in disease diagnostics. Robust validation processes are essential for test reliability, requiring standard samples and databases deriving from real samples and in silico generated artificial controls. We propose a MeStanG as a resource for generating HTS Nanopore data sets to evaluate present and emerging bioinformatics pipelines. MeStanG allows samples to be designed with user-defined organism abundances expressed as number of reads, reference sequences, and predetermined or custom errors by sequencing profiles. The simulator pipeline was evaluated by analyzing its output mock metagenomic samples containing known read abundances using read mapping, genome assembly, and taxonomic classification on three scenarios: a bacterial community composed of nine different organisms, samples resembling pathogen-infected wheat plants, and a viral pathogen serial dilution sampling. The evaluation was able to report consistently the same organisms, and their read abundances as provided in the mock metagenomic sample design. Based on this performance and its novel capacity of generating exact number of reads, MeStanG can be used by scientists to develop mock metagenomic samples (artificial HTS data sets) to assess the diagnostic performance metrics of bioinformatic pipelines, allowing the user to choose predetermined or customized models for research and training.

摘要

宏基因组学分析能够在无需事先进行靶向富集的情况下,对环境样本中的微生物群落多样性进行测量。基于使用高通量测序(HTS)平台获取的微生物多样性开展的功能和系统发育研究,已从检测已知生物体和发现未知物种发展到应用于疾病诊断。强大的验证流程对于测试可靠性至关重要,这需要源自真实样本的标准样本和数据库以及计算机生成的人工对照。我们提出了MeStanG,作为一种用于生成HTS纳米孔数据集以评估当前和新兴生物信息学流程的资源。MeStanG允许用户根据测序图谱,以读取数、参考序列以及预定或自定义错误来设计具有用户定义生物体丰度的样本。通过在三种情况下对包含已知读取丰度的模拟宏基因组样本输出进行分析,即由九种不同生物体组成的细菌群落、类似病原体感染小麦植株的样本以及病毒病原体系列稀释采样,对模拟器流程进行了评估。评估能够一致地报告模拟宏基因组样本设计中所提供的相同生物体及其读取丰度。基于这一性能及其生成精确读取数的新能力,科学家可以使用MeStanG来开发模拟宏基因组样本(人工HTS数据集),以评估生物信息学流程的诊断性能指标,允许用户选择预定或定制模型用于研究和培训。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/9c89358297cf/biology-14-00069-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/77979a5cb3bc/biology-14-00069-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/46e3768f6bad/biology-14-00069-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/6d6100f49532/biology-14-00069-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/543e2d55ac93/biology-14-00069-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/9c89358297cf/biology-14-00069-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/77979a5cb3bc/biology-14-00069-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/46e3768f6bad/biology-14-00069-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/6d6100f49532/biology-14-00069-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/543e2d55ac93/biology-14-00069-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b8b/11762867/9c89358297cf/biology-14-00069-g005.jpg

相似文献

1
MeStanG-Resource for High-Throughput Sequencing Standard Data Sets Generation for Bioinformatic Methods Evaluation and Validation.MeStanG - 用于生物信息学方法评估和验证的高通量测序标准数据集生成资源。
Biology (Basel). 2025 Jan 14;14(1):69. doi: 10.3390/biology14010069.
2
UltraSEQ, a Universal Bioinformatic Platform for Information-Based Clinical Metagenomics and Beyond.UltraSEQ,一个基于信息的临床宏基因组学及其他领域的通用生物信息学平台。
Microbiol Spectr. 2023 Jun 15;11(3):e0416022. doi: 10.1128/spectrum.04160-22. Epub 2023 Apr 11.
3
METAnnotatorX2: a Comprehensive Tool for Deep and Shallow Metagenomic Data Set Analyses.METAnnotatorX2:用于深度和浅层宏基因组数据集分析的综合工具。
mSystems. 2021 Jun 29;6(3):e0058321. doi: 10.1128/mSystems.00583-21.
4
Advancing metagenome-assembled genome-based pathogen identification: unraveling the power of long-read assembly algorithms in Oxford Nanopore sequencing.推进宏基因组组装基因组为基础的病原体鉴定:揭示长读长组装算法在牛津纳米孔测序中的强大功能。
Microbiol Spectr. 2024 Jun 4;12(6):e0011724. doi: 10.1128/spectrum.00117-24. Epub 2024 Apr 30.
5
Impact of microbiological molecular methodologies on adaptive sampling using nanopore sequencing in metagenomic studies.微生物分子方法对宏基因组研究中使用纳米孔测序进行适应性采样的影响。
Environ Microbiome. 2025 May 5;20(1):47. doi: 10.1186/s40793-025-00704-7.
6
Comparative analysis of metagenomic classifiers for long-read sequencing datasets.长读测序数据集的宏基因组分类器的比较分析。
BMC Bioinformatics. 2024 Jan 11;25(1):15. doi: 10.1186/s12859-024-05634-8.
7
Ultra-deep, long-read nanopore sequencing of mock microbial community standards.超深度、长读长纳米孔测序模拟微生物群落标准品。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz043.
8
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
9
Concatenation of paired-end reads improves taxonomic classification of amplicons for profiling microbial communities.拼接成对的末端读取可提高微生物群落分析中扩增子分类的分类学分类。
BMC Bioinformatics. 2021 Oct 12;22(1):493. doi: 10.1186/s12859-021-04410-2.
10
PRONAME: a user-friendly pipeline to process long-read nanopore metabarcoding data by generating high-quality consensus sequences.PRONAME:一个用户友好型流程,通过生成高质量的一致性序列来处理长读长纳米孔宏条形码数据。
Front Bioinform. 2024 Dec 20;4:1483255. doi: 10.3389/fbinf.2024.1483255. eCollection 2024.

本文引用的文献

1
Simulated High Throughput Sequencing Datasets: A Crucial Tool for Validating Bioinformatic Pathogen Detection Pipelines.模拟高通量测序数据集:验证生物信息病原体检测流程的关键工具。
Biology (Basel). 2024 Sep 6;13(9):700. doi: 10.3390/biology13090700.
2
High-throughput nanopore DNA sequencing of large insert fosmid clones directly from bacterial colonies.高通量纳米孔 DNA 测序直接从细菌菌落中进行大片段 fosmid 克隆。
Appl Environ Microbiol. 2024 Jun 18;90(6):e0024324. doi: 10.1128/aem.00243-24. Epub 2024 May 20.
3
High-quality metagenome assembly from long accurate reads with metaMDBG.
使用 metaMDBG 从长而准确的读取中进行高质量的宏基因组组装。
Nat Biotechnol. 2024 Sep;42(9):1378-1383. doi: 10.1038/s41587-023-01983-6. Epub 2024 Jan 2.
4
Characterization and simulation of metagenomic nanopore sequencing data with Meta-NanoSim.利用 Meta-NanoSim 对宏基因组纳米孔测序数据进行特征描述和模拟。
Gigascience. 2023 Mar 20;12. doi: 10.1093/gigascience/giad013.
5
Nanopore Technology Applied to Targeted Detection of Tomato Brown Rugose Fruit Virus Allows Sequencing of Related Viruses and the Diagnosis of Mixed Infections.应用于番茄褐色皱纹果病毒靶向检测的纳米孔技术可对相关病毒进行测序并诊断混合感染。
Plants (Basel). 2023 Feb 22;12(5):999. doi: 10.3390/plants12050999.
6
Validation of a Metagenomic Next-Generation Sequencing Assay for Lower Respiratory Pathogen Detection.宏基因组下一代测序检测下呼吸道病原体的验证。
Microbiol Spectr. 2023 Feb 14;11(1):e0381222. doi: 10.1128/spectrum.03812-22. Epub 2022 Dec 12.
7
Performance evaluation of six popular short-read simulators.六种流行的短读序列模拟程序的性能评估。
Heredity (Edinb). 2023 Feb;130(2):55-63. doi: 10.1038/s41437-022-00577-3. Epub 2022 Dec 10.
8
Strain level microbial detection and quantification with applications to single cell metagenomics.利用单细胞宏基因组学进行菌株水平微生物检测和定量。
Nat Commun. 2022 Oct 28;13(1):6430. doi: 10.1038/s41467-022-33869-7.
9
Nanopore Sequencing for De Novo Bacterial Genome Assembly and Search for Single-Nucleotide Polymorphism.纳米孔测序用于从头组装细菌基因组和寻找单核苷酸多态性。
Int J Mol Sci. 2022 Aug 2;23(15):8569. doi: 10.3390/ijms23158569.
10
Opportunities and challenges of using metagenomic data to bring uncultured microbes into cultivation.利用宏基因组数据将未培养微生物带入培养的机遇与挑战。
Microbiome. 2022 May 12;10(1):76. doi: 10.1186/s40168-022-01272-5.