• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

特定微生物模拟群落的下一代测序数据。

Next generation sequencing data of a defined microbial mock community.

作者信息

Singer Esther, Andreopoulos Bill, Bowers Robert M, Lee Janey, Deshpande Shweta, Chiniquy Jennifer, Ciobanu Doina, Klenk Hans-Peter, Zane Matthew, Daum Christopher, Clum Alicia, Cheng Jan-Fang, Copeland Alex, Woyke Tanja

机构信息

DOE Joint Genome Institute, Walnut Creek, California 94598, USA.

Newcastle University, Newcastle upon Tyne, NE1 7RU, UK.

出版信息

Sci Data. 2016 Sep 27;3:160081. doi: 10.1038/sdata.2016.81.

DOI:10.1038/sdata.2016.81
PMID:27673566
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5037974/
Abstract

Generating sequence data of a defined community composed of organisms with complete reference genomes is indispensable for the benchmarking of new genome sequence analysis methods, including assembly and binning tools. Moreover the validation of new sequencing library protocols and platforms to assess critical components such as sequencing errors and biases relies on such datasets. We here report the next generation metagenomic sequence data of a defined mock community (Mock Bacteria ARchaea Community; MBARC-26), composed of 23 bacterial and 3 archaeal strains with finished genomes. These strains span 10 phyla and 14 classes, a range of GC contents, genome sizes, repeat content and encompass a diverse abundance profile. Short read Illumina and long-read PacBio SMRT sequences of this mock community are described. These data represent a valuable resource for the scientific community, enabling extensive benchmarking and comparative evaluation of bioinformatics tools without the need to simulate data. As such, these data can aid in improving our current sequence data analysis toolkit and spur interest in the development of new tools.

摘要

生成由具有完整参考基因组的生物体组成的特定群落的序列数据,对于包括组装和分箱工具在内的新基因组序列分析方法的基准测试而言必不可少。此外,新测序文库协议和平台的验证,以评估诸如测序错误和偏差等关键组件,也依赖于此类数据集。我们在此报告了一个特定模拟群落(模拟细菌古菌群落;MBARC - 26)的新一代宏基因组序列数据,该群落由23种细菌和3种具有完整基因组的古菌菌株组成。这些菌株涵盖10个门和14个纲,具有一系列的GC含量、基因组大小、重复含量,并包含多样化的丰度分布。本文描述了该模拟群落的短读长Illumina序列和长读长PacBio SMRT序列。这些数据为科学界提供了宝贵的资源,无需模拟数据就能对生物信息学工具进行广泛的基准测试和比较评估。因此,这些数据有助于改进我们当前的序列数据分析工具包,并激发对新工具开发的兴趣。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6948/5037974/46230dfbd7cf/sdata201681-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6948/5037974/fe78ddce8ab2/sdata201681-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6948/5037974/2dd60be858e7/sdata201681-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6948/5037974/46230dfbd7cf/sdata201681-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6948/5037974/fe78ddce8ab2/sdata201681-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6948/5037974/2dd60be858e7/sdata201681-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6948/5037974/46230dfbd7cf/sdata201681-f3.jpg

相似文献

1
Next generation sequencing data of a defined microbial mock community.特定微生物模拟群落的下一代测序数据。
Sci Data. 2016 Sep 27;3:160081. doi: 10.1038/sdata.2016.81.
2
Shotgun metagenome data of a defined mock community using Oxford Nanopore, PacBio and Illumina technologies.使用 Oxford Nanopore、PacBio 和 Illumina 技术对一个定义的模拟群落进行的 shotgun 宏基因组数据。
Sci Data. 2019 Nov 26;6(1):285. doi: 10.1038/s41597-019-0287-z.
3
Optimized Illumina PCR-free library preparation for bacterial whole genome sequencing and analysis of factors influencing de novo assembly.用于细菌全基因组测序的优化Illumina无PCR文库制备及影响从头组装的因素分析
BMC Res Notes. 2016 May 12;9:269. doi: 10.1186/s13104-016-2072-9.
4
Toward Complete Bacterial Genome Sequencing Through the Combined Use of Multiple Next-Generation Sequencing Platforms.通过联合使用多种下一代测序平台实现完整细菌基因组测序
J Microbiol Biotechnol. 2016 Jan;26(1):207-12. doi: 10.4014/jmb.1507.07055.
5
Sequence data for Clostridium autoethanogenum using three generations of sequencing technologies.使用三代测序技术获得产乙醇梭菌的序列数据。
Sci Data. 2015 Apr 14;2:150014. doi: 10.1038/sdata.2015.14. eCollection 2015.
6
Characterization of sequence-specific errors in various next-generation sequencing systems.各种新一代测序系统中序列特异性错误的特征分析。
Mol Biosyst. 2016 Mar;12(3):914-22. doi: 10.1039/c5mb00750j.
7
Evaluation and Validation of Assembling Corrected PacBio Long Reads for Microbial Genome Completion via Hybrid Approaches.通过混合方法组装校正后的PacBio长读段以完成微生物基因组的评估与验证
PLoS One. 2015 Dec 7;10(12):e0144305. doi: 10.1371/journal.pone.0144305. eCollection 2015.
8
Long read and single molecule DNA sequencing simplifies genome assembly and TAL effector gene analysis of Xanthomonas translucens.长读长和单分子DNA测序简化了油菜黄单胞菌的基因组组装和TAL效应子基因分析。
BMC Genomics. 2016 Jan 5;17:21. doi: 10.1186/s12864-015-2348-9.
9
Pipeline for amplifying and analyzing amplicons of the V1-V3 region of the 16S rRNA gene.用于扩增和分析16S rRNA基因V1 - V3区域扩增子的流程
BMC Res Notes. 2016 Aug 2;9:380. doi: 10.1186/s13104-016-2172-6.
10
mockrobiota: a Public Resource for Microbiome Bioinformatics Benchmarking.微生物群落模拟物:微生物组生物信息学基准测试的公共资源。
mSystems. 2016 Oct 18;1(5). doi: 10.1128/mSystems.00062-16. eCollection 2016 Sep-Oct.

引用本文的文献

1
Benchmarking and optimizing qualitative and quantitative pipelines in environmental metatranscriptomics using mixture controlling experiments.利用混合控制实验对环境宏转录组学中的定性和定量流程进行基准测试与优化。
ISME Commun. 2025 May 29;5(1):ycaf090. doi: 10.1093/ismeco/ycaf090. eCollection 2025 Jan.
2
Bioindicator "fingerprints" of methane-emitting thermokarst features in Alaskan soils.阿拉斯加土壤中甲烷排放热喀斯特地貌的生物指示“指纹”。
Front Microbiol. 2025 Feb 21;15:1462941. doi: 10.3389/fmicb.2024.1462941. eCollection 2024.
3
Flowtigs: Safety in flow decompositions for assembly graphs.

本文引用的文献

1
MeCorS: Metagenome-enabled error correction of single cell sequencing reads.MeCorS:基于宏基因组的单细胞测序读数纠错
Bioinformatics. 2016 Jul 15;32(14):2199-201. doi: 10.1093/bioinformatics/btw144. Epub 2016 Mar 15.
2
High-resolution phylogenetic microbial community profiling.高分辨率系统发育微生物群落分析
ISME J. 2016 Aug;10(8):2020-32. doi: 10.1038/ismej.2015.249. Epub 2016 Feb 9.
3
Library preparation methodology can influence genomic and functional predictions in human microbiome research.文库制备方法会影响人类微生物组研究中的基因组和功能预测。
Flowtigs:装配图流分解中的安全性
iScience. 2024 Oct 25;27(12):111208. doi: 10.1016/j.isci.2024.111208. eCollection 2024 Dec 20.
4
MIMt: a curated 16S rRNA reference database with less redundancy and higher accuracy at species-level identification.MIMt:一个经过整理的16S rRNA参考数据库,在物种水平鉴定上具有更低的冗余度和更高的准确性。
Environ Microbiome. 2024 Nov 9;19(1):88. doi: 10.1186/s40793-024-00634-w.
5
Unveiling errors in soil microbial community sequencing: a case for reference soils and improved diagnostics for nanopore sequencing.揭示土壤微生物群落测序中的错误:以参比土壤为例,以及对纳米孔测序进行改进的诊断。
Commun Biol. 2024 Jul 28;7(1):913. doi: 10.1038/s42003-024-06594-8.
6
Metapresence: a tool for accurate species detection in metagenomics based on the genome-wide distribution of mapping reads.元现生:一种基于映射读取的全基因组分布的宏基因组学中准确物种检测的工具。
mSystems. 2024 Aug 20;9(8):e0021324. doi: 10.1128/msystems.00213-24. Epub 2024 Jul 9.
7
Benchmarking short-, long- and hybrid-read assemblers for metagenome sequencing of complex microbial communities.对用于复杂微生物群落宏基因组测序的短读长、长读长和混合读长组装器进行基准测试。
Microbiology (Reading). 2024 Jun;170(6). doi: 10.1099/mic.0.001469.
8
Deciphering and designing microbial communities by genome-scale metabolic modelling.通过基因组规模代谢建模解析和设计微生物群落
Comput Struct Biotechnol J. 2024 Apr 22;23:1990-2000. doi: 10.1016/j.csbj.2024.04.055. eCollection 2024 Dec.
9
MAGICIAN: MAG simulation for investigating criteria for bioinformatic analysis.魔术师:用于研究生物信息学分析标准的 MAG 模拟。
BMC Genomics. 2024 Jan 12;25(1):55. doi: 10.1186/s12864-023-09912-2.
10
Assessment of DNA extraction methods for human gut mycobiome analysis.用于人类肠道真菌群落分析的DNA提取方法评估
R Soc Open Sci. 2024 Jan 10;11(1):231129. doi: 10.1098/rsos.231129. eCollection 2024 Jan.
Proc Natl Acad Sci U S A. 2015 Nov 10;112(45):14024-9. doi: 10.1073/pnas.1519288112. Epub 2015 Oct 28.
4
Impact of library preparation protocols and template quantity on the metagenomic reconstruction of a mock microbial community.文库制备方案和模板量对模拟微生物群落宏基因组重建的影响
BMC Genomics. 2015 Oct 24;16:856. doi: 10.1186/s12864-015-2063-6.
5
Evaluation of the Ion Torrent Personal Genome Machine for Gene-Targeted Studies Using Amplicons of the Nitrogenase Gene nifH.使用固氮酶基因nifH的扩增子对Ion Torrent个人基因组测序仪进行基因靶向研究的评估。
Appl Environ Microbiol. 2015 Jul;81(13):4536-45. doi: 10.1128/AEM.00111-15. Epub 2015 Apr 24.
6
Analysis, optimization and verification of Illumina-generated 16S rRNA gene amplicon surveys.对Illumina生成的16S rRNA基因扩增子检测进行分析、优化和验证。
PLoS One. 2014 Apr 10;9(4):e94249. doi: 10.1371/journal.pone.0094249. eCollection 2014.
7
Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences.基于 16S rRNA 标记基因序列对微生物群落进行功能预测分析。
Nat Biotechnol. 2013 Sep;31(9):814-21. doi: 10.1038/nbt.2676. Epub 2013 Aug 25.
8
UPARSE: highly accurate OTU sequences from microbial amplicon reads.UPARSE:从微生物扩增子读取中获得高度准确的 OTU 序列。
Nat Methods. 2013 Oct;10(10):996-8. doi: 10.1038/nmeth.2604. Epub 2013 Aug 18.
9
The advantages of SMRT sequencing.SMRT 测序的优势。
Genome Biol. 2013 Jul 3;14(7):405. doi: 10.1186/gb-2013-14-6-405.
10
Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform.开发一种双索引测序策略和策管管道,用于分析 MiSeq Illumina 测序平台上的扩增子序列数据。
Appl Environ Microbiol. 2013 Sep;79(17):5112-20. doi: 10.1128/AEM.01043-13. Epub 2013 Jun 21.