• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

短克隆还是长克隆?宏基因组学中使用配对reads 的模拟研究。

Short clones or long clones? A simulation study on the use of paired reads in metagenomics.

机构信息

Center for Bioinformatics ZBIT, Tübingen University, Sand 14, 72076 Tübingen, Germany.

出版信息

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2105-11-S1-S12.

DOI:10.1186/1471-2105-11-S1-S12
PMID:20122183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3009484/
Abstract

BACKGROUND

Metagenomics is the study of environmental samples using sequencing. Rapid advances in sequencing technology are fueling a vast increase in the number and scope of metagenomics projects. Most metagenome sequencing projects so far have been based on Sanger or Roche-454 sequencing, as only these technologies provide long enough reads, while Illumina sequencing has not been considered suitable for metagenomic studies due to a short read length of only 35 bp. However, now that reads of length 75 bp can be sequenced in pairs, Illumina sequencing has become a viable option for metagenome studies.

RESULTS

This paper addresses the problem of taxonomical analysis of paired reads. We describe a new feature of our metagenome analysis software MEGAN that allows one to process sequencing reads in pairs and makes assignments of such reads based on the combined bit scores of their matches to reference sequences. Using this new software in a simulation study, we investigate the use of Illumina paired-sequencing in taxonomical analysis and compare the performance of single reads, short clones and long clones. In addition, we also compare against simulated Roche-454 sequencing runs.

CONCLUSION

This work shows that paired reads perform better than single reads, as expected, but also, perhaps slightly less obviously, that long clones allow more specific assignments than short ones. A new version of the program MEGAN that explicitly takes paired reads into account is available from our website.

摘要

背景

宏基因组学是使用测序技术研究环境样本的学科。测序技术的快速发展推动了宏基因组学项目数量和范围的大幅增加。到目前为止,大多数宏基因组测序项目都是基于 Sanger 或 Roche-454 测序的,因为只有这些技术能够提供足够长的读长,而 Illumina 测序由于读长仅为 35bp 而不适合宏基因组学研究。然而,现在可以对长度为 75bp 的读长进行双端测序,因此 Illumina 测序已经成为宏基因组学研究的一种可行选择。

结果

本文解决了双端测序的分类分析问题。我们描述了我们的宏基因组分析软件 MEGAN 的一个新功能,该功能允许对测序读长进行双端处理,并基于其与参考序列匹配的综合位得分来对这些读长进行分配。使用这种新软件进行模拟研究,我们调查了 Illumina 双端测序在分类分析中的应用,并比较了单读长、短克隆和长克隆的性能。此外,我们还与模拟的 Roche-454 测序运行进行了比较。

结论

这项工作表明,双端读长的表现优于单端读长,这是预期的结果,但也可能不太明显的是,长克隆比短克隆允许更具体的分配。我们的网站上提供了一个新版本的 MEGAN 程序,该程序明确考虑了双端读长。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/67ef824ad3c2/1471-2105-11-S1-S12-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/68190bce798a/1471-2105-11-S1-S12-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/738ddf0c33c2/1471-2105-11-S1-S12-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/4d8909cc07ae/1471-2105-11-S1-S12-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/94e8649b87bf/1471-2105-11-S1-S12-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/df4bb02b01da/1471-2105-11-S1-S12-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/f7310cac8d9f/1471-2105-11-S1-S12-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/e40e323a4726/1471-2105-11-S1-S12-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/ea24b58e7c2c/1471-2105-11-S1-S12-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/67ef824ad3c2/1471-2105-11-S1-S12-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/68190bce798a/1471-2105-11-S1-S12-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/738ddf0c33c2/1471-2105-11-S1-S12-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/4d8909cc07ae/1471-2105-11-S1-S12-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/94e8649b87bf/1471-2105-11-S1-S12-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/df4bb02b01da/1471-2105-11-S1-S12-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/f7310cac8d9f/1471-2105-11-S1-S12-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/e40e323a4726/1471-2105-11-S1-S12-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/ea24b58e7c2c/1471-2105-11-S1-S12-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5db5/3009484/67ef824ad3c2/1471-2105-11-S1-S12-9.jpg

相似文献

1
Short clones or long clones? A simulation study on the use of paired reads in metagenomics.短克隆还是长克隆?宏基因组学中使用配对reads 的模拟研究。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2105-11-S1-S12.
2
MEGAN-LR: new algorithms allow accurate binning and easy interactive exploration of metagenomic long reads and contigs.MEGAN-LR:新算法允许对宏基因组长读段和 contigs 进行准确的分箱和轻松的交互式探索。
Biol Direct. 2018 Apr 20;13(1):6. doi: 10.1186/s13062-018-0208-7.
3
Minerva: an alignment- and reference-free approach to deconvolve Linked-Reads for metagenomics.Minerva:一种无需对齐和参考的方法,用于解析宏基因组学的 Linked-Reads。
Genome Res. 2019 Jan;29(1):116-124. doi: 10.1101/gr.235499.118. Epub 2018 Dec 6.
4
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
5
A comprehensive investigation of metagenome assembly by linked-read sequencing.基于链接读取测序的宏基因组组装综合研究。
Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.
6
Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads.快速而简单的基于蛋白质比对的微生物组测序读段中直系同源基因家族组装方法。
Microbiome. 2017 Jan 25;5(1):11. doi: 10.1186/s40168-017-0233-2.
7
Improving the sensitivity of long read overlap detection using grouped short k-mer matches.利用分组短 k-mer 匹配提高长读重叠检测的灵敏度。
BMC Genomics. 2019 Apr 4;20(Suppl 2):190. doi: 10.1186/s12864-019-5475-x.
8
A better sequence-read simulator program for metagenomics.一个更好的宏基因组学序列读取模拟程序。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S14. doi: 10.1186/1471-2105-15-S9-S14. Epub 2014 Sep 10.
9
Joining Illumina paired-end reads for classifying phylogenetic marker sequences.将 Illumina 配对末端读取用于分类系统发育标记序列。
BMC Bioinformatics. 2020 Mar 14;21(1):105. doi: 10.1186/s12859-020-3445-6.
10
Unlocking short read sequencing for metagenomics.解锁宏基因组学的短读测序。
PLoS One. 2010 Jul 28;5(7):e11840. doi: 10.1371/journal.pone.0011840.

引用本文的文献

1
Assessment of Common and Emerging Bioinformatics Pipelines for Targeted Metagenomics.针对靶向宏基因组学的常见及新兴生物信息学流程评估
PLoS One. 2017 Jan 4;12(1):e0169563. doi: 10.1371/journal.pone.0169563. eCollection 2017.
2
Taxonomic Characterization of Honey Bee (Apis mellifera) Pollen Foraging Based on Non-Overlapping Paired-End Sequencing of Nuclear Ribosomal Loci.基于核糖体基因座非重叠双端测序的蜜蜂(西方蜜蜂)花粉采集的分类学特征分析
PLoS One. 2015 Dec 23;10(12):e0145365. doi: 10.1371/journal.pone.0145365. eCollection 2015.
3
Environmental genes and genomes: understanding the differences and challenges in the approaches and software for their analyses.

本文引用的文献

1
Visual and statistical comparison of metagenomes.元基因组的可视化和统计比较。
Bioinformatics. 2009 Aug 1;25(15):1849-55. doi: 10.1093/bioinformatics/btp341. Epub 2009 Jun 10.
2
Integron gene cassettes and degradation of compounds associated with industrial waste: the case of the Sydney tar ponds.整合子基因盒与工业废物相关化合物的降解:以悉尼焦油池为例
PLoS One. 2009;4(4):e5276. doi: 10.1371/journal.pone.0005276. Epub 2009 Apr 23.
3
Assembling the marine metagenome, one cell at a time.一次一个细胞地组装海洋宏基因组。
环境基因与基因组:理解其分析方法和软件中的差异与挑战。
Brief Bioinform. 2015 Sep;16(5):745-58. doi: 10.1093/bib/bbv001. Epub 2015 Feb 11.
4
Binpairs: utilization of Illumina paired-end information for improving efficiency of taxonomic binning of metagenomic sequences.双端序列对:利用Illumina双端测序信息提高宏基因组序列分类分箱的效率
PLoS One. 2014 Dec 31;9(12):e114814. doi: 10.1371/journal.pone.0114814. eCollection 2014.
5
Evaluation of short read metagenomic assembly.短读宏基因组组装评估。
BMC Genomics. 2011;12 Suppl 2(Suppl 2):S8. doi: 10.1186/1471-2164-12-S2-S8. Epub 2011 Jul 27.
6
A metagenomic study of methanotrophic microorganisms in Coal Oil Point seep sediments.煤焦油点渗漏沉积物中甲烷营养微生物的宏基因组研究。
BMC Microbiol. 2011 Oct 4;11:221. doi: 10.1186/1471-2180-11-221.
7
Reanalyze unassigned reads in Sanger based metagenomic data using conserved gene adjacency.使用保守基因邻接关系重新分析基于 Sanger 的宏基因组数据中的未分配读取。
BMC Bioinformatics. 2010 Nov 18;11:565. doi: 10.1186/1471-2105-11-565.
8
Unlocking short read sequencing for metagenomics.解锁宏基因组学的短读测序。
PLoS One. 2010 Jul 28;5(7):e11840. doi: 10.1371/journal.pone.0011840.
PLoS One. 2009;4(4):e5299. doi: 10.1371/journal.pone.0005299. Epub 2009 Apr 23.
4
Comparative metagenomics of Daphnia symbionts.水蚤共生体的比较宏基因组学
BMC Genomics. 2009 Apr 21;10:172. doi: 10.1186/1471-2164-10-172.
5
Application of 'next-generation' sequencing technologies to microbial genetics.“下一代”测序技术在微生物遗传学中的应用。
Nat Rev Microbiol. 2009 Apr;7(4):287-96. doi: 10.1038/nrmicro2122.
6
The mitochondrial genome sequence of the Tasmanian tiger (Thylacinus cynocephalus).袋狼(袋狼属)的线粒体基因组序列。
Genome Res. 2009 Feb;19(2):213-20. doi: 10.1101/gr.082628.108. Epub 2009 Jan 12.
7
A bioinformatician's guide to metagenomics.宏基因组学的生物信息学指南。
Microbiol Mol Biol Rev. 2008 Dec;72(4):557-78, Table of Contents. doi: 10.1128/MMBR.00009-08.
8
Accurate whole human genome sequencing using reversible terminator chemistry.使用可逆终止子化学法进行准确的全人类基因组测序。
Nature. 2008 Nov 6;456(7218):53-9. doi: 10.1038/nature07517.
9
MetaSim: a sequencing simulator for genomics and metagenomics.MetaSim:一款用于基因组学和宏基因组学的测序模拟器。
PLoS One. 2008 Oct 8;3(10):e3373. doi: 10.1371/journal.pone.0003373.
10
Simultaneous assessment of soil microbial community structure and function through analysis of the meta-transcriptome.通过宏转录组分析同时评估土壤微生物群落结构和功能。
PLoS One. 2008 Jun 25;3(6):e2527. doi: 10.1371/journal.pone.0002527.