• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PacBio长读长技术改进宏基因组组装、基因目录构建和基因组分箱。

PacBio Long Reads Improve Metagenomic Assemblies, Gene Catalogs, and Genome Binning.

作者信息

Xie Haiying, Yang Caiyun, Sun Yamin, Igarashi Yasuo, Jin Tao, Luo Feng

机构信息

Research Center of Bioenergy and Bioremediation, College of Resources and Environment, Southwest University, Chongqing, China.

PUROTON Gene Medical Institute Co., Ltd., Chongqing, China.

出版信息

Front Genet. 2020 Sep 8;11:516269. doi: 10.3389/fgene.2020.516269. eCollection 2020.

DOI:10.3389/fgene.2020.516269
PMID:33101371
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7506068/
Abstract

PacBio long reads sequencing presents several potential advantages for DNA assembly, including being able to provide more complete gene profiling of metagenomic samples. However, lower single-pass accuracy can make gene discovery and assembly for low-abundance organisms difficult. To evaluate the application and performance of PacBio long reads and Illumina HiSeq short reads in metagenomic analyses, we directly compared various assemblies involving PacBio and Illumina sequencing reads based on two anaerobic digestion microbiome samples from a biogas fermenter. Using a PacBio platform, 1.58 million long reads (19.6 Gb) were produced with an average length of 7,604 bp. Using an Illumina HiSeq platform, 151.2 million read pairs (45.4 Gb) were produced. Hybrid assemblies using PacBio long reads and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length, contig N50 size, and number of large contigs. Interestingly, depth-based hybrid assemblies generated a higher percentage of complete genes (98.86%) compared to those based on HiSeq contigs only (40.29%), because the PacBio reads were long enough to cover many repeating short elements and capture multiple genes in a single read. Additionally, the incorporation of PacBio long reads led to considerable advantages regarding reducing contig numbers and increasing the completeness of the genome reconstruction, which was poorly assembled and binned when using HiSeq data alone. From this comparison of PacBio long reads with Illumina HiSeq short reads related to complex microbiome samples, we conclude that PacBio long reads can produce longer contigs, more complete genes, and better genome binning, thereby offering more information about metagenomic samples.

摘要

PacBio长读长测序在DNA组装方面具有几个潜在优势,包括能够提供更完整的宏基因组样本基因谱。然而,较低的单通道准确性会使低丰度生物的基因发现和组装变得困难。为了评估PacBio长读长和Illumina HiSeq短读长在宏基因组分析中的应用和性能,我们基于来自沼气发酵罐的两个厌氧消化微生物组样本,直接比较了涉及PacBio和Illumina测序读长的各种组装。使用PacBio平台,产生了158万个长读长(19.6 Gb),平均长度为7604 bp。使用Illumina HiSeq平台,产生了1.512亿对读段(45.4 Gb)。使用PacBio长读长和HiSeq重叠群进行的混合组装在组装统计方面有改进,包括平均重叠群长度、重叠群N50大小和大重叠群数量的增加。有趣的是,与仅基于HiSeq重叠群的组装(40.29%)相比,基于深度的混合组装产生了更高比例的完整基因(98.86%),因为PacBio读长足够长,可以覆盖许多重复的短元件,并在单个读长中捕获多个基因。此外,纳入PacBio长读长在减少重叠群数量和提高基因组重建完整性方面带来了相当大的优势,仅使用HiSeq数据时,基因组重建组装效果不佳且分类也不好。通过对与复杂微生物组样本相关的PacBio长读长和Illumina HiSeq短读长的比较,我们得出结论,PacBio长读长可以产生更长的重叠群、更完整的基因和更好的基因组分类,从而提供更多关于宏基因组样本的信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/96377d174508/fgene-11-516269-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/e6b417f19e8b/fgene-11-516269-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/432dbbe0611d/fgene-11-516269-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/64bbf33060cf/fgene-11-516269-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/96377d174508/fgene-11-516269-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/e6b417f19e8b/fgene-11-516269-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/432dbbe0611d/fgene-11-516269-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/64bbf33060cf/fgene-11-516269-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a604/7506068/96377d174508/fgene-11-516269-g004.jpg

相似文献

1
PacBio Long Reads Improve Metagenomic Assemblies, Gene Catalogs, and Genome Binning.PacBio长读长技术改进宏基因组组装、基因目录构建和基因组分箱。
Front Genet. 2020 Sep 8;11:516269. doi: 10.3389/fgene.2020.516269. eCollection 2020.
2
Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data.利用长读长环形一致序列数据改进宏基因组组装和分类分箱
Sci Rep. 2016 May 9;6:25373. doi: 10.1038/srep25373.
3
An evaluation of the PacBio RS platform for sequencing and de novo assembly of a chloroplast genome.评价 PacBio RS 平台在叶绿体基因组测序和从头组装方面的应用。
BMC Genomics. 2013 Oct 1;14:670. doi: 10.1186/1471-2164-14-670.
4
PacBio But Not Illumina Technology Can Achieve Fast, Accurate and Complete Closure of the High GC, Complex Two-Chromosome Genome.PacBio技术而非Illumina技术能够实现对高GC含量、复杂的双染色体基因组的快速、准确且完整的封闭。
Front Microbiol. 2017 Aug 2;8:1448. doi: 10.3389/fmicb.2017.01448. eCollection 2017.
5
Comparisons of genome assembly tools for characterization of genomes using hybrid sequencing technologies.利用混合测序技术对基因组进行特征分析的基因组组装工具比较。
PeerJ. 2024 Aug 29;12:e17964. doi: 10.7717/peerj.17964. eCollection 2024.
6
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.病毒宏基因组组装中的碎片化和覆盖度变化,及其对多样性计算的影响。
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
7
MetaCluster-TA: taxonomic annotation for metagenomic data based on assembly-assisted binning.MetaCluster-TA:基于组装辅助分箱的宏基因组数据分类注释。
BMC Genomics. 2014;15 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2164-15-S1-S12. Epub 2014 Jan 24.
8
A comprehensive investigation of metagenome assembly by linked-read sequencing.基于链接读取测序的宏基因组组装综合研究。
Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.
9
Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.通过合成基因组文库的长读长测序完成细菌基因组草图
BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6.
10
Efficient and accurate whole genome assembly and methylome profiling of E. coli.高效准确的大肠杆菌全基因组组装和甲基组分析。
BMC Genomics. 2013 Oct 3;14(1):675. doi: 10.1186/1471-2164-14-675.

引用本文的文献

1
A metagenomic approach to demystify the anaerobic digestion black box and achieve higher biogas yield: a review.一种用于揭开厌氧消化黑箱之谜并实现更高沼气产量的宏基因组学方法:综述
Front Microbiol. 2024 Oct 11;15:1437098. doi: 10.3389/fmicb.2024.1437098. eCollection 2024.
2
Metagenomic assemblies tend to break around antibiotic resistance genes.元基因组组装往往在抗生素抗性基因周围断裂。
BMC Genomics. 2024 Oct 14;25(1):959. doi: 10.1186/s12864-024-10876-0.
3
Sample and Library Preparation for PacBio Long-Read Sequencing in Grapevine.

本文引用的文献

1
Insights into the ecological roles and evolution of methyl-coenzyme M reductase-containing hot spring Archaea.洞悉含甲基辅酶 M 还原酶的温泉古菌的生态作用和进化。
Nat Commun. 2019 Oct 8;10(1):4574. doi: 10.1038/s41467-019-12574-y.
2
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.MetaBAT 2:一种用于从宏基因组组装中进行稳健且高效的基因组重建的自适应分箱算法。
PeerJ. 2019 Jul 26;7:e7359. doi: 10.7717/peerj.7359. eCollection 2019.
3
Hybrid metagenomic assembly enables high-resolution analysis of resistance determinants and mobile elements in human microbiomes.
葡萄 PacBio 长读测序的样品和文库制备。
Methods Mol Biol. 2024;2787:183-197. doi: 10.1007/978-1-0716-3778-4_12.
4
The long and short of it: benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies.简而言之:使用Illumina、Nanopore和PacBio测序技术对病毒组进行基准测试。
Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001198.
5
Unraveling metagenomics through long-read sequencing: a comprehensive review.通过长读测序揭示宏基因组学:全面综述。
J Transl Med. 2024 Jan 28;22(1):111. doi: 10.1186/s12967-024-04917-1.
6
Metagenomic binning of PacBio HiFi data prior to assembly reveals a complete genome of (Germar) (Coleopterea: Curculionidae, Dryophthorinae) the most damaging arthropod pest of bananas and plantains.PacBio HiFi 数据的宏基因组 binning 在组装前揭示了 (Germar) (鞘翅目:象甲科,切根象甲族)的完整基因组,它是香蕉和大蕉最具破坏性的节肢动物害虫。
PeerJ. 2023 Nov 22;11:e16276. doi: 10.7717/peerj.16276. eCollection 2023.
7
Biosynthetic gene cluster synteny: Orthologous polyketide synthases in Hypogymnia physodes, Hypogymnia tubulosa, and Parmelia sulcata.生物合成基因簇同线性:Hypogymnia physodes、Hypogymnia tubulosa 和 Parmelia sulcata 中的同源聚酮合酶。
Microbiologyopen. 2023 Oct;12(5):e1386. doi: 10.1002/mbo3.1386.
8
The Application of Metagenomics to Study Microbial Communities and Develop Desirable Traits in Fermented Foods.宏基因组学在研究微生物群落及培育发酵食品优良特性中的应用
Foods. 2022 Oct 21;11(20):3297. doi: 10.3390/foods11203297.
9
Present and future outlooks on environmental DNA-based methods for antibiotic discovery.基于环境 DNA 的抗生素发现方法的现状与展望。
Curr Opin Microbiol. 2023 Oct;75:102335. doi: 10.1016/j.mib.2023.102335. Epub 2023 Jun 14.
10
Biosynthetic Potential of Holobionts: Insights into Secondary Metabolite Pathways.共生生物的生物合成潜力:对次生代谢物途径的洞察
J Fungi (Basel). 2023 May 9;9(5):546. doi: 10.3390/jof9050546.
混合宏基因组组装可实现人类微生物组中抗性决定因子和移动元件的高分辨率分析。
Nat Biotechnol. 2019 Aug;37(8):937-944. doi: 10.1038/s41587-019-0191-2. Epub 2019 Jul 29.
4
Long-read based de novo assembly of low-complexity metagenome samples results in finished genomes and reveals insights into strain diversity and an active phage system.基于长读长测序的从头组装方法可用于低复杂度宏基因组样本,从而获得完成的基因组,并深入了解菌株多样性和活跃的噬菌体系统。
BMC Microbiol. 2019 Jun 25;19(1):143. doi: 10.1186/s12866-019-1500-0.
5
Genetic repertoires of anaerobic microbiomes driving generation of biogas.驱动沼气产生的厌氧微生物群落的基因库。
Biotechnol Biofuels. 2018 Sep 20;11:255. doi: 10.1186/s13068-018-1258-x. eCollection 2018.
6
Microbial effects of part-stream low-frequency ultrasonic pretreatment on sludge anaerobic digestion as revealed by high-throughput sequencing-based metagenomics and metatranscriptomics.基于高通量测序的宏基因组学和宏转录组学揭示部分流低频超声预处理对污泥厌氧消化的微生物效应
Biotechnol Biofuels. 2018 Feb 21;11:47. doi: 10.1186/s13068-018-1042-y. eCollection 2018.
7
Landscape topography structures the soil microbiome in arctic polygonal tundra.地貌地形塑造了北极多边形苔原的土壤微生物群落。
Nat Commun. 2018 Feb 22;9(1):777. doi: 10.1038/s41467-018-03089-z.
8
Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.近 8000 个宏基因组组装基因组的恢复极大地扩展了生命之树。
Nat Microbiol. 2017 Nov;2(11):1533-1542. doi: 10.1038/s41564-017-0012-7. Epub 2017 Sep 11.
9
Canu: scalable and accurate long-read assembly via adaptive -mer weighting and repeat separation.Canu:通过自适应k-mer加权和重复序列分离实现可扩展且准确的长读长序列拼接
Genome Res. 2017 May;27(5):722-736. doi: 10.1101/gr.215087.116. Epub 2017 Mar 15.
10
DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies.DBG2OLC:利用第三代测序技术的长错误读长进行大规模基因组的高效组装。
Sci Rep. 2016 Aug 30;6:31900. doi: 10.1038/srep31900.