贝类病毒宏基因组文库构建及短读长注释方法的比较

Comparison of methods for library construction and short read annotation of shellfish viral metagenomes.

作者信息

Wei Hong-Ying, Huang Sheng, Wang Jiang-Yong, Gao Fang, Jiang Jing-Zhe

机构信息

Key Laboratory of Aquatic Product Processing, Ministry of Agriculture, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, 510300, China.

Shanghai Ocean University, Shanghai, 201306, China.

出版信息

Genes Genomics. 2018 Mar;40(3):281-288. doi: 10.1007/s13258-017-0629-1. Epub 2017 Dec 2.

DOI:10.1007/s13258-017-0629-1

PMID:29892802

Abstract

The emergence and widespread use of high-throughput sequencing technologies have promoted metagenomic studies on environmental or animal samples. Library construction for metagenome sequencing and annotation of the produced sequence reads are important steps in such studies and influence the quality of metagenomic data. In this study, we collected some marine mollusk samples, such as Crassostrea hongkongensis, Chlamys farreri, and Ruditapes philippinarum, from coastal areas in South China. These samples were divided into two batches to compare two library construction methods for shellfish viral metagenome. Our analysis showed that reverse-transcribing RNA into cDNA and then amplifying it simultaneously with DNA by whole genome amplification (WGA) yielded a larger amount of DNA compared to using only WGA or WTA (whole transcriptome amplification). Moreover, higher quality libraries were obtained by agarose gel extraction rather than with AMPure bead size selection. However, the latter can also provide good results if combined with the adjustment of the filter parameters. This, together with its simplicity, makes it a viable alternative. Finally, we compared three annotation tools (BLAST, DIAMOND, and Taxonomer) and two reference databases (NCBI's NR and Uniprot's Uniref). Considering the limitations of computing resources and data transfer speed, we propose the use of DIAMOND with Uniref for annotating metagenomic short reads as its running speed can guarantee a good annotation rate. This study may serve as a useful reference for selecting methods for Shellfish viral metagenome library construction and read annotation.

摘要

高通量测序技术的出现和广泛应用推动了对环境或动物样本的宏基因组学研究。宏基因组测序的文库构建以及对所产生序列读数的注释是此类研究中的重要步骤，并会影响宏基因组数据的质量。在本研究中，我们从中国南方沿海地区采集了一些海洋软体动物样本，如香港牡蛎、栉孔扇贝和菲律宾蛤仔。这些样本被分为两批，以比较两种贝类病毒宏基因组文库构建方法。我们的分析表明，与仅使用全基因组扩增（WGA）或全转录组扩增（WTA）相比，将RNA逆转录为cDNA，然后通过全基因组扩增与DNA同时进行扩增，可产生更多的DNA。此外，通过琼脂糖凝胶提取获得的文库质量更高，而不是使用磁珠大小选择法。然而，如果结合过滤器参数的调整，后者也能提供良好结果。这一点，再加上其操作简便，使其成为一个可行的选择。最后，我们比较了三种注释工具（BLAST、DIAMOND和Taxonomer）和两个参考数据库（NCBI的NR和Uniprot的Uniref）。考虑到计算资源和数据传输速度的限制，我们建议使用DIAMOND和Uniref对宏基因组短读段进行注释，因为其运行速度能够保证较高的注释率。本研究可为选择贝类病毒宏基因组文库构建和读段注释方法提供有益参考。

相似文献

Comparison of methods for library construction and short read annotation of shellfish viral metagenomes.贝类病毒宏基因组文库构建及短读长注释方法的比较

Genes Genomics. 2018 Mar;40(3):281-288. doi: 10.1007/s13258-017-0629-1. Epub 2017 Dec 2.

Pre- and post-sequencing recommendations for functional annotation of human fecal metagenomes.人类粪便宏基因组功能注释的测序前和测序后建议。

BMC Bioinformatics. 2020 Feb 24;21(1):74. doi: 10.1186/s12859-020-3416-y.

ViraPipe: scalable parallel pipeline for viral metagenome analysis from next generation sequencing reads.ViraPipe：用于从下一代测序读取中进行病毒宏基因组分析的可扩展并行管道。

Bioinformatics. 2018 Mar 15;34(6):928-935. doi: 10.1093/bioinformatics/btx702.

Evaluating techniques for metagenome annotation using simulated sequence data.使用模拟序列数据评估宏基因组注释技术。

FEMS Microbiol Ecol. 2016 Jul;92(7). doi: 10.1093/femsec/fiw095. Epub 2016 May 8.

Exploiting topic modeling to boost metagenomic reads binning.利用主题建模来促进宏基因组读数分箱。

BMC Bioinformatics. 2015;16 Suppl 5(Suppl 5):S2. doi: 10.1186/1471-2105-16-S5-S2. Epub 2015 Mar 18.

Assembly and Annotation of Viral Metagenomes from Short-Read Sequencing Data.病毒宏基因组的短读测序数据组装与注释。

Methods Mol Biol. 2023;2649:317-337. doi: 10.1007/978-1-0716-3072-3_17.

Practical evaluation of 11 de novo assemblers in metagenome assembly.宏基因组组装中11种从头组装程序的实际评估

J Microbiol Methods. 2018 Aug;151:99-105. doi: 10.1016/j.mimet.2018.06.007. Epub 2018 Jun 25.

Assessment of REPLI-g Multiple Displacement Whole Genome Amplification (WGA) Techniques for Metagenomic Applications.用于宏基因组学应用的REPLI-g多重置换全基因组扩增（WGA）技术评估

J Biomol Tech. 2017 Apr;28(1):46-55. doi: 10.7171/jbt.17-2801-008. Epub 2017 Mar 21.

Joining Illumina paired-end reads for classifying phylogenetic marker sequences.将 Illumina 配对末端读取用于分类系统发育标记序列。

BMC Bioinformatics. 2020 Mar 14;21(1):105. doi: 10.1186/s12859-020-3445-6.

Preparation of metagenomic libraries from naturally occurring marine viruses.从天然存在的海洋病毒中制备宏基因组文库。

Methods Enzymol. 2013;531:143-65. doi: 10.1016/B978-0-12-407863-5.00008-3.

引用本文的文献

Molecular assessment of oyster microbiomes and viromes reveals their potential as pathogen and ecological sentinels.牡蛎微生物组和病毒组的分子评估揭示了它们作为病原体和生态哨兵的潜力。

One Health. 2025 Jan 13;20:100973. doi: 10.1016/j.onehlt.2025.100973. eCollection 2025 Jun.

Identification and classification of the genomes of novel microviruses in poultry slaughterhouse.家禽屠宰场中新型微小病毒基因组的鉴定与分类

Front Microbiol. 2024 May 2;15:1393153. doi: 10.3389/fmicb.2024.1393153. eCollection 2024.

Combined Analysis of the Whole Transcriptome of Piglets Infected with SADS-CoV Virulent and Avirulent Strains.感染SADS-CoV强毒株和无毒株仔猪全转录组的联合分析

Microorganisms. 2023 Feb 6;11(2):409. doi: 10.3390/microorganisms11020409.

本文引用的文献

Rapid protein alignment in the cloud: HAMOND combines fast DIAMOND alignments with Hadoop parallelism.快速的云端蛋白质序列比对：HAMOND 将快速的 DIAMOND 比对与 Hadoop 并行处理相结合。

J Biotechnol. 2017 Sep 10;257:58-60. doi: 10.1016/j.jbiotec.2017.02.020. Epub 2017 Feb 21.

Taxonomer: an interactive metagenomics analysis portal for universal pathogen detection and host mRNA expression profiling.分类学家：一个用于通用病原体检测和宿主mRNA表达谱分析的交互式宏基因组学分析门户。

Genome Biol. 2016 May 26;17(1):111. doi: 10.1186/s13059-016-0969-1.

CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.克拉克：使用判别性k-mer对宏基因组和基因组序列进行快速准确分类

BMC Genomics. 2015 Mar 25;16(1):236. doi: 10.1186/s12864-015-1419-2.

Impacts of the Three Gorges Dam on microbial structure and potential function.三峡大坝对微生物结构和潜在功能的影响。

Sci Rep. 2015 Feb 27;5:8605. doi: 10.1038/srep08605.

Multiple displacement amplification of whole genomic DNA from urediospores of Puccinia striiformis f. sp. tritici.来自小麦条锈菌夏孢子全基因组DNA的多重置换扩增。

Curr Genet. 2015 May;61(2):221-30. doi: 10.1007/s00294-014-0470-x. Epub 2015 Feb 5.

Fast and sensitive protein alignment using DIAMOND.使用 DIAMOND 进行快速灵敏的蛋白质比对。

Nat Methods. 2015 Jan;12(1):59-60. doi: 10.1038/nmeth.3176. Epub 2014 Nov 17.

GHOSTX: an improved sequence homology search algorithm using a query suffix array and a database suffix array.GHOSTX：一种使用查询后缀数组和数据库后缀数组改进的序列同源搜索算法。

PLoS One. 2014 Aug 6;9(8):e103833. doi: 10.1371/journal.pone.0103833. eCollection 2014.

Kraken: ultrafast metagenomic sequence classification using exact alignments.克拉肯：使用精确比对的超快速宏基因组序列分类

Genome Biol. 2014 Mar 3;15(3):R46. doi: 10.1186/gb-2014-15-3-r46.

Comparison of phi29-based whole genome amplification and whole transcriptome amplification in dengue virus.基于 phi29 的全基因组扩增与全转录组扩增在登革病毒中的比较。

J Virol Methods. 2014 Jan;195:141-7. doi: 10.1016/j.jviromet.2013.10.005. Epub 2013 Oct 12.

Metagenomic analysis of viruses from bat fecal samples reveals many novel viruses in insectivorous bats in China.对来自中国食虫蝙蝠粪便样本中的病毒进行宏基因组分析，揭示了许多新型病毒。

J Virol. 2012 Apr;86(8):4620-30. doi: 10.1128/JVI.06671-11. Epub 2012 Feb 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

贝类病毒宏基因组文库构建及短读长注释方法的比较

Comparison of methods for library construction and short read annotation of shellfish viral metagenomes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献