通过组装 Illumina 双端测序 reads 从复杂微生物群落中生成数百万个 16S rRNA 基因文库。
Generation of multimillion-sequence 16S rRNA gene libraries from complex microbial communities by assembling paired-end illumina reads.
机构信息
Department of Biology, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1.
出版信息
Appl Environ Microbiol. 2011 Jun;77(11):3846-52. doi: 10.1128/AEM.02772-10. Epub 2011 Apr 1.
Microbial communities host unparalleled taxonomic diversity. Adequate characterization of environmental and host-associated samples remains a challenge for microbiologists, despite the advent of 16S rRNA gene sequencing. In order to increase the depth of sampling for diverse bacterial communities, we developed a method for sequencing and assembling millions of paired-end reads from the 16S rRNA gene (spanning the V3 region; ∼200 nucleotides) by using an Illumina genome analyzer. To confirm reproducibility and to identify a suitable computational pipeline for data analysis, sequence libraries were prepared in duplicate for both a defined mixture of DNAs from known cultured bacterial isolates (>1 million postassembly sequences) and an Arctic tundra soil sample (>6 million postassembly sequences). The Illumina 16S rRNA gene libraries represent a substantial increase in number of sequences over all extant next-generation sequencing approaches (e.g., 454 pyrosequencing), while the assembly of paired-end 125-base reads offers a methodological advantage by incorporating an initial quality control step for each 16S rRNA gene sequence. This method incorporates indexed primers to enable the characterization of multiple microbial communities in a single flow cell lane, may be modified readily to target other variable regions or genes, and demonstrates unprecedented and economical access to DNAs from organisms that exist at low relative abundances.
微生物群落拥有无与伦比的分类多样性。尽管 16S rRNA 基因测序已经问世,但对环境和宿主相关样本进行充分的特征描述仍然是微生物学家面临的挑战。为了增加对不同细菌群落的采样深度,我们开发了一种方法,使用 Illumina 基因组分析仪对 16S rRNA 基因(跨越 V3 区;约 200 个核苷酸)的数百万对末端读取进行测序和组装。为了确认重现性并确定适合数据分析的计算管道,我们为已知培养细菌分离物的 DNA(>100 万个组装后序列)和北极冻原生态土壤样本(>600 万个组装后序列)的定义混合物制备了重复的序列文库。与所有现有下一代测序方法(例如 454 焦磷酸测序)相比,Illumina 16S rRNA 基因文库的序列数量大大增加,而通过对每个 16S rRNA 基因序列进行初始质量控制步骤,对末端读取进行配对组装则提供了一种方法上的优势。该方法结合了索引引物,可在单个流动池通道中对多个微生物群落进行特征描述,可轻松修改以针对其他可变区域或基因,并且前所未有且经济地可用于从相对丰度较低的生物体中获取 DNA。
相似文献
BMC Bioinformatics. 2016-12-1
BMC Res Notes. 2016-8-2
引用本文的文献
PLoS One. 2025-7-3
Commun Earth Environ. 2024
本文引用的文献
PLoS One. 2010-8-12
Proc Natl Acad Sci U S A. 2010-6-3
Nat Methods. 2010-5
Environ Microbiol. 2010-3-11