Hahn Andrea, Sanyal Amit, Perez Geovanny F, Colberg-Poley Anamaris M, Campos Joseph, Rose Mary C, Pérez-Losada Marcos
Division of Infectious Diseases, Children's National Health System (CNHS), Washington, DC, USA; Research Center for Genetic Medicine, CNHS, Washington, DC, USA; Department of Pediatrics, George Washington University (GWU), Washington, DC, USA.
GWU School of Medicine, Washington, DC, USA.
J Microbiol Methods. 2016 Nov;130:95-99. doi: 10.1016/j.mimet.2016.09.002. Epub 2016 Sep 5.
Cystic fibrosis (CF) is an autosomal recessive disease characterized by recurrent lung infections. Studies of the lung microbiome have shown an association between decreasing diversity and progressive disease. 454 pyrosequencing has frequently been used to study the lung microbiome in CF, but will no longer be supported. We sought to identify the benefits and drawbacks of using two state-of-the-art next generation sequencing (NGS) platforms, MiSeq and PacBio RSII, to characterize the CF lung microbiome. Each has its advantages and limitations.
Twelve samples of extracted bacterial DNA were sequenced on both MiSeq and PacBio NGS platforms. DNA was amplified for the V4 region of the 16S rRNA gene and libraries were sequenced on the MiSeq sequencing platform, while the full 16S rRNA gene was sequenced on the PacBio RSII sequencing platform. Raw FASTQ files generated by the MiSeq and PacBio platforms were processed in mothur v1.35.1.
There was extreme discordance in alpha-diversity of the CF lung microbiome when using the two platforms. Because of its depth of coverage, sequencing of the 16S rRNA V4 gene region using MiSeq allowed for the observation of many more operational taxonomic units (OTUs) and higher Chao1 and Shannon indices than the PacBio RSII. Interestingly, several patients in our cohort had Escherichia, an unusual pathogen in CF. Also, likely because of its coverage of the complete 16S rRNA gene, only PacBio RSII was able to identify Burkholderia, an important CF pathogen.
When comparing microbiome diversity in clinical samples from CF patients using 16S sequences, MiSeq and PacBio NGS platforms may generate different results in microbial community composition and structure. It may be necessary to use different platforms when trying to correctly identify dominant pathogens versus measuring alpha-diversity estimates, and it would be important to use the same platform for comparisons to minimize errors in interpretation.
囊性纤维化(CF)是一种常染色体隐性疾病,其特征为反复的肺部感染。肺部微生物组研究表明,微生物多样性降低与疾病进展之间存在关联。454焦磷酸测序法曾经常用于研究CF患者的肺部微生物组,但该方法将不再被支持。我们试图确定使用两种最先进的新一代测序(NGS)平台,即MiSeq和PacBio RSII,来表征CF肺部微生物组的优缺点。每种平台都有其优势和局限性。
在MiSeq和PacBio NGS平台上对12份提取的细菌DNA样本进行测序。对16S rRNA基因的V4区域进行DNA扩增,并在MiSeq测序平台上对文库进行测序,而完整的16S rRNA基因则在PacBio RSII测序平台上进行测序。由MiSeq和PacBio平台生成的原始FASTQ文件在mothur v1.35.1中进行处理。
使用这两种平台时,CF肺部微生物组的α多样性存在极大差异。由于MiSeq对16S rRNA V4基因区域的测序深度,与PacBio RSII相比,能够观察到更多的可操作分类单元(OTU),以及更高的Chao1和香农指数。有趣的是,我们队列中的几名患者感染了大肠埃希菌,这在CF中是一种不常见的病原体。此外,可能由于其对完整16S rRNA基因的覆盖,只有PacBio RSII能够鉴定出伯克霍尔德菌,这是一种重要的CF病原体。
在使用16S序列比较CF患者临床样本中的微生物组多样性时,MiSeq和PacBio NGS平台可能会在微生物群落组成和结构方面产生不同的结果。在试图正确识别主要病原体与测量α多样性估计值时,可能需要使用不同的平台,并且为了尽量减少解释中的误差,在进行比较时使用相同的平台非常重要。