GENES DIFFUSION, Douai 59501, France.
PEGASE-Biosciences, Institut Pasteur de Lille, Lille 59019, France.
Sci Data. 2017 Jun 27;4:170081. doi: 10.1038/sdata.2017.81.
In the past decade, metagenomics studies have become widespread due to the arrival of second-generation sequencing platforms characterized by low costs, high throughput and short read lengths. Today, although benchtop sequencers are considered accurate platforms for delivering data for targeted metagenomics studies, the limiting factor has become the analysis of these data. In a previous paper, we performed Ion Torrent PGM 16S rDNA gene sequencing of faecal DNA from 48 Blastocystis-colonized patients and 48 Blastocystis-negative subjects, in order to decipher the impact of this widespread protist on gut microbiota composition and diversity. We report here on the Ion Torrent targeted metagenomic sequencing and analysis of these 96 human faecal samples, together with the complete datasets, from raw to analysed data. We also provide the key steps of the bioinformatic analyses, from library preparation to data filtering and OTU table generation. These data represent a valuable resource for the scientific community, enabling re-processing of these targeted metagenomic datasets through various pipelines and a comparative evaluation of microbiota analysis methods.