Department of Respiratory Medicine, University of Occupational and Environmental Health, Japan, 1-1 Iseigaoka, Yahatanishi-ku, Kitakyushu-city, Fukuoka, 807-8555, Japan.
Department of Microbiology, University of Occupational and Environmental Health, Japan, Kitakyushu, Japan.
Sci Rep. 2021 Oct 1;11(1):19534. doi: 10.1038/s41598-021-98985-8.
Next-generation sequencing (NGS) technologies have been applied in bacterial flora analysis. However, there is no standardized protocol, and the optimal clustering threshold for estimating bacterial species in respiratory infection specimens is unknown. This study was conducted to investigate the optimal threshold for clustering 16S ribosomal RNA gene sequences into operational taxonomic units (OTUs) by comparing the results of NGS technology with those of the Sanger method, which has a higher accuracy of sequence per single read than NGS technology. This study included 45 patients with pneumonia with aspiration risks and 35 patients with lung abscess. Compared to Sanger method, the concordance rates of NGS technology (clustered at 100%, 99%, and 97% homology) with the predominant phylotype were 78.8%, 71.3%, and 65.0%, respectively. With respect to the specimens dominated by the Streptococcus mitis group, containing several important causative agents of pneumonia, Bray Curtis dissimilarity revealed that the OTUs obtained at 100% clustering threshold (versus those obtained at 99% and 97% thresholds; medians of 0.35, 0.69, and 0.71, respectively) were more similar to those obtained by the Sanger method, with statistical significance (p < 0.05). Clustering with 100% sequence identity is necessary when analyzing the microbiota of respiratory infections using NGS technology.
下一代测序 (NGS) 技术已应用于细菌菌群分析。然而,目前尚无标准化的方案,也不清楚用于估计呼吸道感染标本中细菌种类的最佳聚类阈值。本研究旨在通过比较 NGS 技术与 Sanger 方法的结果,来确定用于将 16S 核糖体 RNA 基因序列聚类为操作分类单位 (OTU) 的最佳聚类阈值。该研究纳入了 45 例有吸入风险的肺炎患者和 35 例肺脓肿患者。与 Sanger 方法相比,NGS 技术(聚类阈值为 100%、99%和 97%同源性)与主要种系型的一致性率分别为 78.8%、71.3%和 65.0%。对于主要由酿脓链球菌组(包含几种重要的肺炎病原体)的标本,Bray Curtis 不相似性表明,在 100%聚类阈值下获得的 OTU(与 99%和 97%阈值下获得的 OTU 相比;中位数分别为 0.35、0.69 和 0.71)与 Sanger 方法更为相似,具有统计学意义(p<0.05)。使用 NGS 技术分析呼吸道感染的微生物群时,需要进行 100%序列同一性聚类。