Department of Microbiology, Center for Biofilm Engineering, Montana State University, 366 EPS Building, Bozeman, MT 59717, USA.
Microb Ecol. 2012 Aug;64(2):499-508. doi: 10.1007/s00248-012-0043-9. Epub 2012 Apr 5.
Due to potential sequencing errors in pyrosequencing data, species richness and diversity indices of microbial systems can be miscalculated. The "traditional" sequence refinement method is not sufficient to account for overestimations (e.g., length, primer errors, ambiguous nucleotides). Recent in silico and single-organism studies have revealed the importance of sequence quality scores in the estimation of ecological indices; however, this is the first study to compare quality-score stringencies across four regions of the SSU rRNA gene sequence (V1V2, V3, V4, and V6) with actual environmental samples compared directly to corresponding clone libraries produced from the same primer sets. The nucleic acid sequences determined via pyrosequencing were subjected to varying quality-score cutoffs that ranged from 25 to 32, and at each quality-score cutoff, either 10 or 15 % of the nucleotides were allowed to be below the cutoff. When species richness estimates were compared for the tested samples, the cutoff values of Q27(15%), Q30(10%), and Q32(15%) for V1V2, V4, and V6, respectively, estimated similar values as obtained with clone libraries and Sanger sequencing. The most stringent Q tested (Q32(10%)) was not enough to account for species richness inflation of the V3 region pyrosequence data. Results indicated that quality-score assessment greatly improved estimates of ecological indices for environmental samples (species richness and α-diversity) and that the effect of quality-score filtering was region-dependent.
由于焦磷酸测序数据中存在潜在的测序错误,微生物系统的物种丰富度和多样性指数可能会被误算。“传统”的序列细化方法不足以解释这些高估现象(例如,长度、引物错误、模糊核苷酸)。最近的计算机模拟和单细胞研究揭示了序列质量评分在生态指数估计中的重要性;然而,这是第一项比较 SSU rRNA 基因序列四个区域(V1V2、V3、V4 和 V6)的质量评分严格程度与实际环境样本与从相同引物集直接产生的相应克隆文库的研究。通过焦磷酸测序确定的核酸序列受到质量评分截止值的影响,从 25 到 32 不等,在每个质量评分截止值下,允许 10%或 15%的核苷酸低于截止值。当对测试样本进行物种丰富度估计比较时,V1V2、V4 和 V6 分别为 Q27(15%)、Q30(10%)和 Q32(15%)的截止值估计值与克隆文库和 Sanger 测序获得的值相似。测试的最严格的 Q(Q32(10%))不足以解释 V3 区域焦序列数据的物种丰富度膨胀。结果表明,质量评分评估极大地提高了环境样本(物种丰富度和 α多样性)生态指数的估计值,并且质量评分过滤的效果取决于区域。
Sci Total Environ. 2017-10-28
Appl Environ Microbiol. 1999-4
Nucleic Acids Res. 2014-1-23
Environ Microbiol. 2010-3-11
Environ Microbiol. 2009-8-27
Nat Methods. 2009-9
Nucleic Acids Res. 2008-10
J Microbiol Methods. 2008-9
Nucleic Acids Res. 2007