Han Dongsheng, Gao Peng, Li Rui, Tan Ping, Xie Jiehong, Zhang Rui, Li Jinming
National Center for Clinical Laboratories, Beijing Hospital, National Center of Gerontology; Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, 100005, PR China.
Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100730, PR China.
J Adv Res. 2020 Jul 21;26:111-121. doi: 10.1016/j.jare.2020.07.010. eCollection 2020 Nov.
Microbiome research based on high-throughput sequencing has grown exponentially in recent years, but methodological variations can easily undermine the reproducibility across studies.
To systematically evaluate the comparability of sequencing results of 16S rRNA gene sequencing (16Ss)- and shotgun metagenomic sequencing (SMs)-based microbial community profiling in laboratories under routine conditions.
We designed a multicenter study across 35 participating laboratories in China using designed mock communities and homogenized fecal samples.
A wide range of practices and approaches was reported by the participating laboratories. The observed microbial compositions of the mock communities in 46.2% (12/26) of the 16Ss and 82.6% (19/23) of the SMs laboratories had significant correlations with the expected result (Spearman r>0.59, <0.05). The results from laboratories with near-identical protocols showed slight interlaboratory deviations. However, a high degree of interlaboratory deviation was found in the observed abundances of specific taxa, such as Bacteroides spp. (range: 0.3%-53.5%), Enterococci spp. (range: 0.8%-43.9%) and Fusobacterium spp. (range: 0.1%-39.8%). SMs performed better than 16Ss in detecting low-abundance bacteria (B. bifidum). The differences in DNA extraction methods, amplified regions and bioinformatics analysis tools (taxonomic classifiers and database) were important factors causing interlaboratory deviations. Addressing laboratory contamination is an urgent task because various sources of unexpected microbes were found in negative control samples.
Well-defined control samples, such as the mock communities in this study, should be routinely used in microbiome research for monitoring potential biases. The findings in this study will provide guidance in the choice of more reasonable operating procedures to minimize potential methodological biases in revealing human microbiota composition.
近年来,基于高通量测序的微生物组研究呈指数级增长,但方法学上的差异很容易破坏不同研究之间的可重复性。
在常规条件下,系统评估各实验室基于16S rRNA基因测序(16Ss)和鸟枪法宏基因组测序(SMs)的微生物群落分析测序结果的可比性。
我们在中国35个参与实验室开展了一项多中心研究,使用设计好的模拟群落和匀浆粪便样本。
参与实验室报告了广泛的操作方法和途径。在16Ss实验室中,46.2%(12/26)的模拟群落观察到的微生物组成与预期结果有显著相关性(Spearman秩相关系数r>0.59,P<0.05);在SMs实验室中,这一比例为82.6%(19/23)。采用几乎相同方案的实验室结果显示出轻微的实验室间偏差。然而,在特定分类群的观察丰度上发现了高度的实验室间偏差,如拟杆菌属(范围:0.3%-53.5%)、肠球菌属(范围:0.8%-43.9%)和梭杆菌属(范围:0.1%-39.8%)。在检测低丰度细菌(双歧杆菌)方面,SMs比16Ss表现更好。DNA提取方法、扩增区域和生物信息学分析工具(分类分类器和数据库)的差异是导致实验室间偏差的重要因素。解决实验室污染是一项紧迫任务,因为在阴性对照样本中发现了各种意外微生物来源。
在微生物组研究中应常规使用定义明确的对照样本,如本研究中的模拟群落,以监测潜在偏差。本研究结果将为选择更合理的操作程序提供指导,以尽量减少在揭示人类微生物群组成时潜在的方法学偏差。