Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland.
Costa Rican Agency for Biomedical Research-INCIENSA Foundation, San José, Costa Rica.
Am J Epidemiol. 2018 Jun 1;187(6):1282-1290. doi: 10.1093/aje/kwy064.
Temporal variation in microbiome measurements can reduce statistical power in research studies. Quantification of this variation is essential for designing studies of chronic disease. We analyzed 16S ribosomal RNA profiles in paired biological specimens separated by 6 months from 3 studies conducted during 1985-2013 (a National Cancer Institute colorectal cancer study, a Costa Rica study, and the Human Microbiome Project). We evaluated temporal stability by calculating intraclass correlation coefficients (ICCs). Sample sizes needed in order to detect microbiome differences between equal numbers of cases and controls for a nested case-control design were calculated on the basis of estimated ICCs. Across body sites, 12 phylum-level ICCs were greater than 0.5. Similarly, 11 alpha-diversity ICCs were greater than 0.5. Fecal beta-diversity estimates had ICCs over 0.5. For a single collection with most microbiome metrics, detecting an odds ratio of 2.0 would require 300-500 cases when matching 1 case to 1 control at P = 0.05. Use of 2 or 3 sequential specimens reduces the number of required subjects by 40%-50% for low-ICC metrics. Relative abundances of major phyla and alpha-diversity metrics have low temporal stability. Thus, detecting associations of moderate effect size with these metrics will require large sample sizes. Because beta diversity for feces is reasonably stable over time, smaller sample sizes can detect associations with community composition. Sequential prediagnostic specimens from thousands of prospectively ascertained cases are required to detect modest disease associations with particular microbiome metrics.
微生物组测量的时间变化会降低研究的统计效力。量化这种变化对于设计慢性疾病研究至关重要。我们分析了 1985-2013 年期间进行的 3 项研究(美国国立癌症研究所的结肠癌研究、哥斯达黎加研究和人类微生物组计划)中相隔 6 个月的配对生物样本的 16S 核糖体 RNA 图谱。我们通过计算组内相关系数 (ICC) 来评估时间稳定性。基于估计的 ICC,计算了为嵌套病例对照设计检测相同数量的病例和对照之间的微生物组差异所需的样本量。在各个身体部位,12 个门水平的 ICC 大于 0.5。同样,11 个 alpha 多样性 ICC 大于 0.5。粪便 beta 多样性估计的 ICC 大于 0.5。对于大多数微生物组指标的单次采集,当以 1 个病例匹配 1 个对照时,要检测比值比为 2.0 的情况,需要在 P = 0.05 时需要 300-500 个病例。对于低 ICC 指标,使用 2 或 3 个连续样本可将所需受试者数量减少 40%-50%。主要门的相对丰度和 alpha 多样性指标的时间稳定性较低。因此,检测这些指标与中度效应大小相关的关联需要大样本量。由于粪便的 beta 多样性在时间上相对稳定,因此较小的样本量可以检测与群落组成相关的关联。需要数千个前瞻性确定的病例的连续预测性标本来检测特定微生物组指标与适度疾病的关联。