Suppr超能文献

基于人群的纵向研究中慢性病基因-环境相互作用研究的效能分析:一项模拟研究

Power Analysis for Population-Based Longitudinal Studies Investigating Gene-Environment Interactions in Chronic Diseases: A Simulation Study.

作者信息

Ma Jinhui, Thabane Lehana, Beyene Joseph, Raina Parminder

机构信息

Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Ontario, Canada.

McMaster University Evidence-based Practice Center, Hamilton, Ontario, Canada.

出版信息

PLoS One. 2016 Feb 22;11(2):e0149940. doi: 10.1371/journal.pone.0149940. eCollection 2016.

Abstract

Conventional methods for sample size calculation for population-based longitudinal studies tend to overestimate the statistical power by overlooking important determinants of the required sample size, such as the measurement errors and unmeasured etiological determinants, etc. In contrast, a simulation-based sample size calculation, if designed properly, allows these determinants to be taken into account and offers flexibility in accommodating complex study design features. The Canadian Longitudinal Study on Aging (CLSA) is a Canada-wide, 20-year follow-up study of 30,000 people between the ages of 45 and 85 years, with in-depth information collected every 3 years. A simulation study, based on an illness-death model, was conducted to: (1) investigate the statistical power profile of the CLSA to detect the effect of environmental and genetic risk factors, and their interaction on age-related chronic diseases; and (2) explore the design alternatives and implementation strategies for increasing the statistical power of population-based longitudinal studies in general. The results showed that the statistical power to identify the effect of environmental and genetic risk exposures, and their interaction on a disease was boosted when: (1) the prevalence of the risk exposures increased; (2) the disease of interest is relatively common in the population; and (3) risk exposures were measured accurately. In addition, the frequency of data collection every three years in the CLSA led to a slightly lower statistical power compared to the design assuming that participants underwent health monitoring continuously. The CLSA had sufficient power to detect a small (1<hazard ratio (HR)≤1.5) or moderate effect (1.5< HR≤2.0) of the environmental risk exposure, as long as the risk exposure and the disease of interest were not rare. It had enough power to detect a moderate or large (2.0<HR≤3.0) effect of the genetic risk exposure when the prevalence of the risk exposure was not very low (≥0.1) and the disease of interest was not rare (such as diabetes and dementia). The CLSA had enough power to detect a large effect of the gene-environment interaction only when both risk exposures had relatively high prevalence (0.2) and the disease of interest was very common (such as diabetes). The minimum detectable hazard ratios (MDHR) of the CLSA for the environmental and genetic risk exposures obtained from this simulation study were larger than those calculated according to the conventional sample size calculation method. For example, the MDHR for the environmental risk exposure was 1.15 according to the conventional method if the prevalence of the risk exposure was 0.1 and the disease of interest was dementia. In contrast, the MDHR was 1.61 if the same exposure was measured every 3 years with a misclassification rate of 0.1 according to this simulation study. With a given sample size, higher statistical power could be achieved by increasing the measuring frequency in participants with high risk of declining health status or changing risk exposures, and by increasing measurement accuracy of diseases and risk exposures. A properly designed simulation-based sample size calculation is superior to conventional methods when rigorous sample size calculation is necessary.

摘要

基于人群的纵向研究中,传统的样本量计算方法往往会高估统计效能,因为它们忽略了所需样本量的重要决定因素,如测量误差和未测量的病因决定因素等。相比之下,基于模拟的样本量计算如果设计得当,则可以将这些决定因素考虑在内,并在适应复杂的研究设计特征方面具有灵活性。加拿大老龄化纵向研究(CLSA)是一项在加拿大范围内进行的、对30000名年龄在45至85岁之间的人群进行20年随访的研究,每3年收集一次深入信息。基于疾病死亡模型进行了一项模拟研究,以:(1)研究CLSA检测环境和遗传风险因素及其相互作用对年龄相关慢性病影响的统计效能概况;(2)总体上探索提高基于人群的纵向研究统计效能的设计替代方案和实施策略。结果表明,在以下情况下,识别环境和遗传风险暴露及其相互作用对疾病影响的统计效能会提高:(1)风险暴露的患病率增加;(2)所关注的疾病在人群中相对常见;(3)风险暴露测量准确。此外,与假设参与者持续接受健康监测的设计相比,CLSA每三年进行一次数据收集导致统计效能略低。CLSA有足够的效能检测环境风险暴露的小(1<风险比(HR)≤1.5)或中等效应(1.5<HR≤2.0),只要风险暴露和所关注的疾病并非罕见。当风险暴露的患病率不是很低(≥0.1)且所关注的疾病并非罕见(如糖尿病和痴呆症)时,它有足够的效能检测遗传风险暴露的中等或大(2.0<HR≤3.0)效应。只有当两种风险暴露的患病率都相对较高(0.2)且所关注的疾病非常常见(如糖尿病)时,CLSA才有足够的效能检测基因 - 环境相互作用的大效应。从该模拟研究中获得的CLSA对于环境和遗传风险暴露的最小可检测风险比(MDHR)大于根据传统样本量计算方法计算得出的结果。例如,如果风险暴露的患病率为0.1且所关注的疾病是痴呆症,根据传统方法,环境风险暴露的MDHR为1.15。相比之下,根据该模拟研究,如果每3年测量一次相同的暴露且错误分类率为0.1,则MDHR为1.61。在给定样本量的情况下,通过提高健康状况下降风险高或风险暴露发生变化的参与者的测量频率,以及提高疾病和风险暴露的测量准确性,可以实现更高的统计效能。在需要进行严格的样本量计算时,设计得当的基于模拟的样本量计算优于传统方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fad/4762766/1c19f67471f4/pone.0149940.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验