Suppr超能文献

流行病学方法与应用概述:观察性研究设计的优势与局限性。

Overview of the epidemiology methods and applications: strengths and limitations of observational study designs.

机构信息

School of Medicine, Department of Surgery, Washington University School of Medicine, St. Louis, Missouri.

出版信息

Crit Rev Food Sci Nutr. 2010;50 Suppl 1(s1):10-2. doi: 10.1080/10408398.2010.526838.

Abstract

The impact of study design on the results of medical research has long been an area of both substantial debate and a smaller body of empirical research. Examples come from many disciplines within clinical and public health research. Among the early major contributions in the 1970s was work by Mosteller and colleagues (Gilbert et al., 1997), who noted that innovations in surgery and anesthesia showed greater gains than standard therapy when nonrandomized, controlled trials were evaluated compared with the gains reported in randomized, controlled trials. More recently, we and others have evaluated the impact of design in medical and surgical research, and concluded that the mean gain comparing new therapies to established therapies was biased by study design in nonrandomized trials (Colditz et al., 1989; Miller et al., 1989). Benson and Hartz (2000) conducted a study in which they focused only on studies reported after 1985. On the basis of 136 reports of 19 diverse treatments, Benson and Hartz concluded that in only 2 of the 19 analyses did the combined data from the observational studies lie outside the 95% confidence interval for the combined data from the randomized trials. A similar study drew only on data reported from 1991 to 1995, which showed remarkably similar results among observational studies and randomized, controlled trials (Concato et al., 2000). These more recent data suggest that advancing the study design and analytic methods may reduce bias in some evaluations of medical and public health interventions. Such methods apply not only to the original studies, but also to the approaches that are taken to quantitatively combine results by using meta-analytic approaches such as random effects meta-regression, Bayesian meta-analysis, and the like (Normand, 1999). By focusing attention on thorough data analysis, design issues can be understood and their impact or bias can be estimated, on average, and then ideally accounted for in the interpretation of data. Before discussing dietary data, let us first consider some of the more clearly delineated preventive exposures. Issues of study design have been addressed in terms of combining randomized trials and observational studies in evaluating preventive interventions such as Bacillus Calmette-Guerin vaccination (Colditz et al., 1994) and mammography screening (Desmissie et al., 1998). When one is interpreting the apparent heterogeneity in the results, it is important to step back and ask what is the relationship being evaluated under these different study designs? For example, a randomized, controlled trial uses the intention-to-treat analysis to preserve the merit of randomization. Such an analysis does not evaluate the exposure-disease relationship, but rather examines the impact of offering a new therapy versus an alternative therapy (regardless of adherence to the intervention, or control or placebo). On the other hand, a case-control study or a prospective cohort study will evaluate the impact of the screening test among those participants who were screened as compared with those who were never screened. In prevention studies, the design raises major issues of the timing of the exposure in the natural history of disease and also the adherence to therapy by healthy research volunteers. Case-control studies of preventive interventions such as screening mammography and prospective population-based studies of pap smears have capitalized on this variation in time since the last screen to evaluate the protective interval for a screening test (IARC Work Group, 1986). In contrast, a trial must choose a level of exposure, such as annual mammography screenings or colon screenings every 10 years with a colonoscopy, regardless of the evolving evidence on the duration of protection after a negative screening test. Continuing with the mammography example, a detailed study by Demissie and colleagues (1998) combined data from seven randomized trials and six case-control studies that investigated the association between participation in breast cancer screening programs and breast cancer mortality. The authors showed that if one assumes noncompliance with mammography (approaching 30%) and 20% of the control group is screened, then the benefit of mammography in terms of reduced mortality is comparable in randomized, controlled trials and epidemiologic studies after adjusting for nonadherence (Demissie et al., 1998). Thus, the different designs fundamentally measure different constructs of the impact of screening. Zelen (1988) considered the challenges of primary prevention trials and addressed both compliance and models of carcinogenesis as major impediments to the use of randomized, controlled trials to evaluate cancer prevention strategies. It is important to contrast these issues in both treatment trials and prevention trials. In treatment trials, recently diagnosed patients, who are often in life-threatening situations, are typically offered the option to participate in a trial of a new therapy compared with standard therapy or placebo. Compliance or adherence to therapy is usually very high among these highly motivated patients and their outcomes are generally in the short- to mid-term. In contrast, prevention trials recruit large numbers of healthy participants, offer them a therapy, and then follow them over many years because the chronic diseases being prevented are relatively rare. With substantial noncompliance (often in the range of 20% to 40% over the duration of the trial), an intention-to-treat analysis is no longer unbiased, but rather gives a biased estimate of the effect, typically underestimating the magnitude of the association that is seen in observation studies in which those participants who have had exposure to a particular lifestyle component are compared with those without such an exposure. There are additional challenges for nutritional interventions, including the timing of diet as a preventive agent in the disease process and the range of nutrient intakes in the population. In retrospective case-control studies, recall bias of past diet is an additional issue with which to contend. Unlike smoking or screening tests in which the exposure is finite and can be completely stopped and started, one's diet, physical activity, and weight change cannot go to zero for prolonged periods and sustain life. The range of nutrient intake is a major issue when enrolling participants into prevention trials and observational studies. Health-conscious volunteers are more often identified and screened as eligible for a trial. The epidemiology of diet and colon cancer has been extensively studied. For example, Cho et al. (2004) conducted a combined analysis of prospective dietary studies of calcium and vitamin D intake data from 10 cohorts. The dose-response relationship for calcium showed that the greatest benefit for increasing calcium intake was for those participants who had a reported daily intake below 1000 mg/d. Increasing the intake of those individuals with low intake to the level of 1000 mg/day would yield a 20% reduction in risk. Beyond this level of intake, there was little additional reduction in the risk of colon cancer. In the Women's Health Initiative, participants had a mean calcium intake of 1150 mg/d at baseline and increased this intake in the intervention arm to 2250 mg/d on average. This magnitude of increase was of limited association in the combined, prospective, cohort studies and was not related to risk in the randomized trial (Wactawski-Wende et al., 2006). Similar findings apply to the interpretation of the vitamin D intervention and highlight the role of dietary intake at randomization when evaluating dietary components. Returning to the time frame of exposure in the carcinogenic process, the null randomized, controlled trials of fiber (Alberts et al., 2000) and fruit and vegetables (Schatzkin et al., 2000) for the prevention of polyp recurrence amply illustrate Zelen's concerns regarding the timing of the preventive intervention in the disease process. The extent of DNA damage accumulated across the colonic mucosa at the time of detecting the "eligibility polyp" was certainly not limited only to the removed polyp. Rather, these observations beg the question that at what stage in the disease process may fiber play a role in protecting against colon cancer? This contrasts with the richness of epidemiologic studies that can address exposure over the life course and relate such exposure to disease risk. Perhaps the best-known example is the radiation follow-up effects cohort in Japan in which a radiation dose was estimated for each woman who had been exposed to the effects of the atomic bombs on Hiroshima and Nagasaki and followed-up over 40 years. The results of this study showed a clear and strong relationship between the increased risk of breast cancer with higher exposure among those participants who were exposed before the age of 20 years (Land et al., 2003). Retrospective assessment of diet after disease diagnosis has been demonstrated to introduce bias into the evaluation of exposure-disease relationships. For example, Giovannucci et al. (1993) evaluated retrospective recall of diet after breast cancer diagnosis within the ongoing Nurses' Health Study. In contrast with the prospective analysis in which no relationship between dietary fat and breast cancer was observed, the retrospective analysis yielded a positive relationship for total fat and saturated fat (Giovannucci et al., 1993). By comparing the top quintile versus the bottom quintile of reported intake, the retrospective assessment yielded odds ratios of 1.43 for total fat and 1.38 for saturated fat. Therefore, the magnitude of bias was sufficient to distort evaluation of the diet-disease relationship. (ABSTRACT TRUNCATED)

摘要

研究设计对医学研究结果的影响一直是一个备受争议的领域,也是实证研究相对较少的领域。例子来自临床和公共卫生研究的许多学科。20 世纪 70 年代早期的主要贡献之一是 Mosteller 和同事的工作(Gilbert 等人,1997 年),他们指出,与随机对照试验报告的收益相比,手术和麻醉方面的创新在非随机对照试验中显示出更大的收益。最近,我们和其他人评估了医学和外科研究中的设计影响,并得出结论,非随机试验中的设计会导致新疗法与现有疗法比较的平均收益存在偏差(Colditz 等人,1989 年;Miller 等人,1989 年)。Benson 和 Hartz(2000 年)进行了一项仅关注 1985 年后报告的研究。Benson 和 Hartz 基于 19 种不同治疗方法的 136 项报告得出结论,在观察性研究中,只有 19 项分析中的综合数据超出了随机试验综合数据的 95%置信区间。一项类似的研究仅基于 1991 年至 1995 年报告的数据,该研究在观察性研究和随机对照试验中显示出非常相似的结果(Concato 等人,2000 年)。这些较新的数据表明,通过改进研究设计和分析方法,可以减少一些医学和公共卫生干预措施评估中的偏倚。这些方法不仅适用于原始研究,也适用于使用元分析方法(如随机效应元回归、贝叶斯元分析等)定量组合结果的方法(Normand,1999 年)。通过关注彻底的数据分析,可以理解设计问题,并对其影响或偏差进行平均估计,然后在解释数据时理想地考虑这些偏差。在讨论饮食数据之前,让我们首先考虑一些更明确的预防性暴露。在评估卡介苗接种(Colditz 等人,1994 年)和乳房 X 光筛查(Desmissie 等人,1998 年)等预防性干预措施时,已经考虑了研究设计的问题。当人们在解释这些不同研究设计的结果中的明显异质性时,重要的是要退后一步,问一下在这些不同的研究设计下正在评估什么关系?例如,随机对照试验使用意向治疗分析来保留随机化的优点。这种分析不评估暴露-疾病关系,而是检查提供新疗法与替代疗法的影响(无论是否遵守干预措施、对照或安慰剂)。另一方面,病例对照研究或前瞻性队列研究将评估在接受筛查的参与者中筛查测试的影响,而不是在从未接受过筛查的参与者中评估筛查测试的影响。在预防研究中,设计提出了暴露在疾病自然史中的时间以及健康研究志愿者对治疗的依从性等重大问题。筛查性乳房 X 光检查和基于人群的巴氏涂片前瞻性研究等预防性干预措施的病例对照研究利用了筛查之间的时间差异,以评估筛查测试的保护间隔(IARC 工作组,1986 年)。相比之下,试验必须选择暴露水平,例如每年进行乳房 X 光筛查或结肠癌筛查,每 10 年进行一次结肠镜检查,无论在阴性筛查测试后保护期的持续时间方面的证据如何。继续以乳房 X 光检查为例,Demissie 和同事(1998 年)的一项详细研究结合了七个随机试验和六个病例对照研究的数据,调查了参与乳腺癌筛查计划与乳腺癌死亡率之间的关联。作者表明,如果假设(接近 30%)不遵守乳房 X 光检查和对照组中有 20%接受筛查,那么在调整不依从性后,随机对照试验和流行病学研究中通过减少死亡率来评估乳房 X 光检查的益处是可比的(Demissie 等人,1998 年)。因此,不同的设计从根本上衡量了筛查的不同影响。Zelen(1988 年)考虑了初级预防试验的挑战,并解决了顺应性和致癌模型作为使用随机对照试验评估癌症预防策略的主要障碍。在治疗试验和预防试验中对比这些问题非常重要。在治疗试验中,最近被诊断患有危及生命疾病的患者通常可以选择参加新疗法的试验,而不是标准疗法或安慰剂。在这些高度积极的患者中,治疗依从性或顺应性通常非常高,并且他们的结果通常在短期到中期内。相比之下,预防试验招募大量健康参与者,为他们提供一种疗法,然后在多年后对他们进行随访,因为预防的慢性疾病相对较少见。在试验期间不遵守(通常在 20%至 40%之间),意向治疗分析不再无偏,而是对关联的估计产生偏差,通常低估了观察研究中看到的关联的幅度,在观察研究中,比较了具有特定生活方式成分暴露的参与者与没有这种暴露的参与者。营养干预措施还存在其他挑战,包括饮食作为疾病过程中预防剂的时间以及人群中营养素摄入量的范围。在回顾性病例对照研究中,过去饮食的回忆偏倚是另一个需要解决的问题。与吸烟或筛查测试不同,这些暴露是有限的,可以完全停止和开始,一个人的饮食、体力活动和体重变化不能长时间为零并维持生命。在预防试验和观察性研究中,营养素摄入量的范围是一个主要问题。健康意识强的志愿者更容易被识别和筛选参加试验。饮食与结肠癌的流行病学已经得到了广泛研究。例如,Cho 等人(2004 年)对 10 个队列的前瞻性饮食研究中的钙和维生素 D 摄入数据进行了综合分析。钙的剂量反应关系表明,增加钙摄入量的最大益处是对报告的每日摄入量低于 1000 毫克/天的参与者。将这些低摄入量个体的摄入量增加到 1000 毫克/天,可将风险降低 20%。在此水平以上,风险降低幅度不大。在妇女健康倡议中,参与者在基线时的平均钙摄入量为 1150 毫克/天,在干预组中将其增加到平均 2250 毫克/天。这种幅度的增加与前瞻性队列研究中联合、前瞻性队列研究中的风险没有关联,并且与随机试验中没有关联(Wactawski-Wende 等人,2006 年)。类似的发现适用于维生素 D 干预措施的解释,并强调了在评估饮食成分时随机化时饮食摄入量的作用。回到致癌过程中暴露的时间框架,纤维(Alberts 等人,2000 年)和水果和蔬菜(Schatzkin 等人,2000 年)预防息肉复发的随机对照试验充分说明了 Zelen 对疾病过程中预防性干预时间的关注。在检测到“合格息肉”时,结肠黏膜上累积的 DNA 损伤程度肯定不仅限于切除的息肉。相反,这些观察结果引出了一个问题,即纤维在保护结肠癌方面可能在疾病过程中的哪个阶段发挥作用?这与流行病学研究可以评估生命过程中的暴露并将这种暴露与疾病风险相关的丰富性形成对比。也许最著名的例子是日本原子弹爆炸幸存者辐射随访效应队列,该队列估计了每个暴露于广岛和长崎原子弹影响的女性的辐射剂量,并随访了 40 多年。这项研究的结果表明,在 20 岁之前暴露的参与者中,辐射暴露与乳腺癌风险增加之间存在明显且强烈的关系(Land 等人,2003 年)。疾病诊断后饮食的回顾性评估会导致暴露-疾病关系的评估产生偏差。例如,Giovannucci 等人(1993 年)评估了在正在进行的护士健康研究中乳腺癌诊断后对饮食的回顾性回忆。与前瞻性分析中未观察到饮食脂肪与乳腺癌之间存在关联不同,回顾性分析显示总脂肪和饱和脂肪呈正相关(Giovannucci 等人,1993 年)。通过比较报告摄入量的最高五分位数与最低五分位数,回顾性评估得出的比值比为总脂肪 1.43,饱和脂肪 1.38。因此,偏差的幅度足以扭曲饮食-疾病关系的评估。(摘要已省略)

相似文献

1
Overview of the epidemiology methods and applications: strengths and limitations of observational study designs.
Crit Rev Food Sci Nutr. 2010;50 Suppl 1(s1):10-2. doi: 10.1080/10408398.2010.526838.
3
The future of Cochrane Neonatal.
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
4
Dietary glycation compounds - implications for human health.
Crit Rev Toxicol. 2024 Sep;54(8):485-617. doi: 10.1080/10408444.2024.2362985. Epub 2024 Aug 16.
8
9
10
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.

引用本文的文献

1
Mendelian randomization studies on ischemic stroke: a field synopsis and systematic review.
J Transl Med. 2025 Aug 22;23(1):955. doi: 10.1186/s12967-025-06992-4.
4
Association between pathological scar and hypertension: A two-sample Mendelian randomization study.
Medicine (Baltimore). 2024 Dec 27;103(52):e40977. doi: 10.1097/MD.0000000000040977.
6
Causal Relationships Between Emotional Instability and Respiratory Diseases: A Mendelian Randomization Analysis.
Psychol Res Behav Manag. 2024 Sep 12;17:3159-3167. doi: 10.2147/PRBM.S481913. eCollection 2024.
8
The use of positive deviance approach to improve health service delivery and quality of care: a scoping review.
BMC Health Serv Res. 2024 Apr 8;24(1):438. doi: 10.1186/s12913-024-10850-2.
9
Causal relationship between gut microbiota and glioblastoma: a two-sample Mendelian randomization study.
J Cancer. 2024 Jan 1;15(2):332-342. doi: 10.7150/jca.90149. eCollection 2024.
10
Tea consumption and risk of lung diseases: a two‑sample Mendelian randomization study.
BMC Pulm Med. 2023 Nov 22;23(1):461. doi: 10.1186/s12890-023-02762-4.

本文引用的文献

1
Smoking and smoking cessation in relation to mortality in women.
JAMA. 2008 May 7;299(17):2037-47. doi: 10.1001/jama.299.17.2037.
2
Methods for pooling results of epidemiologic studies: the Pooling Project of Prospective Studies of Diet and Cancer.
Am J Epidemiol. 2006 Jun 1;163(11):1053-64. doi: 10.1093/aje/kwj127. Epub 2006 Apr 19.
3
Calcium plus vitamin D supplementation and the risk of colorectal cancer.
N Engl J Med. 2006 Feb 16;354(7):684-96. doi: 10.1056/NEJMoa055222.
4
Validation of adolescent diet recalled by adults.
Epidemiology. 2006 Mar;17(2):226-9. doi: 10.1097/01.ede.0000198181.86685.49.
5
Adult recall of adolescent diet: reproducibility and comparison with maternal reporting.
Am J Epidemiol. 2005 Jan 1;161(1):89-97. doi: 10.1093/aje/kwi019.
6
Dairy foods, calcium, and colorectal cancer: a pooled analysis of 10 cohort studies.
J Natl Cancer Inst. 2004 Jul 7;96(13):1015-22. doi: 10.1093/jnci/djh185.
9
Randomized, controlled trials, observational studies, and the hierarchy of research designs.
N Engl J Med. 2000 Jun 22;342(25):1887-92. doi: 10.1056/NEJM200006223422507.
10
A comparison of observational studies and randomized, controlled trials.
N Engl J Med. 2000 Jun 22;342(25):1878-86. doi: 10.1056/NEJM200006223422506.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验