Tenny Steven, Kerndt Connor C., Hoffman Mary R.
University of Nebraska Medical Center
Spectrum Health/Michigan State University College of Human Medicine
A case-control study is a type of observational study commonly used to look at factors associated with diseases or outcomes. The case-control study starts with a group of cases, which are the individuals who have the outcome of interest. The researcher then tries to construct a second group of individuals called the controls, who are similar to the case individuals but do have the outcome of interest. The researcher then looks at historical factors to identify if some exposure(s) is/are found more commonly in the cases than the controls. If the exposure is found more commonly in the cases than in the controls, the researcher can hypothesize that the exposure may be linked to the outcome of interest. For example, a researcher may want to look at the rare cancer Kaposi's sarcoma. The researcher would find a group of individuals with Kaposi's sarcoma (the cases) and compare them to a group of patients who are similar to the cases in most ways but do not have Kaposi's sarcoma (controls). The researcher could then ask about various exposures to see if any exposure is more common in those with Kaposi's sarcoma (the cases) than those without Kaposi's sarcoma (the controls). The researcher might find that those with Kaposi's sarcoma are more likely to have HIV, and thus conclude that HIV may be a risk factor for the development of Kaposi's sarcoma. There are many advantages to case-control studies. First, the case-control approach allows for the study of rare diseases. If a disease occurs very infrequently, one would have to follow a large group of people for a long period of time to accrue enough incident cases to study. Such use of resources may be impractical, so a case-control study can be useful for identifying current cases and evaluating historical associated factors. For example, if a disease developed in 1 in 1000 people per year (0.001/year) then in ten years one would expect about 10 cases of a disease to exist in a group of 1000 people. If the disease is much rarer, say 1 in 1,000,0000 per year (0.0000001/year) this would require either having to follow 1,000,0000 people for ten years or 1000 people for 1000 years to accrue ten total cases. As it may be impractical to follow 1,000,000 for ten years or to wait 1000 years for recruitment, a case-control study allows for a more feasible approach. Second, the case-control study design makes it possible to look at multiple risk factors at once. In the example above about Kaposi's sarcoma, the researcher could ask both the cases and controls about exposures to HIV, asbestos, smoking, lead, sunburns, aniline dye, alcohol, herpes, human papillomavirus, or any number of possible exposures to identify those most likely associated with Kaposi's sarcoma. Case-control studies can also be very helpful when disease outbreaks occur, and potential links and exposures need to be identified. This study mechanism can be commonly seen in food-related disease outbreaks associated with contaminated products, or when rare diseases start to increase in frequency, as has been seen with measles in recent years. Because of these advantages, case-control studies are commonly used as one of the first studies to build evidence of an association between exposure and an event or disease. In a case-control study, the investigator can include unequal numbers of cases with controls such as 2:1 or 4:1 to increase the power of the study. The most commonly cited disadvantage in case-control studies is the potential for recall bias. Recall bias in a case-control study is the increased likelihood that those with the outcome will recall and report exposures compared to those without the outcome. In other words, even if both groups had exactly the same exposures, the participants in the cases group may report the exposure more often than the controls do. Recall bias may lead to concluding that there are associations between exposure and disease that do not, in fact, exist. It is due to subjects' imperfect memories of past exposures. If people with Kaposi's sarcoma are asked about exposure and history (e.g., HIV, asbestos, smoking, lead, sunburn, aniline dye, alcohol, herpes, human papillomavirus), the individuals with the disease are more likely to think harder about these exposures and recall having some of the exposures that the healthy controls. Case-control studies, due to their typically retrospective nature, can be used to establish a between exposures and outcomes, but cannot establish . These studies simply attempt to find correlations between past events and the current state. When designing a case-control study, the researcher must find an appropriate control group. Ideally, the case group (those with the outcome) and the control group (those without the outcome) will have almost the same characteristics, such as age, gender, overall health status, and other factors. The two groups should have similar histories and live in similar environments. If, for example, our cases of Kaposi's sarcoma came from across the country but our controls were only chosen from a small community in northern latitudes where people rarely go outside or get sunburns, asking about sunburn may not be a valid exposure to investigate. Similarly, if all of the cases of Kaposi's sarcoma were found to come from a small community outside a battery factory with high levels of lead in the environment, then controls from across the country with minimal lead exposure would not provide an appropriate control group. The investigator must put a great deal of effort into creating a proper control group to bolster the strength of the case-control study as well as enhance their ability to find true and valid potential correlations between exposures and disease states. Similarly, the researcher must recognize the potential for failing to identify confounding variables or exposures, introducing the possibility of confounding bias, which occurs when a variable that is not being accounted for that has a relationship with both the exposure and outcome. This can cause us to accidentally be studying something we are not accounting for but that may be systematically different between the groups.
病例对照研究是一种观察性研究,常用于探究与疾病或结果相关的因素。病例对照研究从一组病例开始,这些病例是患有感兴趣结果的个体。然后,研究人员试图构建另一组个体,称为对照组,他们与病例个体相似,但不具有感兴趣的结果。研究人员接着查看历史因素,以确定某些暴露在病例组中是否比对照组中更常见。如果在病例组中发现暴露比对照组中更常见,研究人员可以假设该暴露可能与感兴趣的结果有关。例如,研究人员可能想研究罕见的卡波西肉瘤。研究人员会找到一组患有卡波西肉瘤的个体(病例组),并将他们与一组在大多数方面与病例组相似但没有卡波西肉瘤的患者(对照组)进行比较。然后,研究人员可以询问各种暴露情况,以查看是否有任何暴露在患有卡波西肉瘤的个体(病例组)中比没有卡波西肉瘤的个体(对照组)中更常见。研究人员可能会发现,患有卡波西肉瘤的个体感染艾滋病毒的可能性更高,从而得出结论,艾滋病毒可能是卡波西肉瘤发生的一个风险因素。病例对照研究有许多优点。首先,病例对照方法允许对罕见疾病进行研究。如果一种疾病很少发生,就必须长期跟踪一大群人,以积累足够的发病病例进行研究。这样使用资源可能不切实际,因此病例对照研究对于识别当前病例和评估历史相关因素可能很有用。例如,如果一种疾病每年在每1000人中发生1例(0.001/年),那么在十年内,预计在1000人的群体中会有大约10例该疾病。如果该疾病更罕见,比如每年1/1000000(0.0000001/年),这要么需要跟踪1000000人十年,要么跟踪1000人一千年才能积累总共10例病例。由于跟踪1000000人十年或等待1000年进行招募可能不切实际,病例对照研究提供了一种更可行的方法。其次,病例对照研究设计使得可以同时研究多个风险因素。在上述关于卡波西肉瘤的例子中,研究人员可以询问病例组和对照组关于艾滋病毒、石棉、吸烟、铅、晒伤、苯胺染料、酒精、疱疹、人乳头瘤病毒或任何数量的可能暴露情况,以识别那些最有可能与卡波西肉瘤相关的因素。当疾病爆发时,病例对照研究也非常有帮助,此时需要识别潜在的联系和暴露情况。这种研究机制在与受污染产品相关的食源性疾病爆发中很常见,或者当罕见疾病的发病率开始上升时,就像近年来麻疹的情况。由于这些优点,病例对照研究通常被用作最早建立暴露与事件或疾病之间关联证据的研究之一。在病例对照研究中,研究人员可以纳入数量不等的病例和对照组,如2:1或4:1,以提高研究效能。病例对照研究中最常被提及的缺点是存在回忆偏倚的可能性。病例对照研究中的回忆偏倚是指与没有该结果的人相比,有该结果的人回忆和报告暴露情况的可能性增加。换句话说,即使两组的暴露情况完全相同,病例组的参与者可能比对照组更频繁地报告暴露情况。回忆偏倚可能导致得出暴露与疾病之间存在关联但实际上并不存在的结论。这是由于受试者对过去暴露情况的记忆不完美。如果询问患有卡波西肉瘤的人关于暴露和病史(如艾滋病毒、石棉、吸烟、铅、晒伤、苯胺染料、酒精、疱疹、人乳头瘤病毒),患病个体更有可能仔细思考这些暴露情况,并回忆起一些健康对照组没有的暴露情况。由于病例对照研究通常具有回顾性,它可以用于建立暴露与结果之间的 ,但不能建立 。这些研究只是试图找到过去事件与当前状态之间的相关性。在设计病例对照研究时,研究人员必须找到一个合适的对照组。理想情况下,病例组(有该结果的人)和对照组(没有该结果的人)将具有几乎相同的特征,如年龄、性别、总体健康状况和其他因素。两组应该有相似的病史并生活在相似的环境中。例如,如果我们的卡波西肉瘤病例来自全国各地,但我们的对照组仅从北方一个很少外出或晒伤的小社区中选择,询问晒伤情况可能不是一个有效的调查暴露因素。同样,如果所有卡波西肉瘤病例都来自一家电池厂外的小社区,那里环境中的铅含量很高,那么来自全国各地且铅暴露极少的对照组就不能提供一个合适的对照组。研究人员必须付出大量努力来创建一个合适的对照组,以增强病例对照研究的力度,并提高他们发现暴露与疾病状态之间真实有效潜在相关性的能力。同样,研究人员必须认识到未能识别混杂变量或暴露的可能性,从而引入混杂偏倚的可能性,当一个未被考虑的变量与暴露和结果都有关系时就会发生混杂偏倚。这可能导致我们意外地研究了一些我们没有考虑到但两组之间可能存在系统差异的因素。