Barry Daniel, Livingstone Victoria
Department of Mathematics and Statistics, University of Limerick, Limerick, Ireland.
Stat Med. 2006 Mar 30;25(6):965-75. doi: 10.1002/sim.2238.
Consider a case-control study designed to investigate the possible association between development of a particular disease and the value of a putative risk factor measured on an ordinal scale. Let E denote a subject's true risk factor value and let E* denote a subject's recorded risk factor value. Misclassification bias occurs if conclusions reached regarding the relationship between disease status and E* do not also apply to the relationship between disease status and E. We propose a model for the conditional probability distribution of E* given E. We show how the model may be used to investigate misclassification bias in a validation study where measurements of E* and E are available for both cases and controls and apply the methods developed to data from a test-retest study of recall bias in the context of screening for hypertension. We also consider a situation where the validation study is carried out on a subset of the subjects within a larger case-control study. In that case, values for E* are available for all subjects but values for E are available only for those subjects included in the validation study. We show how correct likelihood-based inference concerning association between disease status and risk factor value may be carried out using all of the available data. A Monte Carlo study shows how the inclusion of a validation study leads to a correction of recall bias problems at the cost of an increased standard error for the estimated association parameter.
考虑一项病例对照研究,旨在调查某一特定疾病的发生与按有序尺度测量的假定风险因素值之间的可能关联。设E表示受试者的真实风险因素值,E表示受试者记录的风险因素值。如果关于疾病状态与E之间关系得出的结论不适用于疾病状态与E之间的关系,就会出现错误分类偏倚。我们提出了一个给定E时E的条件概率分布模型。我们展示了该模型如何用于在验证研究中调查错误分类偏倚,在该研究中病例和对照的E和E测量值均可用,并将所开发的方法应用于高血压筛查背景下回忆偏倚的重测研究数据。我们还考虑了在一个更大的病例对照研究中的一部分受试者上进行验证研究的情况。在那种情况下,所有受试者的E*值可用,但只有验证研究中包含的那些受试者的E值可用。我们展示了如何使用所有可用数据对疾病状态与风险因素值之间的关联进行基于似然性的正确推断。一项蒙特卡罗研究表明,纳入验证研究如何以估计关联参数的标准误差增加为代价,导致回忆偏倚问题得到校正。