Feldman H A, Goldstein I, Hatzichristou D G, Krane R J, McKinlay J B
New England Research Institute, Watertown, MA 02172, USA.
J Clin Epidemiol. 1994 May;47(5):457-67. doi: 10.1016/0895-4356(94)90292-5.
Uncollected data must be filled in after the fact in many epidemiological studies. We compared several methods for constructing a surrogate variable for erectile potency, based on responses to related questions, in the Massachusetts Male Aging Study (MMAS). Quadratic discriminant analysis performed best among the procedures tested, achieving high rates of correct classification in cross-validation of training data and (according to posterior estimates) in classification of MMAS subjects. Two versions of the surrogate variable were constructed, one involving a discrete classification and the other involving a set of continuous probabilities. The two versions were used to investigate medical correlates of impotence in MMAS and produced similar results, although the continuous version appeared to provide higher statistical power. We conclude that construction of a surrogate variable is a viable and potentially beneficial option in epidemiological investigations.
在许多流行病学研究中,未收集的数据必须事后填写。在马萨诸塞州男性衰老研究(MMAS)中,我们基于对相关问题的回答,比较了几种构建勃起功能替代变量的方法。在测试的程序中,二次判别分析表现最佳,在训练数据的交叉验证以及(根据后验估计)MMAS受试者的分类中都实现了较高的正确分类率。构建了替代变量的两个版本,一个涉及离散分类,另一个涉及一组连续概率。这两个版本被用于研究MMAS中阳痿的医学关联,尽管连续版本似乎具有更高的统计效力,但产生了相似的结果。我们得出结论,在流行病学调查中构建替代变量是一个可行且可能有益的选择。