Influential observations in weighted analyses: examples from the National Longitudinal Survey of Children and Youth (NLSCY).

Author information

Macnab Jennifer J, Koval J J, Speechley K N, Campbell M K

Affiliation

Department of Epidemiology and Biostatistics, Faculty of Medicine and Dentistry, University of Western Ontario, London, Ontario, N6A 5C1, Canada.

Publication information

Chronic Dis Can. 2005 Winter;26(1):1-8.

Abstract

This paper highlights the impact of survey weights on model fit in multiple linear regression, with specific reference to the National Longitudinal Survey of Children and Youth (NLSCY), and provides recommendations for the treatment of influential observations. Multiple linear regression was used to estimate the association between child and family factors in the preschool years and vocabulary development at school age. Analyses were performed with and without survey weights. Model fit was assessed by examining the distribution of the studentized residuals and the change in the regression coefficients that would occur if an observation were removed. Two summary measures of influence, DFFITS and Cook's D, are reported. The models were refit excluding influential observations. Weighting the linear model caused previously non-influential observations to exert undue influence on the estimated regression parameters. The influential observations were driven primarily by the size of the survey weight rather than by unusual values of x and y. Researchers working with large national health surveys such as the NLSCY and the National Population Health Survey (NPHS) are advised to carry out a detailed influence analysis before drawing any final conclusions.
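
The diagnostics named in the abstract (leverage, studentized residuals, DFFITS and Cook's D) can be illustrated with a minimal sketch. This is not the authors' code and uses no NLSCY data: the function name `weighted_influence`, the synthetic data, and the single large weight are all hypothetical, the model is reduced to one predictor so that the weighted least-squares algebra fits in pure Python, and the flagging cutoffs (|DFFITS| > 2*sqrt(p/n), Cook's D > 4/n) are common rules of thumb rather than values taken from the paper.

```python
import math

def weighted_influence(x, y, w):
    """Weighted fit of y = b0 + b1*x plus per-observation influence measures.

    Hypothetical helper for illustration; w holds the (survey) weights.
    """
    n, p = len(x), 2  # p = number of regression parameters
    # Entries of X'WX and X'Wy for the design matrix with columns [1, x]
    sw = sum(w)
    swx = sum(wi * xi for wi, xi in zip(w, x))
    swxx = sum(wi * xi * xi for wi, xi in zip(w, x))
    swy = sum(wi * yi for wi, yi in zip(w, y))
    swxy = sum(wi * xi * yi for wi, xi, yi in zip(w, x, y))
    det = sw * swxx - swx ** 2
    b0 = (swxx * swy - swx * swxy) / det
    b1 = (sw * swxy - swx * swy) / det
    # Leverage h_i = w_i * [1, x_i] (X'WX)^-1 [1, x_i]'
    h = [wi * (swxx - 2 * swx * xi + sw * xi * xi) / det for wi, xi in zip(w, x)]
    # Residuals on the weighted scale: sqrt(w_i) * (y_i - fitted_i)
    e = [math.sqrt(wi) * (yi - b0 - b1 * xi) for wi, xi, yi in zip(w, x, y)]
    sse = sum(ei ** 2 for ei in e)
    s2 = sse / (n - p)
    rows = []
    for ei, hi in zip(e, h):
        # Error variance estimated with observation i deleted
        s2_del = (sse - ei ** 2 / (1 - hi)) / (n - p - 1)
        t = ei / math.sqrt(s2_del * (1 - hi))  # externally studentized residual
        rows.append({
            "leverage": hi,
            "student": t,
            "dffits": t * math.sqrt(hi / (1 - hi)),
            "cooks_d": ei ** 2 * hi / (p * s2 * (1 - hi) ** 2),
        })
    return (b0, b1), rows

# Synthetic illustration only: ten observations, one with a large weight.
x = [float(i) for i in range(1, 11)]
noise = [0.3, -0.2, 0.1, -0.4, 0.2, 0.5, -0.3, 0.1, -0.1, 0.4]
y = [2.0 + 0.5 * xi + ni for xi, ni in zip(x, noise)]
w_equal = [1.0] * 10
w_survey = [1.0] * 10
w_survey[5] = 20.0  # hypothetical large survey weight

for label, w in (("unweighted", w_equal), ("weighted", w_survey)):
    coef, diag = weighted_influence(x, y, w)
    n, p = len(x), 2
    flagged = [i for i, d in enumerate(diag)
               if abs(d["dffits"]) > 2 * math.sqrt(p / n) or d["cooks_d"] > 4 / n]
    print(label, "coefficients:", coef, "flagged indices:", flagged)
```

In this sketch the heavily weighted observation has an unremarkable x value, yet its leverage rises sharply once the weight is applied, which is the mechanism the abstract describes: influence driven by the size of the weight rather than by unusual values of x and y.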
