A latent variable model approach to estimating systematic bias in the oversampling method.

Affiliation

Department of Psychology, Northwestern University, Evanston, IL, USA.

Publication information

Behav Res Methods. 2014 Sep;46(3):786-97. doi: 10.3758/s13428-013-0402-6.

Abstract

The method of oversampling data from a preselected range of a variable's distribution is often applied by researchers who wish to study rare outcomes without substantially increasing sample size. Despite frequent use, however, it is not known whether this method introduces statistical bias due to disproportionate representation of a particular range of data. The present study employed simulated data sets to examine how oversampling introduces systematic bias in effect size estimates (of the relationship between oversampled predictor variables and the outcome variable), as compared with estimates based on a random sample. In general, results indicated that increased oversampling was associated with a decrease in the absolute value of effect size estimates. Critically, however, the actual magnitude of this decrease in effect size estimates was nominal. This finding thus provides the first evidence that the use of the oversampling method does not systematically bias results to a degree that would typically impact results in behavioral research. Examining the effect of sample size on oversampling yielded an additional important finding: For smaller samples, the use of oversampling may be necessary to avoid spuriously inflated effect sizes, which can arise when the number of predictor variables and rare outcomes is comparable.
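The comparison described in the abstract (effect size estimates under increasing oversampling versus a random sample) can be illustrated with a small simulation. The following is only a minimal sketch under stated assumptions: it uses a plain bivariate-normal population and a Pearson correlation, not the latent variable model used in the study, and the names `make_population`, `draw_sample`, and `compare`, the 75th-percentile cutoff, and all parameter values are hypothetical choices made for illustration. It is not expected to reproduce the size or direction of the effects reported in the study.

```python
import random


def make_population(rho, size, rng):
    """Simulate (x, y) pairs from a bivariate standard normal with correlation rho."""
    pop = []
    for _ in range(size):
        x = rng.gauss(0.0, 1.0)
        e = rng.gauss(0.0, 1.0)
        pop.append((x, rho * x + (1.0 - rho ** 2) ** 0.5 * e))
    return pop


def pearson_r(pairs):
    """Pearson correlation between the two coordinates of a list of pairs."""
    n = len(pairs)
    mx = sum(p[0] for p in pairs) / n
    my = sum(p[1] for p in pairs) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in pairs)
    sxx = sum((p[0] - mx) ** 2 for p in pairs)
    syy = sum((p[1] - my) ** 2 for p in pairs)
    return sxy / (sxx * syy) ** 0.5


def draw_sample(pop, tail, n, oversample_frac, rng):
    """Draw n cases: a fraction from the preselected tail, the rest at random."""
    n_over = int(round(n * oversample_frac))
    return rng.choices(tail, k=n_over) + rng.choices(pop, k=n - n_over)


def compare(rho=0.3, n=300, fracs=(0.0, 0.25, 0.5), reps=100, seed=0):
    """Mean correlation estimate for each oversampled fraction."""
    rng = random.Random(seed)
    pop = make_population(rho, 50_000, rng)
    # Assumed preselected range: the top quarter of the predictor x.
    cutoff = sorted(p[0] for p in pop)[int(0.75 * len(pop))]
    tail = [p for p in pop if p[0] >= cutoff]
    results = {}
    for frac in fracs:
        estimates = [pearson_r(draw_sample(pop, tail, n, frac, rng))
                     for _ in range(reps)]
        results[frac] = sum(estimates) / reps
    return results


if __name__ == "__main__":
    for frac, r in compare().items():
        print(f"oversampled fraction {frac:.2f}: mean r = {r:.3f}")
```

With an oversampled fraction of 0.0 the sample is purely random, so its mean estimate serves as the baseline against which the other fractions are compared.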

Similar articles

The effect of sample size and bias on the reliability of estimates of error: a comparative study of Dahlberg's formula.

Eur J Orthod. 2012 Apr;34(2):158-63. doi: 10.1093/ejo/cjr010. Epub 2011 Mar 29.

Some cautions on the use of instrumental variables estimators in outcomes research: how bias in instrumental variables estimators is affected by instrument strength, instrument contamination, and sample size.

Value Health. 2011 Dec;14(8):1078-84. doi: 10.1016/j.jval.2011.06.009. Epub 2011 Oct 1.

Power and sample size when multiple endpoints are considered.

Pharm Stat. 2007 Jul-Sep;6(3):161-70. doi: 10.1002/pst.301.

Polychotomization of continuous variables in regression models based on the overall C index.

BMC Med Inform Decis Mak. 2006 Dec 14;6:41. doi: 10.1186/1472-6947-6-41.

Accounting for response misclassification and covariate measurement error improves power and reduces bias in epidemiologic studies.

Ann Epidemiol. 2010 Jul;20(7):562-7. doi: 10.1016/j.annepidem.2010.03.012.

Rapid gridding reconstruction with a minimal oversampling ratio.

IEEE Trans Med Imaging. 2005 Jun;24(6):799-808. doi: 10.1109/TMI.2005.848376.

The number of subjects per variable required in linear regression analyses.

J Clin Epidemiol. 2015 Jun;68(6):627-36. doi: 10.1016/j.jclinepi.2014.12.014. Epub 2015 Jan 22.

Assessment of small health risks based on exact sample sizes.

Stat Med. 1996 Jan 30;15(2):183-95. doi: 10.1002/(SICI)1097-0258(19960130)15:2<183::AID-SIM154>3.0.CO;2-S.

Concepts in sample size determination.

Indian J Dent Res. 2012 Sep-Oct;23(5):660-4. doi: 10.4103/0970-9290.107385.

Cited by

A Shared Threat-Anticipation Circuit Is Dynamically Engaged at Different Moments by Certain and Uncertain Threat.

J Neurosci. 2025 Apr 16;45(16):e2113242025. doi: 10.1523/JNEUROSCI.2113-24.2025.

Neuroticism Is Prospectively Associated With 30-Month Changes in Broadband Internalizing Symptoms, but Not Narrowband Positive Affect or Anxious Arousal, in Emerging Adulthood.

Clin Psychol Sci. 2024 Sep;12(5):823-839. doi: 10.1177/21677026231205270. Epub 2023 Oct 24.

A shared threat-anticipation circuit is dynamically engaged at different moments by certain and uncertain threat.

bioRxiv. 2025 Feb 4:2024.07.10.602972. doi: 10.1101/2024.07.10.602972.

Neuroticism/Negative Emotionality Is Associated with Increased Reactivity to Uncertain Threat in the Bed Nucleus of the Stria Terminalis, Not the Amygdala.

J Neurosci. 2024 Aug 7;44(32):e1868232024. doi: 10.1523/JNEUROSCI.1868-23.2024.

Five-year follow-up of the iBerry Study: screening in early adolescence to identify those at risk of psychopathology in emerging adulthood.

Eur Child Adolesc Psychiatry. 2024 Dec;33(12):4285-4294. doi: 10.1007/s00787-024-02462-2. Epub 2024 May 22.

Personality predicts pre-COVID-19 to COVID-19 trajectories of transdiagnostic anxiety and depression symptoms.

J Psychopathol Clin Sci. 2023 Aug;132(6):645-656. doi: 10.1037/abn0000803. Epub 2023 Jun 1.

Neuroticism/negative emotionality is associated with increased reactivity to uncertain threat in the bed nucleus of the stria terminalis, not the amygdala.

bioRxiv. 2024 Jun 1:2023.02.09.527767. doi: 10.1101/2023.02.09.527767.

Psychol Sci. 2022 Jun;33(6):906-924. doi: 10.1177/09567976211056635. Epub 2022 Jun 3.

Five-year prospective neuroticism-stress effects on major depressive episodes: Primarily additive effects of the general neuroticism factor and stress.

J Abnorm Psychol. 2020 Aug;129(6):646-657. doi: 10.1037/abn0000530. Epub 2020 Jun 1.

Cortisol awakening response and additive serotonergic genetic risk interactively predict depression in two samples: The 2019 Donald F. Klein Early Career Investigator Award Paper.

Depress Anxiety. 2019 Jun;36(6):480-489. doi: 10.1002/da.22899. Epub 2019 Apr 24.
