• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较不同分布和样本大小下的 Pearson 和 Spearman 相关系数:使用模拟和实证数据的教程。

Comparing the Pearson and Spearman correlation coefficients across distributions and sample sizes: A tutorial using simulations and empirical data.

机构信息

Department of BioMechanical Engineering, Faculty of Mechanical, Maritime and Materials Engineering.

Department of Psychology, University of Texas.

出版信息

Psychol Methods. 2016 Sep;21(3):273-90. doi: 10.1037/met0000079. Epub 2016 May 23.

DOI:10.1037/met0000079
PMID:27213982
Abstract

The Pearson product–moment correlation coefficient () and the Spearman rank correlation coefficient () are widely used in psychological research. We compare and on 3 criteria: variability, bias with respect to the population value, and robustness to an outlier. Using simulations across low (N = 5) to high (N = 1,000) sample sizes we show that, for normally distributed variables, and have similar expected values but is more variable, especially when the correlation is strong. However, when the variables have high kurtosis, is more variable than . Next, we conducted a sampling study of a psychometric dataset featuring symmetrically distributed data with light tails, and of 2 Likert-type survey datasets, 1 with light-tailed and the other with heavy-tailed distributions. Consistent with the simulations, had lower variability than in the psychometric dataset. In the survey datasets with heavy-tailed variables in particular, had lower variability than , and often corresponded more accurately to the population Pearson correlation coefficient () than did. The simulations and the sampling studies showed that variability in terms of standard deviations can be reduced by about 20% by choosing instead of . In comparison, increasing the sample size by a factor of 2 results in a 41% reduction of the standard deviations of and . In conclusion, is suitable for light-tailed distributions, whereas is preferable when variables feature heavy-tailed distributions or when outliers are present, as is often the case in psychological research.

摘要

皮尔逊积矩相关系数()和斯皮尔曼等级相关系数()在心理研究中被广泛使用。我们将在 3 个标准上比较和:变异性、对总体值的偏差以及对异常值的稳健性。通过在低(N=5)到高(N=1000)样本量的模拟,我们表明,对于正态分布的变量,和具有相似的期望值,但更具变异性,尤其是当相关性较强时。然而,当变量具有高峰度时,比更具变异性。接下来,我们对一个具有对称分布数据和轻尾的心理计量数据集,以及 2 个李克特类型的调查数据集进行了抽样研究,其中一个具有轻尾,另一个具有重尾分布。与模拟结果一致,在心理计量数据集中,比更具变异性。特别是在具有重尾变量的调查数据集中,比更具变异性,并且通常比更准确地对应于总体皮尔逊相关系数()。模拟和抽样研究表明,通过选择而不是,可以将标准差的变异性降低约 20%。相比之下,将样本量增加两倍会导致和的标准差分别降低 41%。总之,适合轻尾分布,而当变量具有重尾分布或存在异常值时,更适合,这在心理研究中经常发生。

相似文献

1
Comparing the Pearson and Spearman correlation coefficients across distributions and sample sizes: A tutorial using simulations and empirical data.比较不同分布和样本大小下的 Pearson 和 Spearman 相关系数:使用模拟和实证数据的教程。
Psychol Methods. 2016 Sep;21(3):273-90. doi: 10.1037/met0000079. Epub 2016 May 23.
2
Location tests for biomarker studies: a comparison using simulations for the two-sample case.生物标志物研究的定位测试:两样本情形下的模拟比较
Methods Inf Med. 2013;52(4):351-9. doi: 10.3414/ME12-02-0014. Epub 2013 Jul 23.
3
Correlation Coefficients: Appropriate Use and Interpretation.相关系数:合理使用与解释。
Anesth Analg. 2018 May;126(5):1763-1768. doi: 10.1213/ANE.0000000000002864.
4
Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality.减少由于非正态性导致的相关系数偏差和误差。
Educ Psychol Meas. 2015 Oct;75(5):785-804. doi: 10.1177/0013164414557639. Epub 2014 Nov 11.
5
To transform or not transform skewed data for psychometric analysis: that is the question!在心理测量分析中是否对偏态数据进行转换:这就是问题所在!
Nurs Res. 2004 Jan-Feb;53(1):67-71. doi: 10.1097/00006199-200401000-00011.
6
The effects of sample size and variability on the correlation coefficient.样本大小和变异性对相关系数的影响。
Med Sci Sports Exerc. 1996 Mar;28(3):386-91. doi: 10.1097/00005768-199603000-00015.
7
Assessing the performance of MM/PBSA and MM/GBSA methods. 4. Accuracies of MM/PBSA and MM/GBSA methodologies evaluated by various simulation protocols using PDBbind data set.评估MM/PBSA和MM/GBSA方法的性能。4. 使用PDBbind数据集通过各种模拟协议评估MM/PBSA和MM/GBSA方法的准确性。
Phys Chem Chem Phys. 2014 Aug 21;16(31):16719-29. doi: 10.1039/c4cp01388c.
8
Testing the significance of a correlation with nonnormal data: comparison of Pearson, Spearman, transformation, and resampling approaches.用非正态数据检验相关性的显著性:皮尔逊、斯皮尔曼、转换和重抽样方法的比较。
Psychol Methods. 2012 Sep;17(3):399-417. doi: 10.1037/a0028087. Epub 2012 May 7.
9
Population models and simulation methods: The case of the Spearman rank correlation.人口模型与模拟方法:以斯皮尔曼等级相关为例。
Br J Math Stat Psychol. 2017 Nov;70(3):347-367. doi: 10.1111/bmsp.12085. Epub 2017 Jan 31.
10
How accurate is the Pearson r-from-Z approximation? A Monte Carlo simulation study.皮尔逊r值的Z近似法有多准确?一项蒙特卡洛模拟研究。
J Gen Psychol. 2012 Apr-Jun;139(2):68-77. doi: 10.1080/00221309.2012.661376.

引用本文的文献

1
Frailty prediction in patients with chronic digestive system diseases: based on multi-task learning model.慢性消化系统疾病患者的衰弱预测:基于多任务学习模型
Front Med (Lausanne). 2025 Aug 26;12:1633890. doi: 10.3389/fmed.2025.1633890. eCollection 2025.
2
Bridging the Gap: The Impact of Preoperative Nutritional Status and Inflammation in Postoperative Pain in Elderly Patients.弥合差距:术前营养状况和炎症对老年患者术后疼痛的影响
Anesthesiol Res Pract. 2025 Aug 28;2025:6832202. doi: 10.1155/anrp/6832202. eCollection 2025.
3
Body and Testicular Biometry and Epididymal Sperm Characteristics in Sambar Deer (Rusa unicolor (Kerr, 1792)).
水鹿(Rusa unicolor (Kerr, 1792))的身体和睾丸生物测量以及附睾精子特征
Reprod Domest Anim. 2025 Sep;60(9):e70122. doi: 10.1111/rda.70122.
4
Exploring the correlation between gait speed and balance in limb prosthesis users: A pilot study.探索肢体假肢使用者步态速度与平衡之间的相关性:一项初步研究。
Can Prosthet Orthot J. 2025 Aug 21;8(1):45517. doi: 10.33137/cpoj.v8i1.45517. eCollection 2025.
5
Leveraging advanced ensemble learning techniques for methane uptake prediction in metal organic frameworks.利用先进的集成学习技术预测金属有机框架中的甲烷吸收量。
Sci Rep. 2025 Aug 29;15(1):31832. doi: 10.1038/s41598-025-17028-8.
6
Microbial Community Assembly Mechanisms of Groundwater Under Salinity-Oxygen Stress in the Golmud River Watershed, Northwest China.中国西北格尔木河流域盐度-氧气胁迫下地下水微生物群落组装机制
Life (Basel). 2025 Aug 15;15(8):1301. doi: 10.3390/life15081301.
7
Multiplex nodal modularity: A novel network metric for the regional analysis of amnestic mild cognitive impairment during a working memory binding task.多重节点模块化:一种用于在工作记忆绑定任务期间对遗忘型轻度认知障碍进行区域分析的新型网络指标。
PLoS One. 2025 Aug 22;20(8):e0328736. doi: 10.1371/journal.pone.0328736. eCollection 2025.
8
Prevalence and predictors of medical information avoidance: a systematic review and meta-analysis.医疗信息回避的患病率及预测因素:一项系统综述与荟萃分析。
Ann Behav Med. 2025 Jan 4;59(1). doi: 10.1093/abm/kaaf058.
9
Deformable image registration of dark-field chest radiographs for functional lung assessment.用于功能性肺部评估的暗场胸部X光片的可变形图像配准
Med Phys. 2025 Aug;52(8):e18023. doi: 10.1002/mp.18023.
10
Dynamic gene regulatory networks drive seed dormancy and germination of Pinus tabuliformis.动态基因调控网络驱动油松种子休眠与萌发。
BMC Plant Biol. 2025 Aug 6;25(1):1032. doi: 10.1186/s12870-025-07041-4.