• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用稀疏数据有效测量识别性能。

Efficiently measuring recognition performance with sparse data.

作者信息

Schooler Lael J, Shiffrin Richard M

机构信息

Center for Adaptive Behavior and Cognition, Max Planck Institute for Human Development, Berlin, Germany.

出版信息

Behav Res Methods. 2005 Feb;37(1):3-10. doi: 10.3758/bf03206393.

DOI:10.3758/bf03206393
PMID:16097339
Abstract

We examine methods for measuring performance in signal-detection-like tasks when each participant provides only a few observations. Monte Carlo simulations demonstrate that standard statistical techniques applied to a d' analysis can lead to large numbers of Type I errors (incorrectly rejecting a hypothesis of no difference). Various statistical methods were compared in terms of their Type I and Type II error (incorrectly accepting a hypothesis of no difference) rates. Our conclusions are the same whether these two types of errors are weighted equally or Type I errors are weighted more heavily. The most promising method is to combine an aggregate d' measure with a percentile bootstrap confidence interval, a computer-intensive nonparametric method of statistical inference. Researchers who prefer statistical techniques more commonly used in psychology, such as a repeated measures t test, should use gamma (Goodman & Kruskal, 1954), since it performs slightly better than or nearly as well as d'. In general, when repeated measures t tests are used, gamma is more conservative than d': It makes more Type II errors, but its Type I error rate tends to be much closer to that of the traditional .05 alpha level. It is somewhat surprising that gamma performs as well as it does, given that the simulations that generated the hypothetical data conformed completely to the d' model. Analyses in which H--FA was used had the highest Type I error rates. Detailed simulation results can be downloaded from www.psychonomic.org/archive/Schooler-BRM-2004.zip.

摘要

我们研究了在每个参与者仅提供少量观察数据的类似信号检测任务中衡量表现的方法。蒙特卡罗模拟表明,应用于d'分析的标准统计技术可能会导致大量的I类错误(错误地拒绝无差异假设)。我们根据I类和II类错误(错误地接受无差异假设)率对各种统计方法进行了比较。无论这两类错误是同等加权还是I类错误加权更重,我们的结论都是相同的。最有前景的方法是将综合d'度量与百分位数自助置信区间相结合,这是一种计算密集型的非参数统计推断方法。更喜欢心理学中更常用统计技术(如重复测量t检验)的研究人员应使用伽马(古德曼和克鲁斯卡,1954),因为它的表现略优于d'或与d'相近。一般来说,当使用重复测量t检验时,伽马比d'更保守:它会产生更多的II类错误,但其I类错误率往往更接近传统的.05显著性水平。鉴于生成假设数据的模拟完全符合d'模型,伽马的表现如此之好有点令人惊讶。使用H - FA的分析具有最高的I类错误率。详细的模拟结果可从www.psychonomic.org/archive/Schooler - BRM - 2004.zip下载。

相似文献

1
Efficiently measuring recognition performance with sparse data.使用稀疏数据有效测量识别性能。
Behav Res Methods. 2005 Feb;37(1):3-10. doi: 10.3758/bf03206393.
2
Type I error rates and power analyses for single-point sensitivity measures.单点敏感性测量的I型错误率及效能分析。
Percept Psychophys. 2008 Feb;70(2):389-401. doi: 10.3758/pp.70.2.389.
3
Exploring function estimators as an alternative to regression in psychology.探索函数估计器作为心理学中回归分析的替代方法。
Behav Res Methods. 2005 Feb;37(1):23-36. doi: 10.3758/bf03206395.
4
The impact of nonnormality on full information maximum-likelihood estimation for structural equation models with missing data.非正态性对具有缺失数据的结构方程模型的全信息极大似然估计的影响。
Psychol Methods. 2001 Dec;6(4):352-70.
5
A comparison of different strategies for computing confidence intervals of the linkage disequilibrium measure D'.连锁不平衡度量D'的置信区间计算的不同策略比较。
Pac Symp Biocomput. 2004:128-39. doi: 10.1142/9789812704856_0013.
6
Random assignment of available cases: bootstrap standard errors and confidence intervals.可用病例的随机分配:自助法标准误差和置信区间。
Psychol Methods. 2001 Dec;6(4):402-12.
7
Analyzing recognition performance with sparse data.使用稀疏数据分析识别性能。
Behav Res Methods. 2008 Aug;40(3):722-7. doi: 10.3758/brm.40.3.722.
8
Standard errors and confidence intervals for correlations corrected for indirect range restriction: A simulation study comparing analytic and bootstrap methods.针对间接范围限制校正后的相关性的标准误差和置信区间:一项比较分析方法和自助法的模拟研究。
Br J Math Stat Psychol. 2018 Feb;71(1):39-59. doi: 10.1111/bmsp.12105. Epub 2017 Jun 20.
9
Measures of metacognition on signal-detection theoretic models.信号检测理论模型的元认知测量。
Psychol Methods. 2013 Dec;18(4):535-52. doi: 10.1037/a0033268. Epub 2013 Sep 30.
10
One-sided nonparametric tests for ordinal data.有序数据的单侧非参数检验。
Percept Mot Skills. 2005 Oct;101(2):510-4. doi: 10.2466/pms.101.2.510-514.

引用本文的文献

1
Assessing the distortions introduced when calculating d': A simulation approach.评估计算 d'时引入的扭曲:一种模拟方法。
Behav Res Methods. 2024 Oct;56(7):7728-7747. doi: 10.3758/s13428-024-02447-8. Epub 2024 Jul 3.
2
The role of sparsely distributed representations in familiarity recognition of verbal and olfactory materials.稀疏分布表征在言语和嗅觉材料熟悉度识别中的作用。
Cogn Process. 2018 Nov;19(4):481-494. doi: 10.1007/s10339-018-0862-9. Epub 2018 Apr 20.
3
The influence of direct and indirect speech on mental representations.
直接引语和间接引语对心理表象的影响。
PLoS One. 2013 Jun 12;8(6):e65480. doi: 10.1371/journal.pone.0065480. Print 2013.
4
The good, the bad, and the rare: memory for partners in social interactions.好的、坏的和稀有的:社交互动中对伴侣的记忆。
PLoS One. 2011 Apr 29;6(4):e18945. doi: 10.1371/journal.pone.0018945.
5
Delayed memory for visual-haptic exploration of familiar objects.对熟悉物体进行视觉-触觉探索的延迟记忆。
Psychon Bull Rev. 2008 Jun;15(3):574-80. doi: 10.3758/pbr.15.3.574.