• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

问卷歧视:(重新)引入德尔塔系数。

Questionnaire discrimination: (re)-introducing coefficient delta.

作者信息

Hankins Matthew

机构信息

King's College London, Department of Psychology (at Guy's), Institute of Psychiatry, London, UK.

出版信息

BMC Med Res Methodol. 2007 May 18;7:19. doi: 10.1186/1471-2288-7-19.

DOI:10.1186/1471-2288-7-19
PMID:17511862
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1884165/
Abstract

BACKGROUND

Questionnaires are used routinely in clinical research to measure health status and quality of life. Questionnaire measurements are traditionally formally assessed by indices of reliability (the degree of measurement error) and validity (the extent to which the questionnaire measures what it is supposed to measure). Neither of these indices assesses the degree to which the questionnaire is able to discriminate between individuals, an important aspect of measurement. This paper introduces and extends an existing index of a questionnaire's ability to distinguish between individuals, that is, the questionnaire's discrimination.

METHODS

Ferguson (1949) 1 derived an index of test discrimination, coefficient delta, for psychometric tests with dichotomous (correct/incorrect) items. In this paper a general form of the formula, deltaG, is derived for the more general class of questionnaires allowing for several response choices. The calculation and characteristics of deltaG are then demonstrated using questionnaire data (GHQ-12) from 2003-2004 British Household Panel Survey (N = 14761). Coefficients for reliability (alpha) and discrimination (deltaG) are computed for two commonly-used GHQ-12 coding methods: dichotomous coding and four-point Likert-type coding.

RESULTS

Both scoring methods were reliable (alpha > 0.88). However, deltaG was substantially lower (0.73) for the dichotomous coding of the GHQ-12 than for the Likert-type method (deltaG = 0.96), indicating that the dichotomous coding, although reliable, failed to discriminate between individuals.

CONCLUSION

Coefficient deltaG was shown to have decisive utility in distinguishing between the cross-sectional discrimination of two equally reliable scoring methods. Ferguson's delta has been neglected in discussions of questionnaire design and performance, perhaps because it has not been implemented in software and was restricted to questionnaires with dichotomous items, which are rare in health care research. It is suggested that the more general formula introduced here is reported as deltaG, to avoid the implication that items are dichotomously coded.

摘要

背景

问卷调查在临床研究中常用于测量健康状况和生活质量。传统上,问卷测量通过可靠性指标(测量误差程度)和效度指标(问卷测量其预期测量内容的程度)进行正式评估。这两个指标均未评估问卷区分个体的能力,而这是测量的一个重要方面。本文介绍并扩展了一个现有的衡量问卷区分个体能力的指标,即问卷的区分度。

方法

弗格森(1949年)1为具有二分法(正确/错误)项目的心理测量测试推导了一个测试区分度指标,即德尔塔系数。本文针对允许有多种回答选项的更一般类型的问卷推导了该公式的一般形式,即德尔塔G。然后使用2003 - 2004年英国家庭小组调查(N = 14761)的问卷数据(一般健康问卷 - 12项,GHQ - 12)展示德尔塔G的计算和特征。针对两种常用的GHQ - 12编码方法:二分法编码和四点李克特式编码,计算可靠性系数(阿尔法)和区分度系数(德尔塔G)。

结果

两种计分方法都具有可靠性(阿尔法> 0.88)。然而,GHQ - 12的二分法编码的德尔塔G(0.73)显著低于李克特式方法(德尔塔G = 0.96),这表明二分法编码虽然可靠,但未能区分个体。

结论

结果表明,系数德尔塔G在区分两种同样可靠的计分方法的横断面区分度方面具有决定性作用。在问卷设计和性能的讨论中,弗格森的德尔塔被忽视了,可能是因为它未在软件中实现,且仅限于具有二分法项目的问卷,而这类问卷在医疗保健研究中很少见。建议将此处引入的更一般公式报告为德尔塔G,以避免暗示项目采用二分法编码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2962/1884165/937c7e0633de/1471-2288-7-19-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2962/1884165/937c7e0633de/1471-2288-7-19-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2962/1884165/937c7e0633de/1471-2288-7-19-1.jpg

相似文献

1
Questionnaire discrimination: (re)-introducing coefficient delta.问卷歧视:(重新)引入德尔塔系数。
BMC Med Res Methodol. 2007 May 18;7:19. doi: 10.1186/1471-2288-7-19.
2
The reliability of the twelve-item general health questionnaire (GHQ-12) under realistic assumptions.在现实假设下,十二项一般健康问卷(GHQ - 12)的可靠性。
BMC Public Health. 2008 Oct 14;8:355. doi: 10.1186/1471-2458-8-355.
3
Reimagining the General Health Questionnaire as a measure of emotional wellbeing: a study of postpartum women in Malta.将一般健康问卷重新构想为衡量情感健康的工具:马耳他产后妇女的研究。
Women Birth. 2013 Dec;26(4):e105-11. doi: 10.1016/j.wombi.2013.06.002. Epub 2013 Jul 23.
4
Comparison of Ferguson's δ and the Gini coefficient used for measuring the inequality of data related to health quality of life outcomes.比较弗格森的δ与基尼系数,用于衡量与健康生活质量结果相关的数据的不平等性。
Health Qual Life Outcomes. 2020 Apr 28;18(1):111. doi: 10.1186/s12955-020-01356-6.
5
Using the 12-item General Health Questionnaire to screen psychological distress from survivorship to end-of-life care: dimensionality and item quality.使用 12 项一般健康问卷从生存到临终关怀筛查心理困扰:维度和项目质量。
Psychooncology. 2012 Sep;21(9):954-61. doi: 10.1002/pon.1989. Epub 2011 May 9.
6
Factor structure and psychometric properties of the General Health Questionnaire (GHQ-12) among Ghanaian adolescents.加纳青少年中一般健康问卷(GHQ - 12)的因子结构和心理测量特性
J Child Adolesc Ment Health. 2015;27(1):53-7. doi: 10.2989/17280583.2015.1007867.
7
[Validation of a self-administered functional evaluation questionnaire after surgical treatment of lumbar spine stenosis].腰椎管狭窄症手术治疗后自我管理功能评估问卷的验证
Rev Chir Orthop Reparatrice Appar Mot. 2002 Oct;88(6):601-12.
8
The 12-item General Health Questionnaire (GHQ-12): translation and validation study of the Iranian version.12项一般健康问卷(GHQ - 12):伊朗版本的翻译与效度研究
Health Qual Life Outcomes. 2003 Nov 13;1:66. doi: 10.1186/1477-7525-1-66.
9
Identifying patients who require a change in their current acute migraine treatment: the Migraine Assessment of Current Therapy (Migraine-ACT) questionnaire.识别需要改变当前急性偏头痛治疗方案的患者:当前治疗偏头痛评估(Migraine-ACT)问卷。
Curr Med Res Opin. 2004 Jul;20(7):1125-35. doi: 10.1185/030079904125004079.
10
The Birmingham Relationship Continuity Measure: the development and evaluation of a measure of the perceived continuity of spousal relationships in dementia.伯明翰关系连续性量表:一种感知配偶在痴呆症中关系连续性的测量工具的开发和评估。
Int Psychogeriatr. 2013 Feb;25(2):263-74. doi: 10.1017/S1041610212001743. Epub 2012 Oct 30.

引用本文的文献

1
Dutch Nationwide Cohort Experience with a New PROMs Set in Metabolic and Bariatric Surgery: BODY-Q Obesity Module.荷兰全国性队列研究中代谢和减重手术新患者报告结局测量指标集(BODY-Q肥胖模块)的经验
Obes Surg. 2025 Jan;35(1):67-77. doi: 10.1007/s11695-024-07615-5. Epub 2024 Dec 26.
2
The ascendancy of research in acronyms related to COVID-19 displayed on a growth-share matrix (GSM): Bibliometric analysis.COVID-19 相关缩略语研究的崛起在增长份额矩阵(GSM)上的表现:文献计量分析。
Medicine (Baltimore). 2023 Apr 25;102(17):e33626. doi: 10.1097/MD.0000000000033626.
3
French validation of the Weight Efficacy Life-Style questionnaire (WEL): Links with mood, self-esteem and stress among the general population and a clinical sample of individuals with overweight and obesity.

本文引用的文献

1
On the theory of test discrimination.关于测验区分度的理论。
Psychometrika. 1949 Mar;14(1):61-8. doi: 10.1007/BF02290141.
2
Performance of health-status scales when used selectively or within multi-scale questionnaire.健康状况量表在选择性使用或在多量表问卷中使用时的表现。
BMC Med Res Methodol. 2003 Feb 13;3:3. doi: 10.1186/1471-2288-3-3.
体重效能量表(WEL)的法语验证:在普通人群和超重及肥胖人群的临床样本中与情绪、自尊和压力的关系。
PLoS One. 2021 Nov 16;16(11):e0259885. doi: 10.1371/journal.pone.0259885. eCollection 2021.
4
Psychometric analysis of the Brazilian-version Kidscreen-27 questionnaire.巴西版 Kidscreen-27 问卷的心理计量学分析。
Health Qual Life Outcomes. 2021 Jul 27;19(1):185. doi: 10.1186/s12955-021-01824-7.
5
Assessing Orthorexia Nervosa: Validation of the Polish Version of the Eating Habits Questionnaire in a General Population Sample.评估饮食正常强迫症:一般人群样本中饮食习惯问卷的波兰文版验证。
Nutrients. 2020 Dec 14;12(12):3820. doi: 10.3390/nu12123820.
6
An Assessment of the Psychometric Properties of the GHQ-12 in an English Population of Autistic Adults Without Learning Difficulties.《无学习困难的自闭症成年英语人群中 GHQ-12 的心理测量特性评估》
J Autism Dev Disord. 2021 Apr;51(4):1093-1106. doi: 10.1007/s10803-020-04604-2.
7
Comparison of Ferguson's δ and the Gini coefficient used for measuring the inequality of data related to health quality of life outcomes.比较弗格森的δ与基尼系数,用于衡量与健康生活质量结果相关的数据的不平等性。
Health Qual Life Outcomes. 2020 Apr 28;18(1):111. doi: 10.1186/s12955-020-01356-6.
8
Development and validation of the Italian version of the Mobile Application Rating Scale and its generalisability to apps targeting primary prevention.意大利语版移动应用程序评分量表的开发与验证及其对初级预防应用程序的通用性
BMC Med Inform Decis Mak. 2016 Jul 7;16:83. doi: 10.1186/s12911-016-0323-2.
9
Validation of the French translation-adaptation of the impact of cancer questionnaire version 2 (IOCv2) in a breast cancer survivor population.癌症问卷第2版(IOCv2)法语翻译改编版在乳腺癌幸存者群体中的验证。
Health Qual Life Outcomes. 2015 Jul 29;13:110. doi: 10.1186/s12955-015-0301-x.
10
Intraclass reliability for assessing how well Taiwan constrained hospital-provided medical services using statistical process control chart techniques.使用统计过程控制图技术评估台湾限制医院提供医疗服务的效果的组内可靠性。
BMC Med Res Methodol. 2012 May 15;12:67. doi: 10.1186/1471-2288-12-67.