• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用于多标记调查项目的不同评分算法对结果评估的影响:一项关于健康相关知识的实地研究

Impact of different scoring algorithms applied to multiple-mark survey items on outcome assessment: an in-field study on health-related knowledge.

作者信息

Domnich A, Panatto D, Arata L, Bevilacqua I, Apprato L, Gasparini R, Amicizia D

机构信息

Department of Health Sciences, University of Genoa, Italy.

出版信息

J Prev Med Hyg. 2015;56(4):E162-71.

PMID:26900331
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4753817/
Abstract

INTRODUCTION

Health-related knowledge is often assessed through multiple-choice tests. Among the different types of formats, researchers may opt to use multiple-mark items, i.e. with more than one correct answer. Although multiple-mark items have long been used in the academic setting - sometimes with scant or inconclusive results - little is known about the implementation of this format in research on in-field health education and promotion.

METHODS

A study population of secondary school students completed a survey on nutrition-related knowledge, followed by a single- lecture intervention. Answers were scored by means of eight different scoring algorithms and analyzed from the perspective of classical test theory. The same survey was re-administered to a sample of the students in order to evaluate the short-term change in their knowledge.

RESULTS

In all, 286 questionnaires were analyzed. Partial scoring algorithms displayed better psychometric characteristics than the dichotomous rule. In particular, the algorithm proposed by Ripkey and the balanced rule showed greater internal consistency and relative efficiency in scoring multiple-mark items. A penalizing algorithm in which the proportion of marked distracters was subtracted from that of marked correct answers was the only one that highlighted a significant difference in performance between natives and immigrants, probably owing to its slightly better discriminatory ability. This algorithm was also associated with the largest effect size in the pre-/post-intervention score change.

DISCUSSION

The choice of an appropriate rule for scoring multiple- mark items in research on health education and promotion should consider not only the psychometric properties of single algorithms but also the study aims and outcomes, since scoring rules differ in terms of biasness, reliability, difficulty, sensitivity to guessing and discrimination.

摘要

引言

与健康相关的知识通常通过多项选择题测试来评估。在不同类型的题型中,研究人员可能会选择使用多标记题目,即有多个正确答案的题目。尽管多标记题目长期以来一直在学术环境中使用——有时结果甚微或尚无定论——但对于这种题型在现场健康教育与促进研究中的应用知之甚少。

方法

以中学生为研究对象,完成了一项关于营养相关知识的调查,随后进行了一次讲座干预。答案通过八种不同的评分算法进行评分,并从经典测试理论的角度进行分析。为了评估学生知识的短期变化,对部分学生样本重新进行了相同的调查。

结果

总共分析了286份问卷。部分评分算法显示出比二分法规则更好的心理测量特征。特别是,里普基提出的算法和平衡规则在对多标记题目进行评分时显示出更高的内部一致性和相对效率。一种惩罚算法,即从标记的正确答案比例中减去标记的干扰项比例,是唯一突出显示本地人和移民在表现上存在显著差异的算法,这可能是由于其稍好的区分能力。该算法在干预前/后分数变化中也与最大的效应量相关。

讨论

在健康教育与促进研究中,选择合适的多标记题目评分规则不仅应考虑单个算法的心理测量特性,还应考虑研究目的和结果,因为评分规则在偏差、可靠性、难度、对猜测的敏感性和区分度方面存在差异。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b60f/4753817/b46532b9c0df/2421-4248-56-E162-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b60f/4753817/89816a129ada/2421-4248-56-E162-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b60f/4753817/339487fbf8c8/2421-4248-56-E162-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b60f/4753817/b46532b9c0df/2421-4248-56-E162-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b60f/4753817/89816a129ada/2421-4248-56-E162-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b60f/4753817/339487fbf8c8/2421-4248-56-E162-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b60f/4753817/b46532b9c0df/2421-4248-56-E162-g003.jpg

相似文献

1
Impact of different scoring algorithms applied to multiple-mark survey items on outcome assessment: an in-field study on health-related knowledge.应用于多标记调查项目的不同评分算法对结果评估的影响:一项关于健康相关知识的实地研究
J Prev Med Hyg. 2015;56(4):E162-71.
2
Pick-N multiple choice-exams: a comparison of scoring algorithms.选择题组测验评分算法比较。
Adv Health Sci Educ Theory Pract. 2011 May;16(2):211-21. doi: 10.1007/s10459-010-9256-1. Epub 2010 Oct 31.
3
Multiple true-false items: a comparison of scoring algorithms.多项是非题:评分算法比较。
Adv Health Sci Educ Theory Pract. 2018 Aug;23(3):455-463. doi: 10.1007/s10459-017-9805-y. Epub 2017 Nov 30.
4
Decision making under internal uncertainty: the case of multiple-choice tests with different scoring rules.内部不确定性下的决策:不同评分规则的多项选择题情形
Acta Psychol (Amst). 2003 Feb;112(2):207-20. doi: 10.1016/s0001-6918(02)00085-9.
5
Improving multiple-choice questions to better assess dental student knowledge: distractor utilization in oral and maxillofacial pathology course examinations.改进选择题以更好地评估牙科学员的知识:口腔颌面病理学课程考试中的干扰项利用。
J Dent Educ. 2013 Dec;77(12):1593-609.
6
Scoring Single-Response Multiple-Choice Items: Scoping Review and Comparison of Different Scoring Methods.单项选择题评分:不同评分方法的范围审查与比较
JMIR Med Educ. 2023 May 19;9:e44084. doi: 10.2196/44084.
7
Evidence-based decision about test scoring rules in clinical anatomy multiple-choice examinations.临床解剖学多项选择题考试中基于证据的考试评分规则决策。
Anat Sci Educ. 2015 May-Jun;8(3):242-8. doi: 10.1002/ase.1478. Epub 2014 Jul 22.
8
[French version of TASTE (test for the ability and evaluation)].[TASTE(能力与评估测试)的法语版本]
Encephale. 2001 Nov-Dec;27(6):527-38.
9
A Descriptive, Cross-sectional Survey of Turkish Nurses' Knowledge of Pressure Ulcer Risk, Prevention, and Staging.一项关于土耳其护士对压疮风险、预防及分期知识的描述性横断面调查。
Ostomy Wound Manage. 2017 Jun;63(6):40-46.
10
Psychometrics of Multiple Choice Questions with Non-Functioning Distracters: Implications to Medical Education.具有无效干扰项的多项选择题的心理测量学:对医学教育的启示。
Indian J Physiol Pharmacol. 2015 Oct-Dec;59(4):428-35.

引用本文的文献

1
Use of Multiple-Select Multiple-Choice Items in a Dental Undergraduate Curriculum: Retrospective Study Involving the Application of Different Scoring Methods.牙科本科课程中多项选择多项选择题的使用:涉及不同评分方法应用的回顾性研究
JMIR Med Educ. 2023 Mar 27;9:e43792. doi: 10.2196/43792.
2
Evaluating Different Scoring Methods for Multiple Response Items Providing Partial Credit.评估针对提供部分分数的多项选择题的不同评分方法。
Educ Psychol Meas. 2022 Feb;82(1):151-176. doi: 10.1177/0013164421994636. Epub 2021 Feb 22.
3
Pediatricians' Knowledge of Emergency Management of Dental Injuries and Use of Mouthguards: A Cross-Sectional Survey.

本文引用的文献

1
Uncontrolled Web-based administration of surveys on factual health-related knowledge: a randomized study of untimed versus timed quizzing.基于网络的关于事实性健康相关知识的调查问卷的无控制管理:一项关于非限时测验与限时测验的随机研究。
J Med Internet Res. 2015 Apr 13;17(4):e94. doi: 10.2196/jmir.3734.
2
Nutritional knowledge in European adolescents: results from the HELENA (Healthy Lifestyle in Europe by Nutrition in Adolescence) study.欧洲青少年的营养知识:来自 HELENA(青少年营养与欧洲生活方式健康)研究的结果。
Public Health Nutr. 2011 Dec;14(12):2083-91. doi: 10.1017/S1368980011001352. Epub 2011 Aug 2.
3
Development and validation of Fibromyalgia Knowledge Questionnaire: FKQ.
儿科医生对牙外伤应急处理及护齿器使用的知识:一项横断面调查
Dent J (Basel). 2021 Dec 15;9(12):152. doi: 10.3390/dj9120152.
纤维肌痛知识问卷(FKQ)的编制与验证。
Rheumatol Int. 2012 Mar;32(3):655-62. doi: 10.1007/s00296-010-1627-7. Epub 2010 Dec 5.
4
Pick-N multiple choice-exams: a comparison of scoring algorithms.选择题组测验评分算法比较。
Adv Health Sci Educ Theory Pract. 2011 May;16(2):211-21. doi: 10.1007/s10459-010-9256-1. Epub 2010 Oct 31.
5
Improvement in nutrition-related knowledge and behaviour of urban Asian Indian school children: findings from the 'Medical education for children/Adolescents for Realistic prevention of obesity and diabetes and for healthy aGeing' ( MARG) intervention study.城市印度裔亚洲学童营养相关知识与行为的改善:“儿童/青少年医学教育以切实预防肥胖、糖尿病及促进健康老龄化”(MARG)干预研究的结果
Br J Nutr. 2010 Aug;104(3):427-36. doi: 10.1017/S0007114510000681. Epub 2010 Apr 7.
6
The development and validation of a Low Back Pain Knowledge Questionnaire - LKQ.腰痛知识问卷(LKQ)的编制与验证。
Clinics (Sao Paulo). 2009;64(12):1167-75. doi: 10.1590/S1807-59322009001200006.
7
Validity and reliability testing of a short questionnaire developed to assess consumers' use, understanding and perception of food labels.用于评估消费者对食品标签的使用、理解和看法的简短问卷的有效性和可靠性测试。
Eur J Clin Nutr. 2010 Feb;64(2):210-7. doi: 10.1038/ejcn.2009.126. Epub 2009 Nov 11.
8
Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.使用 G*Power 3.1 进行统计功效分析:相关和回归分析的检验。
Behav Res Methods. 2009 Nov;41(4):1149-60. doi: 10.3758/BRM.41.4.1149.
9
Nutrition knowledge, attitudes and fat intake: application of the theory of reasoned action.营养知识、态度与脂肪摄入量:理性行动理论的应用
J Hum Nutr Diet. 2007 Jun;20(3):159-69. doi: 10.1111/j.1365-277X.2007.00776.x.
10
The meaning and the measure of health literacy.健康素养的含义与衡量标准。
J Gen Intern Med. 2006 Aug;21(8):878-83. doi: 10.1111/j.1525-1497.2006.00540.x.