Austvoll-Dahlgren Astrid, Guttersrud Øystein, Nsangi Allen, Semakula Daniel, Oxman Andrew D
Norwegian Institute of Public Health, Oslo, Norway.
Norwegian Centre for Science Education, University of Oslo, Oslo, Norway.
BMJ Open. 2017 May 25;7(5):e013185. doi: 10.1136/bmjopen-2016-013185.
The Claim Evaluation Tools database contains multiple-choice items for measuring people's ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable.
To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis.
We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis.
Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty.
Most of the items conformed well to the Rasch model's expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims.
索赔评估工具数据库包含多项选择题,用于衡量人们应用关键概念以评估治疗索赔的能力。我们使用拉施分析对数据库中的题目进行评估,以开发一种将在乌干达的两项随机试验中使用的结果测量方法。拉施分析是一种基于项目反应理论的心理测量测试形式。它是开发有效且可靠的结果测量方法的动态方式。
使用拉施分析评估涉及22个关键概念的88个题目的有效性、可靠性和反应性。
我们用英语向乌干达和挪威的1114人发放了四套多项选择题,其中685人为儿童,429人为成年人(包括171名卫生专业人员)。我们对所有题目进行二分计分。我们使用RUMM2030分析软件包探索汇总和个体拟合统计量。我们使用SPSS进行干扰项分析。
大多数题目与拉施模型拟合良好,但有些题目需要修订。总体而言,这四套题目具有令人满意的可靠性。我们未发现任何题目对之间存在显著的反应依赖性,总体而言,数据中的多维性程度是可以接受的。这些题目难度较高。
大多数题目与拉施模型的预期拟合良好。在对一些题目进行修订后,我们得出结论,大多数题目适用于评估儿童或成年人评估治疗索赔能力的结果测量方法。