Wass V, Jones R, Van der Vleuten C
Department of General Practice and Primary Care, Guy's, King's and St Thomas' School of Medicine, London, UK.
Med Educ. 2001 Apr;35(4):321-5. doi: 10.1046/j.1365-2923.2001.00928.x.
In undergraduate clinical examinations, the use of real patients as long cases is being replaced by objective structured clinical examinations (OSCEs) which use simulated scenarios, although we lack published psychometric data on long cases to support the move from real to simulated patients.
To assess candidate performance across two history-taking long cases to estimate the number of cases required for a reliable assessment. Results are compared with psychometric data from an OSCE.
A final-year qualifying undergraduate clinical examination.
Two observed history-taking long cases were included, alongside an OSCE. Candidates interviewed two unstandardized real patients. The history-taking part (14 minutes) was observed, uninterrupted, by examiner(s) who assessed data gathering, interviewing, and diagnostic and management skills. The presentation (7 minutes) was unstructured; the examiner(s) intervened as appropriate. Marks were expressed as a percentage of the total possible score and analysed using generalizability theory to estimate intercase reliability.
Two examiner pairs independently rated both long cases for 79 (36.7%) of the 214 candidates. Projections based on generalizability theory showed that 10 20-minute cases would give reliabilities of 0.84 for single-marked and 0.88 for double-marked candidates, compared with a projected reliability of 0.73 for the same 214 candidates taking the OSCE.
If history-taking long cases are observed, three-and-a-half hours of testing time using 10 unstandardized patients would produce a reliable test. Long cases therefore are, in terms of reliability, no worse and no better than OSCEs in assessing clinical competence.
在本科临床考试中,使用真实患者进行长病例考试正被采用模拟场景的客观结构化临床考试(OSCE)所取代,尽管我们缺乏关于长病例的已发表心理测量数据来支持从真实患者转向模拟患者。
评估考生在两个病史采集长病例中的表现,以估计进行可靠评估所需的病例数量。将结果与OSCE的心理测量数据进行比较。
本科最后一年的资格临床考试。
包括两个观察病史采集的长病例以及一个OSCE。考生采访两名未标准化的真实患者。病史采集部分(14分钟)由考官不间断地观察,考官评估数据收集、访谈以及诊断和管理技能。陈述部分(7分钟)是非结构化的;考官酌情进行干预。分数以总分的百分比表示,并使用概化理论进行分析以估计病例间的信度。
两组考官对214名考生中的79名(36.7%)独立评定了两个长病例。基于概化理论的预测表明,对于单评分考生,10个20分钟的病例信度为0.84,双评分考生为0.88,而对于参加OSCE的相同214名考生,预测信度为0.73。
如果观察病史采集长病例,使用10名未标准化患者进行三个半小时的测试时间将产生可靠的测试。因此,就信度而言,长病例在评估临床能力方面并不比OSCE差,也不比其好。