Hripcsak George, Wilcox Adam
Department of Medical Informatics, Columbia University, New York, New York 10032, USA.
J Am Med Inform Assoc. 2002 Jan-Feb;9(1):1-15. doi: 10.1136/jamia.2002.0090001.
Medical informatics systems are often designed to perform at the level of human experts. Evaluation of the performance of these systems is often constrained by lack of reference standards, either because the appropriate response is not known or because no simple appropriate response exists. Even when performance can be assessed, it is not always clear whether the performance is sufficient or reasonable. These challenges can be addressed if an evaluator enlists the help of clinical domain experts. 1) The experts can carry out the same tasks as the system, and then their responses can be combined to generate a reference standard. 2)The experts can judge the appropriateness of system output directly. 3) The experts can serve as comparison subjects with which the system can be compared. These are separate roles that have different implications for study design, metrics, and issues of reliability and validity. Diagrams help delineate the roles of experts in complex study designs.
医学信息系统通常被设计为具备人类专家水平的性能。对这些系统性能的评估常常受到缺乏参考标准的限制,这要么是因为合适的响应未知,要么是因为不存在简单合适的响应。即使性能能够被评估,也不总是清楚该性能是否足够或合理。如果评估者寻求临床领域专家的帮助,这些挑战是可以解决的。1)专家可以执行与系统相同的任务,然后将他们的响应合并以生成参考标准。2)专家可以直接判断系统输出的适当性。3)专家可以作为与系统进行比较的对照对象。这些是不同的角色,对研究设计、指标以及可靠性和有效性问题具有不同的影响。图表有助于在复杂的研究设计中描绘专家的角色。