McCullough P A, Smith G S
Center for Clinical Effectiveness, Henry Ford Health System, Detroit, MI 48202, USA.
Am J Ind Med. 1998 Aug;34(2):133-6. doi: 10.1002/(sici)1097-0274(199808)34:2<133::aid-ajim5>3.0.co;2-y.
This article reviews the analysis of a narrative text electronic search technique being used in the insurance industry. We reviewed a previously published study of motor vehicle crashes in roadway construction workzones as well as additional data supplied by the authors with respect to the methods of keyword selection. The narrative text search technique was evaluated with decision statistics and was found to have a sensitivity of 92.3%, 95% confidence interval 67.5%-99.6%. This range of sensitivity, at its most extreme value, led to a 32.5% underestimation of claims prevalence. Furthermore, because the electronic search developed two classification categories from a limited text field (approximately 20 words), only half of the cases had at least one classification. Systematic error estimates were used to obtain true population proportions for crash characteristics, revealing significant underestimations in costs. This analysis highlights the need for investigators to apply decision statistics to narrative text searching techniques when they are used essentially as diagnostic test procedures on insurance claims datasets.
本文回顾了保险行业中正在使用的一种叙述性文本电子搜索技术的分析。我们回顾了之前发表的一项关于道路施工工作区机动车碰撞事故的研究,以及作者提供的关于关键词选择方法的额外数据。使用决策统计对叙述性文本搜索技术进行了评估,发现其敏感度为92.3%,95%置信区间为67.5%-99.6%。这个敏感度范围在其最极端值时,导致索赔发生率被低估了32.5%。此外,由于电子搜索从有限的文本字段(约20个单词)中开发了两个分类类别,只有一半的案例有至少一个分类。使用系统误差估计来获取碰撞特征的真实总体比例,结果显示成本被严重低估。该分析强调,当叙述性文本搜索技术基本上用作保险索赔数据集的诊断测试程序时,调查人员需要对其应用决策统计。