Luby Amanda
Department of Mathematics & Statistics, Swarthmore College, USA.
Forensic Sci Int Synerg. 2023 Jun 30;7:100340. doi: 10.1016/j.fsisyn.2023.100340. eCollection 2023.
In recent years, 'black box' studies in forensic science have emerged as the preferred way to provide information about the overall validity of forensic disciplines in practice. These studies provide aggregated error rates over many examiners and comparisons, but errors are not equally likely on all comparisons. Furthermore, inconclusive responses are common and vary across examiners and comparisons, but do not fit neatly into the error rate framework. This work introduces Item Response Theory (IRT) and variants for the forensic setting to account for these two issues. In the IRT framework, participant proficiency and item difficulty are estimated directly from the responses, which accounts for the different subsets of items that participants often answer. By incorporating a decision-tree framework into the model, inconclusive responses are treated as a distinct cognitive process, which allows inter-examiner differences to be estimated directly. The IRT-based model achieves superior predictive performance over standard logistic regression techniques, produces item effects that are consistent with common sense and prior work, and demonstrates that most of the variability among fingerprint examiner decisions occurs at the latent print evaluation stage and as a result of differing tendencies to make inconclusive decisions.
近年来,法医学中的“黑匣子”研究已成为提供法医学科在实际应用中整体有效性信息的首选方式。这些研究提供了许多鉴定人员和比对情况下的汇总错误率,但并非所有比对出现错误的可能性都相同。此外,不确定的结果很常见,且因鉴定人员和比对情况而异,但并不完全符合错误率框架。这项工作引入了项目反应理论(IRT)及其在法医环境中的变体,以解决这两个问题。在IRT框架中,参与者的能力和项目难度是直接从回答中估计出来的,这考虑了参与者经常回答的不同项目子集。通过将决策树框架纳入模型,不确定的回答被视为一个独特的认知过程,这使得可以直接估计鉴定人员之间的差异。基于IRT的模型比标准逻辑回归技术具有更好的预测性能,产生的项目效应与常识和先前的研究一致,并表明指纹鉴定人员决策中的大多数变异性发生在潜指纹评估阶段,并且是由于做出不确定决策的不同倾向所致。