Morrison Geoffrey Stewart, Poh Norman
Forensic Speech Science Laboratory, Centre for Forensic Linguistics, Aston University, Birmingham, England, United Kingdom; Isaac Newton Institute for Mathematical Sciences, Cambridge, England, United Kingdom.
Department of Computer Science, University of Surrey, Guildford, England, United Kingdom; QuintilesIMS, London, England, United Kingdom.
Sci Justice. 2018 May;58(3):200-218. doi: 10.1016/j.scijus.2017.12.005. Epub 2017 Dec 22.
When strength of forensic evidence is quantified using sample data and statistical models, a concern may be raised as to whether the output of a model overestimates the strength of evidence. This is particularly the case when the amount of sample data is small, and hence sampling variability is high. This concern is related to concern about precision. This paper describes, explores, and tests three procedures which shrink the value of the likelihood ratio or Bayes factor toward the neutral value of one. The procedures are: (1) a Bayesian procedure with uninformative priors, (2) use of empirical lower and upper bounds (ELUB), and (3) a novel form of regularized logistic regression. As a benchmark, they are compared with linear discriminant analysis, and in some instances with non-regularized logistic regression. The behaviours of the procedures are explored using Monte Carlo simulated data, and tested on real data from comparisons of voice recordings, face images, and glass fragments.
当使用样本数据和统计模型对法医证据的强度进行量化时,可能会有人担心模型的输出是否高估了证据的强度。当样本数据量较小时,情况尤其如此,此时抽样变异性较高。这种担忧与对精度的担忧有关。本文描述、探讨并测试了三种将似然比或贝叶斯因子的值向中性值1收缩的程序。这些程序是:(1)具有无信息先验的贝叶斯程序,(2)使用经验下限和上限(ELUB),以及(3)一种新型的正则化逻辑回归。作为基准,将它们与线性判别分析进行比较,在某些情况下还与非正则化逻辑回归进行比较。使用蒙特卡罗模拟数据探索这些程序的行为,并在语音记录、面部图像和玻璃碎片比较的真实数据上进行测试。