Academy of Forensic Science, 1347, West Guangfu Road, Shanghai 200063, PR China.
Academy of Forensic Science, 1347, West Guangfu Road, Shanghai 200063, PR China.
Forensic Sci Int. 2021 Aug;325:110869. doi: 10.1016/j.forsciint.2021.110869. Epub 2021 Jun 10.
Morphology-based classification of inkjet documents has the characteristics of low cost and high efficiency, but this method usually requires measurement and analysis of a large number of printed characters. This paper proposes a novel method for detecting the source of printed documents using a few printed letters. A dataset containing data pertaining to various inkjet printers, including 27 models of inkjets from HP, Canon, and Epson, and their printed documents were gathered. The specifications of the various brands and models of inkjets are summarised, and the characteristics of the microscopic appearance of the printheads are presented. Principal component analysis (PCA) of the variables was applied to describe the proximity between the specimens, and a two-dimensional kernel density estimation was used to describe the variation between and within printer brands and models. Then, specific cases were simulated by random sampling based on the collected inkjet dataset. Multivariate kernel density estimation was used to estimate the numerator and denominator probability distribution for computing the likelihood ratio (LR). The result of K-nearest neighbour analysis showed classification accuracy as high as 98%. The evaluation of the LR presented a significant result (EER=0, RMEP=0, RMED=0.07). This method helps to find a specific inkjet from even a few letters in the printed document for tactical purposes.
喷墨文档的基于形态学的分类具有成本低、效率高的特点,但这种方法通常需要测量和分析大量的打印字符。本文提出了一种使用少量打印字母来检测打印文档来源的新方法。收集了包含各种喷墨打印机数据的数据集,包括来自 HP、Canon 和 Epson 的 27 种喷墨打印机模型及其打印文档。总结了各种品牌和型号喷墨打印机的规格,并介绍了打印头微观外观的特点。应用主成分分析(PCA)来描述样本之间的接近程度,并使用二维核密度估计来描述打印机品牌和型号之间以及内部的变化。然后,根据收集的喷墨数据集进行随机抽样模拟特定情况。使用多元核密度估计来估计计算似然比(LR)的分子和分母概率分布。K-最近邻分析的结果显示分类准确率高达 98%。LR 的评估结果具有显著意义(EER=0,RMEP=0,RMED=0.07)。该方法有助于从打印文档中的几个字母中找到特定的喷墨打印机,这对于战术目的很有帮助。