Dass Sarat C, Zhu Yongfang, Jain Anil K
Department of Statistics & Probability, Michigan State University, East Lansing, MI 48824, USA.
IEEE Trans Pattern Anal Mach Intell. 2006 Dec;28(12):1902-319. doi: 10.1109/TPAMI.2006.255.
Authentication systems based on biometric features (e.g., fingerprint impressions, iris scans, human face images, etc.) are increasingly gaining widespread use and popularity. Often, vendors and owners of these commercial biometric systems claim impressive performance that is estimated based on some proprietary data. In such situations, there is a need to independently validate the claimed performance levels. System performance is typically evaluated by collecting biometric templates from n different subjects, and for convenience, acquiring multiple instances of the biometric for each of the n subjects. Very little work has been done in 1) constructing confidence regions based on the ROC curve for validating the claimed performance levels and 2) determining the required number of biometric samples needed to establish confidence regions of prespecified width for the ROC curve. To simplify the analysis that address these two problems, several previous studies have assumed that multiple acquisitions of the biometric entity are statistically independent. This assumption is too restrictive and is generally not valid. We have developed a validation technique based on multivariate copula models for correlated biometric acquisitions. Based on the same model, we also determine the minimum number of samples required to achieve confidence bands of desired width for the ROC curve. We illustrate the estimation of the confidence bands as well as the required number of biometric samples using a fingerprint matching system that is applied on samples collected from a small population.
基于生物特征(如指纹印记、虹膜扫描、人脸图像等)的认证系统越来越广泛地得到应用并受到欢迎。通常,这些商业生物特征系统的供应商和所有者声称基于一些专有数据得出的性能令人印象深刻。在这种情况下,需要独立验证所声称的性能水平。系统性能通常通过从n个不同主体收集生物特征模板来评估,为方便起见,为n个主体中的每一个获取多个生物特征实例。在以下两方面所做的工作很少:1)基于ROC曲线构建置信区域以验证所声称的性能水平;2)确定为ROC曲线建立指定宽度的置信区域所需的生物特征样本数量。为了简化针对这两个问题的分析,先前的一些研究假设生物特征实体的多次获取在统计上是独立的。这个假设过于严格,通常是无效的。我们开发了一种基于多元Copula模型的验证技术,用于相关生物特征获取。基于同一模型,我们还确定了为ROC曲线实现所需宽度的置信带所需的最小样本数量。我们使用一个应用于从一小群人收集的样本的指纹匹配系统来说明置信带的估计以及所需的生物特征样本数量。