Possolo Antonio, Iyer Hari K
Statistical Engineering Division, Information Technology Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899-8980, USA.
Rev Sci Instrum. 2017 Jan;88(1):011301. doi: 10.1063/1.4974274.
Measurements involve comparisons of measured values with reference values traceable to measurement standards and are made to support decision-making. While the conventional definition of measurement focuses on quantitative properties (including ordinal properties), we adopt a broader view and entertain the possibility of regarding qualitative properties also as legitimate targets for measurement. A measurement result comprises the following: (i) a value that has been assigned to a property based on information derived from an experiment or computation, possibly also including information derived from other sources, and (ii) a characterization of the margin of doubt that remains about the true value of the property after taking that information into account. Measurement uncertainty is this margin of doubt, and it can be characterized by a probability distribution on the set of possible values of the property of interest. Mathematical or statistical models enable the quantification of measurement uncertainty and underlie the varied collection of methods available for uncertainty evaluation. Some of these methods have been in use for over a century (for example, as introduced by Gauss for the combination of mutually inconsistent observations or for the propagation of "errors"), while others are of fairly recent vintage (for example, Monte Carlo methods including those that involve Markov Chain Monte Carlo sampling). This contribution reviews the concepts, models, methods, and computations that are commonly used for the evaluation of measurement uncertainty, and illustrates their application in realistic examples drawn from multiple areas of science and technology, aiming to serve as a general, widely accessible reference.
测量涉及将测量值与可追溯至测量标准的参考值进行比较,其目的是为决策提供支持。虽然传统的测量定义侧重于定量属性(包括序数属性),但我们采用更宽泛的观点,并考虑将定性属性也视为合理的测量对象。一个测量结果包括以下内容:(i) 根据从实验或计算中获得的信息(可能还包括从其他来源获得的信息)赋予某一属性的值,以及 (ii) 在考虑该信息后,对该属性真实值仍存在的疑问程度的描述。测量不确定度就是这种疑问程度,它可以用感兴趣属性的可能值集合上的概率分布来表征。数学或统计模型能够对测量不确定度进行量化,并且是现有各种不确定度评估方法的基础。其中一些方法已经使用了一个多世纪(例如,高斯引入的用于相互不一致观测值的组合或 “误差” 传播的方法),而其他一些方法则是相当新的(例如,蒙特卡罗方法,包括那些涉及马尔可夫链蒙特卡罗抽样的方法)。本论文综述了常用于评估测量不确定度的概念、模型、方法和计算,并通过来自多个科学技术领域的实际例子说明它们的应用,旨在成为一份通用的、广泛可获取的参考资料。