Alkhairy Ibrahim, Low-Choy Samantha, Murray Justine, Wang Junhu, Pettitt Anthony
Griffith School of Information and Communication Technology, Science Faculty, Griffith University, Gold Coast Campus, Southport, QLD, Australia.
Department of Mathematics, Al Qunfudhah University College, Umm Al Qura University, Makkah, Saudi Arabia.
J Appl Stat. 2019 Dec 3;47(10):1848-1884. doi: 10.1080/02664763.2019.1697651. eCollection 2020.
Bayesian networks are now widespread for modelling uncertain knowledge. They graph probabilistic relationships, which are quantified using conditional probability tables (CPTs). When empirical data are unavailable, experts may specify CPTs. Here we propose novel methodology for quantifying CPTs: a Bayesian statistical approach to both elicitation and encoding of expert-specified probabilities, in a way that acknowledges their uncertainty. We illustrate this new approach using a case study describing habitat most at risk from feral pigs. For complicated CPTs, it is difficult to elicit all scenarios (CPT entries). Like the CPT Calculator software program, we ask about a few scenarios (e.g. under a one-factor-at-a-time design) to reduce the experts' workload. Unlike CPT Calculator, we adopt a global rather than local regression to 'fill out' CPT entries. Unlike other methods for scenario-based elicitation for regression, we capture uncertainty about each probability in a sequence that explicitly controls biases and enhances interpretation. Furthermore, to utilize all elicited information, we introduce Bayesian rather than Classical generalised linear modelling (GLM). For large CPTs (e.g. >3 levels per parent) we show Bayesian GLM supports richer inference, particularly on interactions, even with few scenarios, providing more information regarding accuracy of encoding.
贝叶斯网络如今在对不确定知识进行建模方面广泛应用。它们以图形方式展示概率关系,这些关系通过条件概率表(CPT)进行量化。当缺乏经验数据时,专家可以指定CPT。在此,我们提出了一种量化CPT的新方法:一种贝叶斯统计方法,用于专家指定概率的引出和编码,同时承认其不确定性。我们通过一个描述受野猪威胁最大的栖息地的案例研究来说明这种新方法。对于复杂的CPT,很难引出所有情况(CPT条目)。与CPT计算器软件程序一样,我们询问一些情况(例如在一次一个因素的设计下)以减轻专家的工作量。与CPT计算器不同的是,我们采用全局回归而非局部回归来“填充”CPT条目。与其他基于情况的回归引出方法不同,我们在一个明确控制偏差并增强解释的序列中捕捉每个概率的不确定性。此外,为了利用所有引出的信息,我们引入贝叶斯广义线性模型(GLM)而非经典GLM。对于大型CPT(例如每个父节点有超过3个级别),我们表明贝叶斯GLM支持更丰富的推断,特别是在交互作用方面,即使情况很少,也能提供更多关于编码准确性的信息。