Hössjer Ola, Díaz-Pachón Daniel Andrés, Rao J Sunil
Department of Mathematics, Stockholm University, SE-106 91 Stockholm, Sweden.
Division of Biostatistics, University of Miami, Miami, FL 33136, USA.
Entropy (Basel). 2022 Oct 14;24(10):1469. doi: 10.3390/e24101469.
Philosophers frequently define knowledge as justified, true belief. We built a mathematical framework that makes it possible to define learning (increasing number of true beliefs) and knowledge of an agent in precise ways, by phrasing belief in terms of epistemic probabilities, defined from Bayes' rule. The degree of true belief is quantified by means of active information I+: a comparison between the degree of belief of the agent and a completely ignorant person. Learning has occurred when either the agent's strength of belief in a true proposition has increased in comparison with the ignorant person (I+>0), or the strength of belief in a false proposition has decreased (I+<0). Knowledge additionally requires that learning occurs for the right reason, and in this context we introduce a framework of parallel worlds that correspond to parameters of a statistical model. This makes it possible to interpret learning as a hypothesis test for such a model, whereas knowledge acquisition additionally requires estimation of a true world parameter. Our framework of learning and knowledge acquisition is a hybrid between frequentism and Bayesianism. It can be generalized to a sequential setting, where information and data are updated over time. The theory is illustrated using examples of coin tossing, historical and future events, replication of studies, and causal inference. It can also be used to pinpoint shortcomings of machine learning, where typically learning rather than knowledge acquisition is in focus.
哲学家们常常将知识定义为有合理依据的真信念。我们构建了一个数学框架,通过根据贝叶斯规则定义的认知概率来表述信念,从而能够以精确的方式定义主体的学习(真信念数量的增加)和知识。真信念的程度通过主动信息I +来量化:主体的信念程度与一个完全无知的人的信念程度之间的比较。当主体对真命题的信念强度相对于无知的人有所增加(I +> 0),或者对假命题的信念强度有所降低(I +<0)时,就发生了学习。知识还要求学习是出于正确的原因,在此背景下,我们引入了一个与统计模型参数相对应的平行世界框架。这使得将学习解释为对这样一个模型的假设检验成为可能,而知识获取还需要对真实世界参数进行估计。我们的学习和知识获取框架是频率主义和贝叶斯主义的混合体。它可以推广到一个序列设置中,其中信息和数据会随着时间更新。该理论通过抛硬币、历史和未来事件、研究的重复以及因果推断等例子进行说明。它还可用于指出机器学习的缺点,机器学习通常关注的是学习而非知识获取。