Suppr超能文献

一种用于学习层次概念的信息论评分。

An information theoretic score for learning hierarchical concepts.

作者信息

Madani Omid

机构信息

Cisco Secure Workload, Cisco, San Jose, CA, United States.

出版信息

Front Comput Neurosci. 2023 May 2;17:1082502. doi: 10.3389/fncom.2023.1082502. eCollection 2023.

Abstract

How do humans learn the regularities of their complex noisy world in a robust manner? There is ample evidence that much of this learning and development occurs in an unsupervised fashion interactions with the environment. Both the structure of the world as well as the brain appear hierarchical in a number of ways, and structured hierarchical representations offer potential benefits for efficient learning and organization of knowledge, such as concepts (patterns) sharing parts (subpatterns), and for providing a foundation for symbolic computation and language. A major question arises: what drives the processes behind acquiring such hierarchical spatiotemporal concepts? We posit that the goal of advancing one's predictions is a major driver for learning such hierarchies and introduce an information-theoretic score that shows promise in guiding the processes, and, in particular, motivating the learner to build larger concepts. We have been exploring the challenges of building an integrated learning and developing system within the framework of , wherein concepts serve as (1) predictors, (2) targets of prediction, and (3) building blocks for future higher-level concepts. Our current implementation works on raw text: it begins at a low level, such as characters, which are the hardwired or primitive concepts, and grows its vocabulary of networked hierarchical concepts over time. Concepts are strings or n-grams in our current realization, but we hope to relax this limitation, e.g., to a larger subclass of finite automata. After an overview of the current system, we focus on the score, named CORE. CORE is based on comparing the prediction performance of the system with a simple baseline system that is limited to predicting with the primitives. CORE incorporates a tradeoff between how strongly a concept is predicted (or how well it fits its context, i.e., nearby predicted concepts) vs. how well it matches the (ground) "reality," i.e., the lowest level observations (the characters in the input episode). CORE is applicable to generative models such as probabilistic finite state machines (beyond strings). We highlight a few properties of CORE with examples. The learning is scalable and open-ended. For instance, thousands of concepts are learned after hundreds of thousands of episodes. We give examples of what is learned, and we also empirically compare with transformer neural networks and n-gram language models to situate the current implementation with respect to state-of-the-art and to further illustrate the similarities and differences with existing techniques. We touch on a variety of challenges and promising future directions in advancing the approach, in particular, the challenge of learning concepts with a more sophisticated structure.

摘要

人类如何稳健地学习其复杂多变的世界中的规律?有充分证据表明,这种学习和发展大多以无监督的方式发生——通过与环境的互动。世界的结构和大脑在许多方面似乎都是分层的,结构化的分层表示为高效学习和知识组织提供了潜在的好处,例如概念(模式)共享部分(子模式),并为符号计算和语言提供基础。一个主要问题出现了:是什么驱动了获取这种分层时空概念背后的过程?我们认为,提高预测能力的目标是学习这种层次结构的主要驱动力,并引入了一种信息论分数,该分数有望指导这些过程,特别是激励学习者构建更大的概念。我们一直在探索在[具体框架]内构建一个集成学习和发展系统的挑战,其中概念充当(1)预测器,(2)预测目标,以及(3)未来更高层次概念的构建块。我们当前的实现基于原始文本:它从较低层次开始,例如字符,这些是硬编码的或原始的概念,并随着时间的推移增加其网络化分层概念的词汇量。在我们当前的实现中,概念是字符串或n元语法,但我们希望放宽这一限制,例如,扩展到有限自动机的更大子类。在概述当前系统之后,我们将重点关注名为CORE的分数。CORE基于将系统的预测性能与一个简单的基线系统进行比较,该基线系统仅限于使用原语进行预测。CORE在一个概念被预测的强度(或者它与上下文的匹配程度,即附近预测的概念)与它与(基础的)“现实”的匹配程度之间进行权衡,即最低层次的观察结果(输入情节中的字符)。CORE适用于诸如概率有限状态机(超越字符串)之类的生成模型。我们通过示例突出了CORE的一些属性。学习是可扩展的且是开放式的。例如,在数十万次情节之后学习了数千个概念。我们给出所学内容的示例,并且我们还通过实证与变压器神经网络和n元语法语言模型进行比较,以便将当前实现与现有技术进行定位,并进一步说明与现有技术的异同。我们探讨了推进该方法过程中的各种挑战和有前景的未来方向,特别是学习具有更复杂结构的概念的挑战。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd3c/10185805/d39a2ecb83d7/fncom-17-1082502-g0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验