Salk Institute for Biological Studies, La Jolla, CA 92093, U.S.A.
Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92037, U.S.A.
Neural Comput. 2023 Feb 17;35(3):309-342. doi: 10.1162/neco_a_01563.
Large language models (LLMs) have been transformative. They are pretrained foundational models that are self-supervised and can be adapted with fine-tuning to a wide range of natural language tasks, each of which previously would have required a separate network model. This is one step closer to the extraordinary versatility of human language. GPT-3 and, more recently, LaMDA, both of them LLMs, can carry on dialogs with humans on many topics after minimal priming with a few examples. However, there has been a wide range of reactions and debate on whether these LLMs understand what they are saying or exhibit signs of intelligence. This high variance is exhibited in three interviews with LLMs that reached wildly different conclusions. A new possibility was uncovered that could explain this divergence. What appears to be intelligence in LLMs may in fact be a mirror that reflects the intelligence of the interviewer, a remarkable twist that could be considered a reverse Turing test. If so, then by studying interviews, we may be learning more about the intelligence and beliefs of the interviewer than the intelligence of the LLMs. As LLMs become more capable, they may transform the way we interact with machines and how they interact with each other. Increasingly, LLMs are being coupled with sensorimotor devices. LLMs can talk the talk, but can they walk the walk? A road map for achieving artificial general autonomy is outlined with seven major improvements inspired by brain systems, along with how LLMs could in turn be used to uncover new insights into brain function.