Hu Shengguo, Li Mingyi, Xu Jiawen, Zhang Hongrui, Zhang Shanghang, Cui Tie Jun, Del Hougne Philipp, Li Lianlin
State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.
National Key Laboratory for Multimedia Information Processing, School of Computer Science, Peking University, Beijing, 100871, China.
Light Sci Appl. 2025 Jan 1;14(1):12. doi: 10.1038/s41377-024-01678-w.
Metamaterials have revolutionized wave control; in the last two decades, they evolved from passive devices via programmable devices to sensor-endowed self-adaptive devices realizing a user-specified functionality. Although deep-learning techniques play an increasingly important role in metamaterial inverse design, measurement post-processing and end-to-end optimization, their role is ultimately still limited to approximating specific mathematical relations; the metamaterial is still limited to serving as proxy of a human operator, realizing a predefined functionality. Here, we propose and experimentally prototype a paradigm shift toward a metamaterial agent (coined metaAgent) endowed with reasoning and cognitive capabilities enabling the autonomous planning and successful execution of diverse long-horizon tasks, including electromagnetic (EM) field manipulations and interactions with robots and humans. Leveraging recently released foundation models, metaAgent reasons in high-level natural language, acting upon diverse prompts from an evolving complex environment. Specifically, metaAgent's cerebrum performs high-level task planning in natural language via a multi-agent discussion mechanism, where agents are domain experts in sensing, planning, grounding, and coding. In response to live environmental feedback within a real-world setting emulating an ambient-assisted living context (including human requests in natural language), our metaAgent prototype self-organizes a hierarchy of EM manipulation tasks in conjunction with commanding a robot. metaAgent masters foundational EM manipulation skills related to wireless communications and sensing, and it memorizes and learns from past experience based on human feedback.
超材料彻底改变了波的控制方式;在过去二十年中,它们从无源器件发展到可编程器件,再到具备传感器的自适应器件,实现了用户指定的功能。尽管深度学习技术在超材料逆向设计、测量后处理和端到端优化中发挥着越来越重要的作用,但其作用最终仍局限于逼近特定的数学关系;超材料仍局限于充当人类操作员的代理,实现预定义的功能。在此,我们提出并通过实验验证了一种向超材料智能体(称为metaAgent)的范式转变,该智能体具备推理和认知能力,能够自主规划并成功执行各种长期任务,包括电磁场操纵以及与机器人和人类的交互。利用最近发布的基础模型,metaAgent以高级自然语言进行推理,根据不断变化的复杂环境中的各种提示采取行动。具体而言,metaAgent的大脑通过多智能体讨论机制以自然语言执行高级任务规划,其中各智能体是传感、规划、基础和编码方面的领域专家。在模拟环境辅助生活场景(包括自然语言形式的人类请求)的现实世界环境中,响应实时环境反馈时,我们的metaAgent原型结合指挥机器人自主组织电磁场操纵任务的层次结构。metaAgent掌握与无线通信和传感相关的基础电磁场操纵技能,并根据人类反馈记忆和学习过往经验。