Hu Shengguo, Xu Jiawen, Li Mingyi, Cui Tie Jun, Li Lianlin
State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing 100871, China.
State Key Laboratory of Millimeter Waves, Southeast University, Nanjing 210096, China.
Nanophotonics. 2024 Jan 4;13(12):2213-2222. doi: 10.1515/nanoph-2023-0646. eCollection 2024 May.
Programmable metasurface has become a prominent tool in various areas including control, communication, computing, and so on, due to its unique capability in the electromagnetic (EM) manipulation. However, it is lack of the intelligence in the sense that it usually requires the manual intervention, and thus makes it hard to behavior as the human process. To endow the programmable metasurface with the intelligence, we here proposed the concept of the language-controllable programmable metasurface for autonomous EM manipulations by exploring the notable capability of large language models (LLMs) in attaining the human-like intelligence. We have established a proof-of-principle system of language-controllable programmable metasurface, where, for illustration, the programmable metasurface is designed to have 32 × 24 binary electronically controllable meta-atoms and work at around 5.5 GHz. In addition, we have constructed a visual-semantic map to facilitate the language-controllable EM manipulation in three-dimensional (3D) physical environments. We have experimentally demonstrated that our language-controllable programmable metasurface is capable of decomposing autonomously an ambiguous task of EM manipulation into a sequence of executable ones and implementing them individually in real-world indoor settings. We expect that the presented strategy could hold promising potential in pushing programmable metasurfaces towards human-level autonomous agents, which are capable of accomplishing the smart EM-involved multi-modality manipulations through self-directed planning and actions.
可编程超表面由于其在电磁操纵方面的独特能力,已成为包括控制、通信、计算等在内的各个领域的重要工具。然而,它缺乏智能,因为它通常需要人工干预,因此很难像人类一样运作。为了赋予可编程超表面智能,我们在此提出了语言可控可编程超表面的概念,通过探索大语言模型(LLMs)在实现类人智能方面的显著能力,实现自主电磁操纵。我们建立了一个语言可控可编程超表面的原理验证系统,例如,可编程超表面设计为具有32×24个二进制电控元原子,并在5.5GHz左右工作。此外,我们构建了一个视觉语义映射,以促进在三维(3D)物理环境中的语言可控电磁操纵。我们通过实验证明,我们的语言可控可编程超表面能够将电磁操纵的模糊任务自主分解为一系列可执行任务,并在现实世界的室内环境中单独实现它们。我们期望所提出的策略在推动可编程超表面向人类级自主智能体发展方面具有广阔的潜力,这些智能体能够通过自我指导的规划和行动完成涉及智能电磁的多模态操纵。