• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于单操作员双手遥操作的大语言模型驱动的自然语言交互控制框架。

Large language model-driven natural language interaction control framework for single-operator bimanual teleoperation.

作者信息

Fei Haolin, Xue Tao, He Yiyang, Lin Sheng, Du Guanglong, Guo Yao, Wang Ziwei

机构信息

School of Engineering, Lancaster University, Lancaster, United Kingdom.

Department of Automation, Tsinghua University, Beijing, China.

出版信息

Front Robot AI. 2025 Jul 17;12:1621033. doi: 10.3389/frobt.2025.1621033. eCollection 2025.

DOI:10.3389/frobt.2025.1621033
PMID:40747446
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12310453/
Abstract

Bimanual teleoperation imposes cognitive and coordination demands on a single human operator tasked with simultaneously controlling two robotic arms. Although assigning each arm to a separate operator can distribute workload, it often leads to ambiguities in decision authority and degrades overall efficiency. To overcome these challenges, we propose a novel bimanual teleoperation large language model assistant (BTLA) framework, an intelligent co-pilot that augments a single operator's motor control capabilities. In particular, BTLA enables operators to directly control one robotic arm through conventional teleoperation while directing a second assistive arm via simple voice commands, and therefore commanding two robotic arms simultaneously. By integrating the GPT-3.5-turbo model, BTLA interprets contextual voice instructions and autonomously selects among six predefined manipulation skills, including real-time mirroring, trajectory following, and autonomous object grasping. Experimental evaluations in bimanual object manipulation tasks demonstrate that BTLA increased task coverage by 76.1 and success rate by 240.8 relative to solo teleoperation, and outperformed dyadic control with a 19.4 gain in coverage and a 69.9 gain in success. Furthermore, NASA Task Load Index (NASA-TLX) assessments revealed a 38-52 reduction in operator mental workload, and 85 of participants rated the voice-based interaction as "natural" and "highly effective."

摘要

双手遥操作对负责同时控制两个机器人手臂的单个操作人员提出了认知和协调方面的要求。虽然将每个手臂分配给单独的操作人员可以分散工作量,但这往往会导致决策权限不明确,并降低整体效率。为了克服这些挑战,我们提出了一种新颖的双手遥操作大语言模型助手(BTLA)框架,这是一种智能副驾驶,可增强单个操作人员的运动控制能力。具体而言,BTLA使操作人员能够通过传统遥操作直接控制一个机器人手臂,同时通过简单的语音命令指挥第二个辅助手臂,从而同时指挥两个机器人手臂。通过集成GPT-3.5-turbo模型,BTLA解释上下文语音指令,并在六种预定义的操作技能中自主选择,包括实时镜像、轨迹跟踪和自主物体抓取。在双手物体操作任务中的实验评估表明,相对于单人遥操作,BTLA将任务覆盖率提高了76.1%,成功率提高了240.8%,并且在覆盖率方面比二元控制提高了19.4%,在成功率方面提高了69.9%。此外,美国国家航空航天局任务负荷指数(NASA-TLX)评估显示,操作人员的心理工作量减少了38%-52%,85%的参与者将基于语音的交互评为“自然”和“高效”。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/3a5f4cfdc4e9/frobt-12-1621033-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/a1071a5d54b5/frobt-12-1621033-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/7c42d1e48180/frobt-12-1621033-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/00aefdaf709d/frobt-12-1621033-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/e490a30549a6/frobt-12-1621033-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/86076cef54b7/frobt-12-1621033-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/af4e3bbc807f/frobt-12-1621033-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/0f88bd4c3a49/frobt-12-1621033-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/f56874aa3902/frobt-12-1621033-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/3a5f4cfdc4e9/frobt-12-1621033-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/a1071a5d54b5/frobt-12-1621033-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/7c42d1e48180/frobt-12-1621033-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/00aefdaf709d/frobt-12-1621033-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/e490a30549a6/frobt-12-1621033-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/86076cef54b7/frobt-12-1621033-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/af4e3bbc807f/frobt-12-1621033-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/0f88bd4c3a49/frobt-12-1621033-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/f56874aa3902/frobt-12-1621033-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71cd/12310453/3a5f4cfdc4e9/frobt-12-1621033-g009.jpg

相似文献

1
Large language model-driven natural language interaction control framework for single-operator bimanual teleoperation.用于单操作员双手遥操作的大语言模型驱动的自然语言交互控制框架。
Front Robot AI. 2025 Jul 17;12:1621033. doi: 10.3389/frobt.2025.1621033. eCollection 2025.
2
Short-Term Memory Impairment短期记忆障碍
3
Immersive Teleoperation via Collaborative Device-Agnostic Interfaces for Smart Haptics: A Study on Operational Efficiency and Cognitive Overflow for Industrial Assistive Applications.通过用于智能触觉的协作式设备无关接口实现沉浸式远程操作:工业辅助应用中的操作效率与认知溢出研究
Sensors (Basel). 2025 Jun 26;25(13):3993. doi: 10.3390/s25133993.
4
A New Method for Inducing Mental Fatigue: A High Mental Workload Task Paradigm Based on Complex Cognitive Abilities and Time Pressure.一种诱导精神疲劳的新方法:基于复杂认知能力和时间压力的高精神负荷任务范式
Brain Sci. 2025 May 22;15(6):541. doi: 10.3390/brainsci15060541.
5
Intelligent control of robotic X-ray devices using a language-promptable digital twin.使用语言可提示数字孪生对机器人X射线设备进行智能控制。
Int J Comput Assist Radiol Surg. 2025 Apr 9. doi: 10.1007/s11548-025-03351-y.
6
Menstrual Health Education Using a Specialized Large Language Model in India: Development and Evaluation Study of MenstLLaMA.在印度使用专门的大语言模型进行月经健康教育:MenstLLaMA的开发与评估研究
J Med Internet Res. 2025 Jul 16;27:e71977. doi: 10.2196/71977.
7
Enabling by voice: an exploratory study on how interactive smart agents (ISAs) can change the design of environmental control (EC) equipment and service.语音启用:关于交互式智能代理(ISA)如何改变环境控制(EC)设备及服务设计的探索性研究。
Disabil Rehabil Assist Technol. 2025 Jul 23:1-30. doi: 10.1080/17483107.2025.2530195.
8
Unveiling GPT-4V's hidden challenges behind high accuracy on USMLE questions: Observational Study.揭示GPT-4V在美国医师执照考试(USMLE)问题上高精度背后的隐藏挑战:观察性研究。
J Med Internet Res. 2025 Feb 7;27:e65146. doi: 10.2196/65146.
9
Training Laparoscopic Surgeons: Assessing Workload and Skill Using Nasa-Tlx.培训腹腔镜外科医生:使用美国国家航空航天局任务负荷指数评估工作量和技能
J Surg Educ. 2025 Sep;82(9):103588. doi: 10.1016/j.jsurg.2025.103588. Epub 2025 Jul 10.
10
A Scoping Review of the Observed and Perceived Functional Impacts Associated With Language and Learning Disorders in School-Aged Children.一项关于学龄儿童语言和学习障碍相关的观察到的和感知到的功能影响的范围综述。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70086. doi: 10.1111/1460-6984.70086.

本文引用的文献

1
A Haptic Shared Autonomy With Partial Orientation Regulation for DoF Deficiency in Remote Side.针对远程侧自由度缺失,采用具有部分方向调节的触觉共享自主控制方法。
IEEE Trans Haptics. 2023 Jan-Mar;16(1):86-95. doi: 10.1109/TOH.2023.3239602.
2
The Classification and New Trends of Shared Control Strategies in Telerobotic Systems: A Survey.遥操作机器人系统中共享控制策略的分类及新趋势:综述。
IEEE Trans Haptics. 2023 Apr-Jun;16(2):118-133. doi: 10.1109/TOH.2023.3253856. Epub 2023 Jun 20.
3
A Novel Training and Collaboration Integrated Framework for Human-Agent Teleoperation.
一种新型的人机远程操作培训与协作集成框架。
Sensors (Basel). 2021 Dec 14;21(24):8341. doi: 10.3390/s21248341.
4
A Three-Limb Teleoperated Robotic System with Foot Control for Flexible Endoscopic Surgery.三肢遥操作机器人系统,带足部控制,用于柔性内窥镜手术。
Ann Biomed Eng. 2021 Sep;49(9):2282-2296. doi: 10.1007/s10439-021-02766-3. Epub 2021 Apr 8.
5
Shared control-based bimanual robot manipulation.基于共享控制的双手机器人操作。
Sci Robot. 2019 May 29;4(30). doi: 10.1126/scirobotics.aaw0955.
6
Two is better than one: physical interactions improve motor performance in humans.两人优于一人:身体互动可改善人类的运动表现。
Sci Rep. 2014 Jan 23;4:3824. doi: 10.1038/srep03824.