一种使用Informer模型提高远程机器人手术准确性的预测方法。

A Predictive Approach for Enhancing Accuracy in Remote Robotic Surgery Using Informer Model.

作者信息

Lashari Muhammad Hanif, Ahmed Shakil, Batayneh Wafa, Khokhar Ashfaq

机构信息

Department of Electrical & Computer Engineering, Iowa State University, Ames, IA 50011, USA.

出版信息

Sensors (Basel). 2025 May 13;25(10):3067. doi: 10.3390/s25103067.

DOI:10.3390/s25103067

PMID:40431859

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12115247/

Abstract

Precise and real-time estimation of the robotic arm's position on the patient's side is essential for the success of remote robotic surgery in Tactile Internet (TI) environments. This paper presents a prediction model based on the Transformer-based Informer framework for accurate and efficient position estimation, combined with a Four-State Hidden Markov Model (4-State HMM) to simulate realistic packet loss scenarios. The proposed approach addresses challenges such as network delays, jitter, and packet loss to ensure reliable and precise operation in remote surgical applications. The method integrates the optimization problem into the Informer model by embedding constraints such as energy efficiency, smoothness, and robustness into its training process using a differentiable optimization layer. The Informer framework uses features such as ProbSparse attention, attention distilling, and a generative-style decoder to focus on position-critical features while maintaining a low computational complexity of O(LlogL). The method is evaluated using the JIGSAWS dataset, achieving a prediction accuracy of over 90% under various network scenarios. A comparison with models such as TCN, RNN, and LSTM demonstrates the Informer framework's superior performance in handling position prediction and meeting real-time requirements, making it suitable for Tactile Internet-enabled robotic surgery.

摘要

在触觉互联网（TI）环境中，精确且实时地估计机器人手臂在患者身体一侧的位置对于远程机器人手术的成功至关重要。本文提出了一种基于Transformer的Informer框架的预测模型，用于准确高效的位置估计，并结合四状态隐马尔可夫模型（4-State HMM）来模拟实际的数据包丢失情况。所提出的方法解决了网络延迟、抖动和数据包丢失等挑战，以确保在远程手术应用中可靠且精确地操作。该方法通过使用可微优化层将诸如能量效率、平滑性和鲁棒性等约束嵌入到Informer模型的训练过程中，从而将优化问题集成到Informer模型中。Informer框架使用概率稀疏注意力、注意力蒸馏和生成式解码器等功能，在保持O(LlogL)的低计算复杂度的同时，专注于对位置至关重要的特征。该方法使用JIGSAWS数据集进行评估，在各种网络场景下实现了超过90%的预测准确率。与TCN、RNN和LSTM等模型的比较表明，Informer框架在处理位置预测和满足实时要求方面具有卓越的性能，使其适用于支持触觉互联网的机器人手术。