He Xian, Zhang Shuai, Chu Jian, Jia Tongyu, Yu Lantao, Ouyang Bo
School of Management, Hefei University of Technology, Hefei, China.
Key Laboratory of Process Optimization and Intelligent Decision-Making (Ministry of Education), Hefei University of Technology, Hefei, China.
Cyborg Bionic Syst. 2025 Apr 14;6:0114. doi: 10.34133/cbsystems.0114. eCollection 2025.
Intraoperative soft tissue manipulation is a critical challenge in autonomous robotic surgery. Furthermore, the intricate environment surrounding the target soft tissues poses additional hindrances to autonomous robotic decision-making. Previous studies assumed the grasping point was known and the target deformation could be achieved. The constraints were assumed to be constant during the operation, and there were no obstacles around the soft tissue. To address these problems, an intuition-guided deep reinforcement learning framework based on soft actor-critic (ID-SAC) was proposed for soft tissue manipulation under unknown constraints. The SAC algorithm is automatically activated upon encountering an obstacle, and the designed intuitive manipulation (IM) strategy is used to pull soft tissues toward the target shape directly when the obstacle is distant. A regulator factor is designed as an action within this framework to coordinate the IM approach and the SAC network. A reward function is designed to balance the exploration and exploitation of large deformations. Simultaneously, we proposed an autonomous grasp point selection neural network to prevent the impractical selection of grasp points, ensuring they can reach the target while avoiding grasping lesions and constrained areas. Successful simulation results confirmed that the proposed framework can manipulate the soft tissue while avoiding obstacles and adding new positional constraints. Compared with the SAC algorithm, the proposed framework can markedly increase the robotic soft tissue manipulation ability by automatically adjusting the regulator factors.
术中软组织操作是自主机器人手术中的一项关键挑战。此外,目标软组织周围复杂的环境给自主机器人决策带来了额外的阻碍。以往的研究假设抓取点已知且目标变形能够实现。假设操作过程中的约束条件是恒定的,并且软组织周围没有障碍物。为了解决这些问题,提出了一种基于软演员-评论家(ID-SAC)的直觉引导深度强化学习框架,用于在未知约束条件下进行软组织操作。SAC算法在遇到障碍物时自动激活,当障碍物距离较远时,使用设计的直观操作(IM)策略将软组织直接拉向目标形状。在该框架内设计了一个调节因子作为一种动作,以协调IM方法和SAC网络。设计了一个奖励函数来平衡对大变形的探索和利用。同时,我们提出了一种自主抓取点选择神经网络,以防止抓取点的不切实际选择,确保它们能够到达目标,同时避免抓取病变和受限区域。成功的仿真结果证实,所提出的框架能够在避免障碍物并添加新的位置约束的同时操纵软组织。与SAC算法相比,所提出的框架能够通过自动调整调节因子显著提高机器人的软组织操作能力。