使用逆强化学习进行机械血栓切除术中的导管和导丝的自主导航。

Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning.

机构信息

Surgical and Interventional Engineering, School of Biomedical Engineering and Imaging Sciences, Kings College London, London, UK.

AIBE, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany.

出版信息

Int J Comput Assist Radiol Surg. 2024 Aug;19(8):1569-1578. doi: 10.1007/s11548-024-03208-w. Epub 2024 Jun 17.

DOI:10.1007/s11548-024-03208-w

PMID:38884893

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7616368/

Abstract

PURPOSE

Autonomous navigation of catheters and guidewires can enhance endovascular surgery safety and efficacy, reducing procedure times and operator radiation exposure. Integrating tele-operated robotics could widen access to time-sensitive emergency procedures like mechanical thrombectomy (MT). Reinforcement learning (RL) shows potential in endovascular navigation, yet its application encounters challenges without a reward signal. This study explores the viability of autonomous guidewire navigation in MT vasculature using inverse reinforcement learning (IRL) to leverage expert demonstrations.

METHODS

Employing the Simulation Open Framework Architecture (SOFA), this study established a simulation-based training and evaluation environment for MT navigation. We used IRL to infer reward functions from expert behaviour when navigating a guidewire and catheter. We utilized the soft actor-critic algorithm to train models with various reward functions and compared their performance in silico.

RESULTS

We demonstrated feasibility of navigation using IRL. When evaluating single- versus dual-device (i.e. guidewire versus catheter and guidewire) tracking, both methods achieved high success rates of 95% and 96%, respectively. Dual tracking, however, utilized both devices mimicking an expert. A success rate of 100% and procedure time of 22.6 s were obtained when training with a reward function obtained through 'reward shaping'. This outperformed a dense reward function (96%, 24.9 s) and an IRL-derived reward function (48%, 59.2 s).

CONCLUSIONS

We have contributed to the advancement of autonomous endovascular intervention navigation, particularly MT, by effectively employing IRL based on demonstrator expertise. The results underscore the potential of using reward shaping to efficiently train models, offering a promising avenue for enhancing the accessibility and precision of MT procedures. We envisage that future research can extend our methodology to diverse anatomical structures to enhance generalizability.

摘要

目的

导管和导丝的自主导航可以提高血管内手术的安全性和效果，减少手术时间和操作人员的辐射暴露。远程操作机器人的集成可以扩大机械血栓切除术 (MT) 等时间敏感的急诊手术的应用范围。强化学习 (RL) 在血管内导航中显示出潜力，但由于没有奖励信号，其应用面临挑战。本研究通过使用逆强化学习 (IRL) 来利用专家演示来探索在 MT 脉管系统中进行自主导丝导航的可行性。

方法

本研究使用 Simulation Open Framework Architecture (SOFA) 建立了一个基于模拟的 MT 导航培训和评估环境。我们使用 IRL 从导航导丝和导管时的专家行为中推断奖励函数。我们使用软动作-评论家算法来训练具有不同奖励函数的模型，并在模拟中比较它们的性能。

结果

我们证明了使用 IRL 进行导航的可行性。在评估单设备（即导丝）与双设备（即导丝和导管）跟踪时，两种方法的成功率均高达 95% 和 96%。然而，双跟踪使用了两种设备，模仿了专家的操作。当使用通过“奖励塑造”获得的奖励函数进行训练时，获得了 100%的成功率和 22.6 秒的手术时间。这优于密集奖励函数 (96%，24.9 秒) 和 IRL 衍生的奖励函数 (48%，59.2 秒)。

结论

我们通过有效地利用基于演示专家的 IRL，为自主血管内干预导航，特别是 MT 的发展做出了贡献。结果强调了使用奖励塑造来有效地训练模型的潜力，为提高 MT 手术的可及性和精度提供了有前途的途径。我们设想，未来的研究可以将我们的方法扩展到不同的解剖结构，以提高通用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3ab/11588929/dae949de71d5/11548_2024_3208_Fig1_HTML.jpg

相似文献

Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning.

Int J Comput Assist Radiol Surg. 2024 Aug;19(8):1569-1578. doi: 10.1007/s11548-024-03208-w. Epub 2024 Jun 17.

Reinforcement learning for safe autonomous two-device navigation of cerebral vessels in mechanical thrombectomy.

Int J Comput Assist Radiol Surg. 2025 Apr 3. doi: 10.1007/s11548-025-03339-8.

Benchmarking reinforcement learning algorithms for autonomous mechanical thrombectomy.

Int J Comput Assist Radiol Surg. 2025 Jun;20(6):1231-1238. doi: 10.1007/s11548-025-03360-x. Epub 2025 Apr 29.

Artificial intelligence in the autonomous navigation of endovascular interventions: a systematic review.

Front Hum Neurosci. 2023 Aug 4;17:1239374. doi: 10.3389/fnhum.2023.1239374. eCollection 2023.

Three-dimensional electromagnetic navigation vs. fluoroscopy for endovascular aneurysm repair: a prospective feasibility study in patients.

J Endovasc Ther. 2012 Feb;19(1):70-8. doi: 10.1583/11-3557.1.

A zero-shot reinforcement learning strategy for autonomous guidewire navigation.

Int J Comput Assist Radiol Surg. 2024 Jun;19(6):1185-1192. doi: 10.1007/s11548-024-03092-4. Epub 2024 Apr 16.

Mechanical Thrombectomy using Distal Access Catheters: Current Status and Future Prospects.

J Neuroimaging. 2020 Nov;30(6):754-761. doi: 10.1111/jon.12793. Epub 2020 Nov 3.

Leveraging Expert Demonstration Features for Deep Reinforcement Learning in Floor Cleaning Robot Navigation.

Sensors (Basel). 2022 Oct 12;22(20):7750. doi: 10.3390/s22207750.

Learning-based autonomous vascular guidewire navigation without human demonstration in the venous system of a porcine liver.

Int J Comput Assist Radiol Surg. 2022 Nov;17(11):2033-2040. doi: 10.1007/s11548-022-02646-8. Epub 2022 May 23.

Recurrent neural networks for generalization towards the vessel geometry in autonomous endovascular guidewire navigation in the aortic arch.

Int J Comput Assist Radiol Surg. 2023 Sep;18(9):1735-1744. doi: 10.1007/s11548-023-02938-7. Epub 2023 May 28.

引用本文的文献

Benchmarking reinforcement learning algorithms for autonomous mechanical thrombectomy.

Int J Comput Assist Radiol Surg. 2025 Jun;20(6):1231-1238. doi: 10.1007/s11548-025-03360-x. Epub 2025 Apr 29.

Advanced Robotics for the Next-Generation of Cardiac Interventions.

Micromachines (Basel). 2025 Mar 22;16(4):363. doi: 10.3390/mi16040363.

Reinforcement learning for safe autonomous two-device navigation of cerebral vessels in mechanical thrombectomy.

Int J Comput Assist Radiol Surg. 2025 Apr 3. doi: 10.1007/s11548-025-03339-8.

本文引用的文献

Artificial intelligence in the autonomous navigation of endovascular interventions: a systematic review.

Front Hum Neurosci. 2023 Aug 4;17:1239374. doi: 10.3389/fnhum.2023.1239374. eCollection 2023.

Robotic Diagnostic Cerebral Angiography: A Multicenter Experience of 113 Patients.

J Neurointerv Surg. 2024 Jun 17;16(7):726-730. doi: 10.1136/jnis-2023-020448.

Comparative verification of control methodology for robotic interventional neuroradiology procedures.

Int J Comput Assist Radiol Surg. 2023 Nov;18(11):1977-1986. doi: 10.1007/s11548-023-02991-2. Epub 2023 Jul 17.

Recurrent neural networks for generalization towards the vessel geometry in autonomous endovascular guidewire navigation in the aortic arch.

Int J Comput Assist Radiol Surg. 2023 Sep;18(9):1735-1744. doi: 10.1007/s11548-023-02938-7. Epub 2023 May 28.

Learning-based autonomous vascular guidewire navigation without human demonstration in the venous system of a porcine liver.

Int J Comput Assist Radiol Surg. 2022 Nov;17(11):2033-2040. doi: 10.1007/s11548-022-02646-8. Epub 2022 May 23.

Neurosurgery and artificial intelligence.

AIMS Neurosci. 2021 Aug 6;8(4):477-495. doi: 10.3934/Neuroscience.2021025. eCollection 2021.

Robotics in neurointerventional surgery: a systematic review of the literature.

J Neurointerv Surg. 2022 Jun;14(6):539-545. doi: 10.1136/neurintsurg-2021-018096. Epub 2021 Nov 19.

Machine Learning: Algorithms, Real-World Applications and Research Directions.

SN Comput Sci. 2021;2(3):160. doi: 10.1007/s42979-021-00592-x. Epub 2021 Mar 22.

Estimating the number of UK stroke patients eligible for endovascular thrombectomy.

Eur Stroke J. 2017 Dec;2(4):319-326. doi: 10.1177/2396987317733343. Epub 2017 Oct 4.

Endovascular Treatment of Ischemic Stroke: An Updated Meta-Analysis of Efficacy and Safety.

Vasc Endovascular Surg. 2017 May;51(4):215-219. doi: 10.1177/1538574417698905. Epub 2017 Mar 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr
超能文献

使用逆强化学习进行机械血栓切除术中的导管和导丝的自主导航。

Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning.

机构信息