World Model for AI Autonomous Navigation in Mechanical Thrombectomy
- URL: http://arxiv.org/abs/2509.25518v2
- Date: Thu, 02 Oct 2025 05:02:54 GMT
- Title: World Model for AI Autonomous Navigation in Mechanical Thrombectomy
- Authors: Harry Robertshaw, Han-Ru Wu, Alejandro Granados, Thomas C Booth,
- Abstract summary: We propose a world model for autonomous endovascular navigation using TD-MPC2, a model-based RL algorithm.<n>We trained a single RL agent across multiple endovascular navigation tasks in ten real patient vasculatures, comparing performance against the state-of-the-art Soft Actor-Critic (SAC) method.<n>Results indicate that TD-MPC2 significantly outperforms SAC in multi-task learning, achieving a 65% mean success rate compared to SAC's 37%, with notable improvements in path ratio.
- Score: 39.51808126417126
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Autonomous navigation for mechanical thrombectomy (MT) remains a critical challenge due to the complexity of vascular anatomy and the need for precise, real-time decision-making. Reinforcement learning (RL)-based approaches have demonstrated potential in automating endovascular navigation, but current methods often struggle with generalization across multiple patient vasculatures and long-horizon tasks. We propose a world model for autonomous endovascular navigation using TD-MPC2, a model-based RL algorithm. We trained a single RL agent across multiple endovascular navigation tasks in ten real patient vasculatures, comparing performance against the state-of-the-art Soft Actor-Critic (SAC) method. Results indicate that TD-MPC2 significantly outperforms SAC in multi-task learning, achieving a 65% mean success rate compared to SAC's 37%, with notable improvements in path ratio. TD-MPC2 exhibited increased procedure times, suggesting a trade-off between success rate and execution speed. These findings highlight the potential of world models for improving autonomous endovascular navigation and lay the foundation for future research in generalizable AI-driven robotic interventions.
Related papers
- Toward AI Autonomous Navigation for Mechanical Thrombectomy using Hierarchical Modular Multi-agent Reinforcement Learning (HM-MARL) [57.65363326406228]
We propose a Hierarchical Modular Multi-Agent Reinforcement Learning framework for autonomous two-device navigation in vitro.<n>HM-MARL was developed to autonomously navigate a guide catheter and guidewire from the femoral artery to the internal carotid artery (ICA)<n>A modular multi-agent approach was used to decompose the complex navigation task into specialized subtasks, each trained using Soft Actor-Critic RL.<n>In vitro, both HM-MARL models successfully navigated 100% of trials from the femoral artery to the right common carotid artery and 80% to the right ICA but failed on the left-side vessel challenge
arXiv Detail & Related papers (2026-02-20T23:50:35Z) - MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning [53.37068897861388]
MedSAM-Agent is a framework that reformulates interactive segmentation as a multi-step autonomous decision-making process.<n>We develop a two-stage training pipeline that integrates multi-turn, end-to-end outcome verification.<n>Experiments across 6 medical modalities and 21 datasets demonstrate that MedSAM-Agent achieves state-of-the-art performance.
arXiv Detail & Related papers (2026-02-03T09:47:49Z) - DINO-CVA: A Multimodal Goal-Conditioned Vision-to-Action Model for Autonomous Catheter Navigation [0.33727511459109777]
This work moves towards autonomous catheter navigation by introducing DINO-CVA, a multimodal goal-conditioned behavior cloning framework.<n>The proposed model fuses visual observations and joystick kinematics into a joint embedding space, enabling policies that are both vision-aware and kinematic-aware.<n>Results show that DINO-CVA achieves high accuracy in predicting actions, matching the performance of a kinematics-only baseline.
arXiv Detail & Related papers (2025-10-19T22:59:32Z) - Autonomous Soft Robotic Guidewire Navigation via Imitation Learning [3.1381624795986345]
In endovascular surgery, interventionists push a thin tube called a catheter, guided by a thin wire to a treatment site inside the patient's blood vessels.<n>Guidewires with robotic tips can enhance maneuverability, but they present challenges in modeling and control.<n>We develop a transformer-based imitation learning framework with goal conditioning, relative action outputs, and automatic contrast dye injections.
arXiv Detail & Related papers (2025-10-10T15:57:09Z) - Reinforcement Learning for Safe Autonomous Two Device Navigation of Cerebral Vessels in Mechanical Thrombectomy [38.51304071363744]
Current reinforcement learning methods only reach the carotid arteries.<n>We propose a safe dual-device RL algorithm that can navigate beyond the carotid arteries to cerebral vessels.
arXiv Detail & Related papers (2025-03-31T14:25:46Z) - A deep learning framework for efficient pathology image analysis [0.0]
We introduce Eagle, a framework that emulates pathologists by selectively analyzing informative regions.<n>It processes a slide in 2.27 seconds, reducing computational time by more than 99% compared to existing models.<n>It provides robust and interpretable outputs, supporting rapid slide searches, integration into multi-omics pipelines and emerging clinical foundation models.
arXiv Detail & Related papers (2025-02-18T16:45:01Z) - Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning [39.70065117918227]
Autonomous navigation of catheters and guidewires can enhance endovascular surgery safety and efficacy, reducing procedure times and operator radiation exposure.
This study explores the viability of autonomous navigation in MT vasculature using inverse RL (IRL) to leverage expert demonstrations.
arXiv Detail & Related papers (2024-06-18T11:00:55Z) - A Zero-Shot Reinforcement Learning Strategy for Autonomous Guidewire
Navigation [0.0]
The treatment of cardiovascular diseases requires complex and challenging navigation of a guidewire and catheter.
This often leads to lengthy interventions during which the patient and clinician are exposed to X-ray radiation.
Deep Reinforcement Learning approaches have shown promise in learning this task and may be the key to automating catheter navigation during robotized interventions.
arXiv Detail & Related papers (2024-03-05T08:46:54Z) - Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation [72.24964965882783]
Reinforcement learning (RL) is a promising approach for robotic navigation, allowing robots to learn through trial and error.<n>Real-world robotic tasks often suffer from sparse rewards, leading to inefficient exploration and suboptimal policies.<n>We introduce Confidence-Controlled Exploration (CCE), a novel method that improves sample efficiency in RL-based robotic navigation without modifying the reward function.
arXiv Detail & Related papers (2023-06-09T18:45:15Z) - Robotic Navigation Autonomy for Subretinal Injection via Intelligent
Real-Time Virtual iOCT Volume Slicing [88.99939660183881]
We propose a framework for autonomous robotic navigation for subretinal injection.
Our method consists of an instrument pose estimation method, an online registration between the robotic and the i OCT system, and trajectory planning tailored for navigation to an injection target.
Our experiments on ex-vivo porcine eyes demonstrate the precision and repeatability of the method.
arXiv Detail & Related papers (2023-01-17T21:41:21Z) - Application of the nnU-Net for automatic segmentation of lung lesion on
CT images, and implication on radiomic models [1.8231394717039833]
A deep-learning automatic segmentation method was applied on computed tomography images of non-small-cell lung cancer patients.
The use of manual vs automatic segmentation in the performance of survival radiomic models was assessed, as well.
arXiv Detail & Related papers (2022-09-24T15:04:23Z) - Multiple Time Series Fusion Based on LSTM An Application to CAP A Phase
Classification Using EEG [56.155331323304]
Deep learning based electroencephalogram channels' feature level fusion is carried out in this work.
Channel selection, fusion, and classification procedures were optimized by two optimization algorithms.
arXiv Detail & Related papers (2021-12-18T14:17:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.