Reinforcement Learning for Safe Autonomous Two Device Navigation of Cerebral Vessels in Mechanical Thrombectomy
- URL: http://arxiv.org/abs/2503.24140v1
- Date: Mon, 31 Mar 2025 14:25:46 GMT
- Title: Reinforcement Learning for Safe Autonomous Two Device Navigation of Cerebral Vessels in Mechanical Thrombectomy
- Authors: Harry Robertshaw, Benjamin Jackson, Jiaheng Wang, Hadi Sadati, Lennart Karstensen, Alejandro Granados, Thomas C Booth,
- Abstract summary: Current reinforcement learning methods only reach the carotid arteries.<n>We propose a safe dual-device RL algorithm that can navigate beyond the carotid arteries to cerebral vessels.
- Score: 38.51304071363744
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Purpose: Autonomous systems in mechanical thrombectomy (MT) hold promise for reducing procedure times, minimizing radiation exposure, and enhancing patient safety. However, current reinforcement learning (RL) methods only reach the carotid arteries, are not generalizable to other patient vasculatures, and do not consider safety. We propose a safe dual-device RL algorithm that can navigate beyond the carotid arteries to cerebral vessels. Methods: We used the Simulation Open Framework Architecture to represent the intricacies of cerebral vessels, and a modified Soft Actor-Critic RL algorithm to learn, for the first time, the navigation of micro-catheters and micro-guidewires. We incorporate patient safety metrics into our reward function by integrating guidewire tip forces. Inverse RL is used with demonstrator data on 12 patient-specific vascular cases. Results: Our simulation demonstrates successful autonomous navigation within unseen cerebral vessels, achieving a 96% success rate, 7.0s procedure time, and 0.24 N mean forces, well below the proposed 1.5 N vessel rupture threshold. Conclusion: To the best of our knowledge, our proposed autonomous system for MT two-device navigation reaches cerebral vessels, considers safety, and is generalizable to unseen patient-specific cases for the first time. We envisage future work will extend the validation to vasculatures of different complexity and on in vitro models. While our contributions pave the way towards deploying agents in clinical settings, safety and trustworthiness will be crucial elements to consider when proposing new methodology.
Related papers
- ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection [10.2380174289706]
We create the ESD Trajectory and Confidence Map-based Safety (ETSM) dataset with $1849$ short clips, focusing on submucosal dissection with a dual-arm robotic system.
We also introduce a framework that combines optimal dissection trajectory prediction with a confidence map-based safety margin.
Our approach bridges gaps in current research by improving prediction accuracy and enhancing the safety of the dissection process.
arXiv Detail & Related papers (2024-11-28T03:19:18Z) - Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning [5.520042381826271]
Existing automated reinforcement learning navigation algorithms often result in potentially risky collisions.<n>We propose a Human Intervention (HI)-based Proximal Policy Optimization framework, dubbed HI-PPO, which incorporates expert knowledge to enhance RDE's safety.<n>Results show that HI-PPO achieved a mean ATE of (8.02 textmm) and a Security Score of (0.862) demonstrating performance comparable to human experts.
arXiv Detail & Related papers (2024-09-24T03:01:30Z) - Think Step by Step: Chain-of-Gesture Prompting for Error Detection in Robotic Surgical Videos [18.106255939686545]
This letter presents a novel and real-time end-to-end error detection framework, Chain-of-Thought (COG) prompting.
This encompasses two reasoning modules designed to mimic the decision-making processes of expert surgeons.
Our method encapsulates the reasoning processes inherent to surgical activities enabling it to outperform the state-of-the-art by 4.6% in F1 score, 4.6% in Accuracy, and 5.9% in Jaccard index while processing each frame in 6.69 milliseconds on average.
arXiv Detail & Related papers (2024-06-27T14:43:50Z) - Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning [39.70065117918227]
Autonomous navigation of catheters and guidewires can enhance endovascular surgery safety and efficacy, reducing procedure times and operator radiation exposure.
This study explores the viability of autonomous navigation in MT vasculature using inverse RL (IRL) to leverage expert demonstrations.
arXiv Detail & Related papers (2024-06-18T11:00:55Z) - A Zero-Shot Reinforcement Learning Strategy for Autonomous Guidewire
Navigation [0.0]
The treatment of cardiovascular diseases requires complex and challenging navigation of a guidewire and catheter.
This often leads to lengthy interventions during which the patient and clinician are exposed to X-ray radiation.
Deep Reinforcement Learning approaches have shown promise in learning this task and may be the key to automating catheter navigation during robotized interventions.
arXiv Detail & Related papers (2024-03-05T08:46:54Z) - Safe Deep RL for Intraoperative Planning of Pedicle Screw Placement [61.28459114068828]
We propose an intraoperative planning approach for robotic spine surgery that leverages real-time observation for drill path planning based on Safe Deep Reinforcement Learning (DRL)
Our approach was capable of achieving 90% bone penetration with respect to the gold standard (GS) drill planning.
arXiv Detail & Related papers (2023-05-09T11:42:53Z) - A Multiplicative Value Function for Safe and Efficient Reinforcement
Learning [131.96501469927733]
We propose a safe model-free RL algorithm with a novel multiplicative value function consisting of a safety critic and a reward critic.
The safety critic predicts the probability of constraint violation and discounts the reward critic that only estimates constraint-free returns.
We evaluate our method in four safety-focused environments, including classical RL benchmarks augmented with safety constraints and robot navigation tasks with images and raw Lidar scans as observations.
arXiv Detail & Related papers (2023-03-07T18:29:15Z) - Robotic Navigation Autonomy for Subretinal Injection via Intelligent
Real-Time Virtual iOCT Volume Slicing [88.99939660183881]
We propose a framework for autonomous robotic navigation for subretinal injection.
Our method consists of an instrument pose estimation method, an online registration between the robotic and the i OCT system, and trajectory planning tailored for navigation to an injection target.
Our experiments on ex-vivo porcine eyes demonstrate the precision and repeatability of the method.
arXiv Detail & Related papers (2023-01-17T21:41:21Z) - Building Brains: Subvolume Recombination for Data Augmentation in Large
Vessel Occlusion Detection [56.67577446132946]
A large training data set is required for a standard deep learning-based model to learn this strategy from data.
We propose an augmentation method that generates artificial training samples by recombining vessel tree segmentations of the hemispheres from different patients.
In line with the augmentation scheme, we use a 3D-DenseNet fed with task-specific input, fostering a side-by-side comparison between the hemispheres.
arXiv Detail & Related papers (2022-05-05T10:31:57Z) - Learning to be Safe: Deep RL with a Safety Critic [72.00568333130391]
A natural first approach toward safe RL is to manually specify constraints on the policy's behavior.
We propose to learn how to be safe in one set of tasks and environments, and then use that learned intuition to constrain future behaviors.
arXiv Detail & Related papers (2020-10-27T20:53:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.