A Zero-Shot Reinforcement Learning Strategy for Autonomous Guidewire
Navigation
- URL: http://arxiv.org/abs/2403.02777v1
- Date: Tue, 5 Mar 2024 08:46:54 GMT
- Title: A Zero-Shot Reinforcement Learning Strategy for Autonomous Guidewire
Navigation
- Authors: Valentina Scarponi (MIMESIS, ICube), Michel Duprez (ICube, MIMESIS),
Florent Nageotte (ICube), St\'ephane Cotin (ICube, MIMESIS)
- Abstract summary: The treatment of cardiovascular diseases requires complex and challenging navigation of a guidewire and catheter.
This often leads to lengthy interventions during which the patient and clinician are exposed to X-ray radiation.
Deep Reinforcement Learning approaches have shown promise in learning this task and may be the key to automating catheter navigation during robotized interventions.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Purpose: The treatment of cardiovascular diseases requires complex and
challenging navigation of a guidewire and catheter. This often leads to lengthy
interventions during which the patient and clinician are exposed to X-ray
radiation. Deep Reinforcement Learning approaches have shown promise in
learning this task and may be the key to automating catheter navigation during
robotized interventions. Yet, existing training methods show limited
capabilities at generalizing to unseen vascular anatomies, requiring to be
retrained each time the geometry changes. Methods: In this paper, we propose a
zero-shot learning strategy for three-dimensional autonomous endovascular
navigation. Using a very small training set of branching patterns, our
reinforcement learning algorithm is able to learn a control that can then be
applied to unseen vascular anatomies without retraining. Results: We
demonstrate our method on 4 different vascular systems, with an average success
rate of 95% at reaching random targets on these anatomies. Our strategy is also
computationally efficient, allowing the training of our controller to be
performed in only 2 hours. Conclusion: Our training method proved its ability
to navigate unseen geometries with different characteristics, thanks to a
nearly shape-invariant observation space.
Related papers
- Fast Medical Shape Reconstruction via Meta-learned Implicit Neural Representations [5.213304732451705]
Minimizing retrieval and processing times potentially enhances swift response and decision-making in critical scenarios.
Recent methods attempt to solve the medical shape reconstruction problem by utilizing implicit neural functions.
arXiv Detail & Related papers (2024-09-11T08:44:10Z) - Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning [39.70065117918227]
Autonomous navigation of catheters and guidewires can enhance endovascular surgery safety and efficacy, reducing procedure times and operator radiation exposure.
This study explores the viability of autonomous navigation in MT vasculature using inverse RL (IRL) to leverage expert demonstrations.
arXiv Detail & Related papers (2024-06-18T11:00:55Z) - RLIF: Interactive Imitation Learning as Reinforcement Learning [56.997263135104504]
We show how off-policy reinforcement learning can enable improved performance under assumptions that are similar but potentially even more practical than those of interactive imitation learning.
Our proposed method uses reinforcement learning with user intervention signals themselves as rewards.
This relaxes the assumption that intervening experts in interactive imitation learning should be near-optimal and enables the algorithm to learn behaviors that improve over the potential suboptimal human expert.
arXiv Detail & Related papers (2023-11-21T21:05:21Z) - Automated Fidelity Assessment for Strategy Training in Inpatient
Rehabilitation using Natural Language Processing [53.096237570992294]
Strategy training is a rehabilitation approach that teaches skills to reduce disability among those with cognitive impairments following a stroke.
Standardized fidelity assessment is used to measure adherence to treatment principles.
We developed a rule-based NLP algorithm, a long-short term memory (LSTM) model, and a bidirectional encoder representation from transformers (BERT) model for this task.
arXiv Detail & Related papers (2022-09-14T15:33:30Z) - Federated Cycling (FedCy): Semi-supervised Federated Learning of
Surgical Phases [57.90226879210227]
FedCy is a semi-supervised learning (FSSL) method that combines FL and self-supervised learning to exploit a decentralized dataset of both labeled and unlabeled videos.
We demonstrate significant performance gains over state-of-the-art FSSL methods on the task of automatic recognition of surgical phases.
arXiv Detail & Related papers (2022-03-14T17:44:53Z) - Learn2Reg: comprehensive multi-task medical image registration
challenge, dataset and evaluation in the era of deep learning [19.267693026491482]
Learn2Reg covers a wide range of anatomies: brain, abdomen and thorax, modalities: ultrasound, CT, MRI, populations: intra- and inter-patient and levels of supervision.
Our complementary set of metrics, including robustness, accuracy, plausibility and speed enables unique insight into the current-state-of-the-art of medical image registration.
arXiv Detail & Related papers (2021-12-08T09:46:39Z) - Personalized Rehabilitation Robotics based on Online Learning Control [62.6606062732021]
We propose a novel online learning control architecture, which is able to personalize the control force at run time to each individual user.
We evaluate our method in an experimental user study, where the learning controller is shown to provide personalized control, while also obtaining safe interaction forces.
arXiv Detail & Related papers (2021-10-01T15:28:44Z) - Self Context and Shape Prior for Sensorless Freehand 3D Ultrasound
Reconstruction [61.62191904755521]
3D freehand US reconstruction is promising in addressing the problem by providing broad range and freeform scan.
Existing deep learning based methods only focus on the basic cases of skill sequences.
We propose a novel approach to sensorless freehand 3D US reconstruction considering the complex skill sequences.
arXiv Detail & Related papers (2021-07-31T16:06:50Z) - Adversarial Training is Not Ready for Robot Learning [55.493354071227174]
Adversarial training is an effective method to train deep learning models that are resilient to norm-bounded perturbations.
We show theoretically and experimentally that neural controllers obtained via adversarial training are subjected to three types of defects.
Our results suggest that adversarial training is not yet ready for robot learning.
arXiv Detail & Related papers (2021-03-15T07:51:31Z) - "Train one, Classify one, Teach one" -- Cross-surgery transfer learning
for surgical step recognition [14.635480748841317]
We analyze, for the first time, surgical step recognition on four different laparoscopic surgeries.
We introduce a new architecture, the Time-Series Adaptation Network (TSAN), an architecture optimized for transfer learning of surgical step recognition.
Our proposed architecture leads to better performance compared to other possible architectures, reaching over 90% accuracy when transferring from laparoscopic Cholecystectomy to the other three procedure types.
arXiv Detail & Related papers (2021-02-24T14:36:18Z) - Unifying Cardiovascular Modelling with Deep Reinforcement Learning for
Uncertainty Aware Control of Sepsis Treatment [0.2399911126932526]
There is no universally agreed upon strategy for vasopressor and fluid administration.
Sepsis is the leading cause of mortality in the ICU, responsible for 6% of all hospitalizations and 35% of all in-hospital deaths in USA.
We propose a novel approach, exploiting and unifying complementary strengths of Mathematical Modelling, Deep Learning, Reinforcement Learning and Uncertainty Quantification.
arXiv Detail & Related papers (2021-01-21T07:32:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.