Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot
Domain Transfer
- URL: http://arxiv.org/abs/2403.06953v1
- Date: Mon, 11 Mar 2024 17:36:11 GMT
- Title: Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot
Domain Transfer
- Authors: Siddhant Satyanaik, Aditya Murali, Deepak Alapatt, Xin Wang, Pietro
Mascagni, Nicolas Padoy
- Abstract summary: We evaluate four object-centric approaches for domain generalization, establishing baseline performance.
We develop an optimized method specifically tailored for domain generalization, LG-DG, that includes a novel disentanglement loss function.
Our optimized approach, LG-DG, achieves an improvement of 9.28% over the best baseline approach.
- Score: 6.880129372917993
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Purpose: Advances in deep learning have resulted in effective models for
surgical video analysis; however, these models often fail to generalize across
medical centers due to domain shift caused by variations in surgical workflow,
camera setups, and patient demographics. Recently, object-centric learning has
emerged as a promising approach for improved surgical scene understanding,
capturing and disentangling visual and semantic properties of surgical tools
and anatomy to improve downstream task performance. In this work, we conduct a
multi-centric performance benchmark of object-centric approaches, focusing on
Critical View of Safety assessment in laparoscopic cholecystectomy, then
propose an improved approach for unseen domain generalization.
Methods: We evaluate four object-centric approaches for domain
generalization, establishing baseline performance. Next, leveraging the
disentangled nature of object-centric representations, we dissect one of these
methods through a series of ablations (e.g. ignoring either visual or semantic
features for downstream classification). Finally, based on the results of these
ablations, we develop an optimized method specifically tailored for domain
generalization, LG-DG, that includes a novel disentanglement loss function.
Results: Our optimized approach, LG-DG, achieves an improvement of 9.28% over
the best baseline approach. More broadly, we show that object-centric
approaches are highly effective for domain generalization thanks to their
modular approach to representation learning.
Conclusion: We investigate the use of object-centric methods for unseen
domain generalization, identify method-agnostic factors critical for
performance, and present an optimized approach that substantially outperforms
existing methods.
Related papers
- Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning [14.688842697886484]
We present a novel application of Inverse Reinforcement Learning that identifies suboptimal clinician actions based on the actions of their peers.
This approach centers two stages of IRL with an intermediate step to prune trajectories displaying behavior that deviates significantly from the consensus.
arXiv Detail & Related papers (2024-11-07T23:16:59Z) - GS-EMA: Integrating Gradient Surgery Exponential Moving Average with
Boundary-Aware Contrastive Learning for Enhanced Domain Generalization in
Aneurysm Segmentation [41.97669338211682]
We propose a novel domain generalization strategy that employs gradient surgery exponential moving average (GS-EMA) optimization technique and boundary-aware contrastive learning (BACL)
Our approach is distinct in its ability to adapt to new, unseen domains by learning domain-invariant features, thereby improving the robustness and accuracy of aneurysm segmentation across diverse clinical datasets.
arXiv Detail & Related papers (2024-02-23T10:02:15Z) - Hypergraph-Transformer (HGT) for Interactive Event Prediction in
Laparoscopic and Robotic Surgery [50.3022015601057]
We propose a predictive neural network that is capable of understanding and predicting critical interactive aspects of surgical workflow from intra-abdominal video.
We verify our approach on established surgical datasets and applications, including the detection and prediction of action triplets.
Our results demonstrate the superiority of our approach compared to unstructured alternatives.
arXiv Detail & Related papers (2024-02-03T00:58:05Z) - Optimizing $CO_{2}$ Capture in Pressure Swing Adsorption Units: A Deep
Neural Network Approach with Optimality Evaluation and Operating Maps for
Decision-Making [0.0]
This study focuses on enhancing Pressure Swing Adsorption units for carbon dioxide capture.
We developed and implemented a multiple-input, single-output (MISO) framework comprising two deep neural network (DNN) models.
This approach delineated feasible operational regions (FORs) and highlighted the spectrum of optimal decision-making scenarios.
arXiv Detail & Related papers (2023-12-06T19:43:37Z) - Exploring the Optimization Objective of One-Class Classification for
Anomaly Detection [2.9266769103356305]
One-class classification (OCC) is a longstanding method for anomaly detection.
In this work, we conduct a thorough investigation into the optimization objective of OCC.
We unveil a key insights: any space with the suitable norm can serve as an equivalent substitute for the hypersphere center.
This novel insight sparks a simple and data-agnostic deep one-class classification method.
arXiv Detail & Related papers (2023-08-23T03:46:04Z) - Data Augmentation-Based Unsupervised Domain Adaptation In Medical
Imaging [0.709016563801433]
We propose an unsupervised method for robust domain adaptation in brain MRI segmentation by leveraging MRI-specific augmentation techniques.
The results show that our proposed approach achieves high accuracy, exhibits broad applicability, and showcases remarkable robustness against domain shift in various tasks.
arXiv Detail & Related papers (2023-08-08T17:00:11Z) - Taxonomy Adaptive Cross-Domain Adaptation in Medical Imaging via
Optimization Trajectory Distillation [73.83178465971552]
The success of automated medical image analysis depends on large-scale and expert-annotated training sets.
Unsupervised domain adaptation (UDA) has been raised as a promising approach to alleviate the burden of labeled data collection.
We propose optimization trajectory distillation, a unified approach to address the two technical challenges from a new perspective.
arXiv Detail & Related papers (2023-07-27T08:58:05Z) - Dissecting Self-Supervised Learning Methods for Surgical Computer Vision [51.370873913181605]
Self-Supervised Learning (SSL) methods have begun to gain traction in the general computer vision community.
The effectiveness of SSL methods in more complex and impactful domains, such as medicine and surgery, remains limited and unexplored.
We present an extensive analysis of the performance of these methods on the Cholec80 dataset for two fundamental and popular tasks in surgical context understanding, phase recognition and tool presence detection.
arXiv Detail & Related papers (2022-07-01T14:17:11Z) - Real-time landmark detection for precise endoscopic submucosal
dissection via shape-aware relation network [51.44506007844284]
We propose a shape-aware relation network for accurate and real-time landmark detection in endoscopic submucosal dissection surgery.
We first devise an algorithm to automatically generate relation keypoint heatmaps, which intuitively represent the prior knowledge of spatial relations among landmarks.
We then develop two complementary regularization schemes to progressively incorporate the prior knowledge into the training process.
arXiv Detail & Related papers (2021-11-08T07:57:30Z) - Shape-aware Meta-learning for Generalizing Prostate MRI Segmentation to
Unseen Domains [68.73614619875814]
We present a novel shape-aware meta-learning scheme to improve the model generalization in prostate MRI segmentation.
Experimental results show that our approach outperforms many state-of-the-art generalization methods consistently across all six settings of unseen domains.
arXiv Detail & Related papers (2020-07-04T07:56:02Z) - Automatic Data Augmentation via Deep Reinforcement Learning for
Effective Kidney Tumor Segmentation [57.78765460295249]
We develop a novel automatic learning-based data augmentation method for medical image segmentation.
In our method, we innovatively combine the data augmentation module and the subsequent segmentation module in an end-to-end training manner with a consistent loss.
We extensively evaluated our method on CT kidney tumor segmentation which validated the promising results of our method.
arXiv Detail & Related papers (2020-02-22T14:10:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.