Rethinking Exemplars for Continual Semantic Segmentation in Endoscopy
Scenes: Entropy-based Mini-Batch Pseudo-Replay
- URL: http://arxiv.org/abs/2308.14100v1
- Date: Sun, 27 Aug 2023 13:07:44 GMT
- Title: Rethinking Exemplars for Continual Semantic Segmentation in Endoscopy
Scenes: Entropy-based Mini-Batch Pseudo-Replay
- Authors: Guankun Wang, Long Bai, Yanan Wu, Tong Chen, Hongliang Ren
- Abstract summary: Endoscopy is a widely used technique for the early detection of diseases or robotic-assisted minimally invasive surgery (RMIS)
Existing deep learning (DL) models may suffer from catastrophic forgetting.
Data privacy and storage issues may lead to the unavailability of old data when updating the model.
We propose a Endoscopy Continual Semantic (EndoCSS) framework that does not involve the storage and privacy issues of data.
- Score: 18.383604936008744
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Endoscopy is a widely used technique for the early detection of diseases or
robotic-assisted minimally invasive surgery (RMIS). Numerous deep learning
(DL)-based research works have been developed for automated diagnosis or
processing of endoscopic view. However, existing DL models may suffer from
catastrophic forgetting. When new target classes are introduced over time or
cross institutions, the performance of old classes may suffer severe
degradation. More seriously, data privacy and storage issues may lead to the
unavailability of old data when updating the model. Therefore, it is necessary
to develop a continual learning (CL) methodology to solve the problem of
catastrophic forgetting in endoscopic image segmentation. To tackle this, we
propose a Endoscopy Continual Semantic Segmentation (EndoCSS) framework that
does not involve the storage and privacy issues of exemplar data. The framework
includes a mini-batch pseudo-replay (MB-PR) mechanism and a self-adaptive noisy
cross-entropy (SAN-CE) loss. The MB-PR strategy circumvents privacy and storage
issues by generating pseudo-replay images through a generative model.
Meanwhile, the MB-PR strategy can also correct the model deviation to the
replay data and current training data, which is aroused by the significant
difference in the amount of current and replay images. Therefore, the model can
perform effective representation learning on both new and old tasks. SAN-CE
loss can help model fitting by adjusting the model's output logits, and also
improve the robustness of training. Extensive continual semantic segmentation
(CSS) experiments on public datasets demonstrate that our method can robustly
and effectively address the catastrophic forgetting brought by class increment
in endoscopy scenes. The results show that our framework holds excellent
potential for real-world deployment in a streaming learning manner.
Related papers
- Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation [3.6723640056915436]
We propose the Class-Aware Semantic Diffusion Model (CASDM) to tackle data scarcity and imbalance.
Class-aware mean squared error and class-aware self-perceptual loss functions have been defined to prioritize critical, less visible classes.
We are the first to generate multi-class segmentation maps using text prompts in a novel fashion to specify their contents.
arXiv Detail & Related papers (2024-10-31T14:14:30Z) - Continual Domain Incremental Learning for Privacy-aware Digital Pathology [3.6630930118966814]
Continual learning (CL) techniques aim to reduce the forgetting of past data when learning new data with distributional shift conditions.
We develop a Generative Latent Replay-based CL (GLRCL) approach to store past data and perform latent replay with new data.
arXiv Detail & Related papers (2024-09-10T12:21:54Z) - Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings [1.5703963908242198]
This paper introduces a novel relation-based knowledge framework by seamlessly combining adaptive affinity-based and kernel-based distillation.
To validate our innovative approach, we conducted experiments on publicly available multi-source prostate MRI data.
arXiv Detail & Related papers (2024-04-03T13:35:51Z) - ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning [54.68180752416519]
Panoptic segmentation is a cutting-edge computer vision task.
We introduce a novel and efficient method for continual panoptic segmentation based on Visual Prompt Tuning, dubbed ECLIPSE.
Our approach involves freezing the base model parameters and fine-tuning only a small set of prompt embeddings, addressing both catastrophic forgetting and plasticity.
arXiv Detail & Related papers (2024-03-29T11:31:12Z) - Self-STORM: Deep Unrolled Self-Supervised Learning for Super-Resolution Microscopy [55.2480439325792]
We introduce deep unrolled self-supervised learning, which alleviates the need for such data by training a sequence-specific, model-based autoencoder.
Our proposed method exceeds the performance of its supervised counterparts.
arXiv Detail & Related papers (2024-03-25T17:40:32Z) - ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic
Diffusion Models [69.9178140563928]
Colonoscopy analysis is essential for assisting clinical diagnosis and treatment.
The scarcity of annotated data limits the effectiveness and generalization of existing methods.
We propose an Adaptive Refinement Semantic Diffusion Model (ArSDM) to generate colonoscopy images that benefit the downstream tasks.
arXiv Detail & Related papers (2023-09-03T07:55:46Z) - Vision-Language Modelling For Radiological Imaging and Reports In The
Low Data Regime [70.04389979779195]
This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
arXiv Detail & Related papers (2023-03-30T18:20:00Z) - Reducing Catastrophic Forgetting in Self Organizing Maps with
Internally-Induced Generative Replay [67.50637511633212]
A lifelong learning agent is able to continually learn from potentially infinite streams of pattern sensory data.
One major historic difficulty in building agents that adapt is that neural systems struggle to retain previously-acquired knowledge when learning from new samples.
This problem is known as catastrophic forgetting (interference) and remains an unsolved problem in the domain of machine learning to this day.
arXiv Detail & Related papers (2021-12-09T07:11:14Z) - About Explicit Variance Minimization: Training Neural Networks for
Medical Imaging With Limited Data Annotations [2.3204178451683264]
Variance Aware Training (VAT) method exploits this property by introducing the variance error into the model loss function.
We validate VAT on three medical imaging datasets from diverse domains and various learning objectives.
arXiv Detail & Related papers (2021-05-28T21:34:04Z) - Self-Adaptive Transfer Learning for Multicenter Glaucoma Classification
in Fundus Retina Images [9.826586293806837]
We propose a self-adaptive transfer learning (SATL) strategy to fill the domain gap between multicenter datasets.
Specifically, the encoder of a DL model that is pre-trained on the source domain is used to initialize the encoder of a reconstruction model.
Results demonstrate that the proposed SATL strategy is effective in the domain adaptation task between a private and two public glaucoma diagnosis datasets.
arXiv Detail & Related papers (2021-05-07T05:20:37Z) - HM4: Hidden Markov Model with Memory Management for Visual Place
Recognition [54.051025148533554]
We develop a Hidden Markov Model approach for visual place recognition in autonomous driving.
Our algorithm, dubbed HM$4$, exploits temporal look-ahead to transfer promising candidate images between passive storage and active memory.
We show that this allows constant time and space inference for a fixed coverage area.
arXiv Detail & Related papers (2020-11-01T08:49:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.