CoDEPS: Online Continual Learning for Depth Estimation and Panoptic
Segmentation
- URL: http://arxiv.org/abs/2303.10147v2
- Date: Wed, 31 May 2023 09:05:34 GMT
- Title: CoDEPS: Online Continual Learning for Depth Estimation and Panoptic
Segmentation
- Authors: Niclas V\"odisch, K\"ursat Petek, Wolfram Burgard, Abhinav Valada
- Abstract summary: We introduce continual learning for deep learning-based monocular depth estimation and panoptic segmentation in new environments in an online manner.
We propose a novel domain-mixing strategy to generate pseudo-labels to adapt panoptic segmentation.
We explicitly address the limited storage capacity of robotic systems by leveraging sampling strategies for constructing a fixed-size replay buffer.
- Score: 28.782231314289174
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Operating a robot in the open world requires a high level of robustness with
respect to previously unseen environments. Optimally, the robot is able to
adapt by itself to new conditions without human supervision, e.g.,
automatically adjusting its perception system to changing lighting conditions.
In this work, we address the task of continual learning for deep learning-based
monocular depth estimation and panoptic segmentation in new environments in an
online manner. We introduce CoDEPS to perform continual learning involving
multiple real-world domains while mitigating catastrophic forgetting by
leveraging experience replay. In particular, we propose a novel domain-mixing
strategy to generate pseudo-labels to adapt panoptic segmentation. Furthermore,
we explicitly address the limited storage capacity of robotic systems by
leveraging sampling strategies for constructing a fixed-size replay buffer
based on rare semantic class sampling and image diversity. We perform extensive
evaluations of CoDEPS on various real-world datasets demonstrating that it
successfully adapts to unseen environments without sacrificing performance on
previous domains while achieving state-of-the-art results. The code of our work
is publicly available at http://codeps.cs.uni-freiburg.de.
Related papers
- ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments [13.988804095409133]
We propose the ReALFRED benchmark that employs real-world scenes, objects, and room layouts to learn agents to complete household tasks.
Specifically, we extend the ALFRED benchmark with updates for larger environmental spaces with smaller visual domain gaps.
With ReALFRED, we analyze previously crafted methods for the ALFRED benchmark and observe that they consistently yield lower performance in all metrics.
arXiv Detail & Related papers (2024-07-26T07:00:27Z) - ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning [54.68180752416519]
Panoptic segmentation is a cutting-edge computer vision task.
We introduce a novel and efficient method for continual panoptic segmentation based on Visual Prompt Tuning, dubbed ECLIPSE.
Our approach involves freezing the base model parameters and fine-tuning only a small set of prompt embeddings, addressing both catastrophic forgetting and plasticity.
arXiv Detail & Related papers (2024-03-29T11:31:12Z) - SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World
Semantic Scene Understanding [34.19666841489646]
We show how a robot can autonomously discover novel semantic classes and improve accuracy on known classes when exploring an unknown environment.
We develop a general framework for mapping and clustering that we then use to generate a self-supervised learning signal to update a semantic segmentation model.
In particular, we show how clustering parameters can be optimized during deployment and that fusion of multiple observation modalities improves novel object discovery compared to prior work.
arXiv Detail & Related papers (2022-06-21T18:41:51Z) - Continual Predictive Learning from Videos [100.27176974654559]
We study a new continual learning problem in the context of video prediction.
We propose the continual predictive learning (CPL) approach, which learns a mixture world model via predictive experience replay.
We construct two new benchmarks based on RoboNet and KTH, in which different tasks correspond to different physical robotic environments or human actions.
arXiv Detail & Related papers (2022-04-12T08:32:26Z) - Stronger Generalization Guarantees for Robot Learning by Combining
Generative Models and Real-World Data [5.935761705025763]
We provide a framework for providing generalization guarantees by leveraging a finite dataset of real-world environments.
We demonstrate our approach on two simulated systems with nonlinear/hybrid dynamics and rich sensing modalities.
arXiv Detail & Related papers (2021-11-16T20:13:10Z) - Evaluating Continual Learning Algorithms by Generating 3D Virtual
Environments [66.83839051693695]
Continual learning refers to the ability of humans and animals to incrementally learn over time in a given environment.
We propose to leverage recent advances in 3D virtual environments in order to approach the automatic generation of potentially life-long dynamic scenes with photo-realistic appearance.
A novel element of this paper is that scenes are described in a parametric way, thus allowing the user to fully control the visual complexity of the input stream the agent perceives.
arXiv Detail & Related papers (2021-09-16T10:37:21Z) - Bayesian Embeddings for Few-Shot Open World Recognition [60.39866770427436]
We extend embedding-based few-shot learning algorithms to the open-world recognition setting.
We benchmark our framework on open-world extensions of the common MiniImageNet and TieredImageNet few-shot learning datasets.
arXiv Detail & Related papers (2021-07-29T00:38:47Z) - Towards Real-World Category-level Articulation Pose Estimation [46.813224754603866]
Category-level Articulation Pose Estimation (CAPE) methods are studied under the single-instance setting with a fixed kinematic structure for each category.
Considering these limitations, we reform this problem setting for real-world environments and suggest a CAPE-Real (CAPER) task setting.
This setting allows varied kinematic structures within a semantic category, and multiple instances to co-exist in an observation of real world.
arXiv Detail & Related papers (2021-05-07T13:41:16Z) - Learning to Continuously Optimize Wireless Resource In Episodically
Dynamic Environment [55.91291559442884]
This work develops a methodology that enables data-driven methods to continuously learn and optimize in a dynamic environment.
We propose to build the notion of continual learning into the modeling process of learning wireless systems.
Our design is based on a novel min-max formulation which ensures certain fairness" across different data samples.
arXiv Detail & Related papers (2020-11-16T08:24:34Z) - Unsupervised Domain Adaptation in Person re-ID via k-Reciprocal
Clustering and Large-Scale Heterogeneous Environment Synthesis [76.46004354572956]
We introduce an unsupervised domain adaptation approach for person re-identification.
Experimental results show that the proposed ktCUDA and SHRED approach achieves an average improvement of +5.7 mAP in re-identification performance.
arXiv Detail & Related papers (2020-01-14T17:43:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.