Memory-based Semantic Segmentation for Off-road Unstructured Natural
Environments
- URL: http://arxiv.org/abs/2108.05635v1
- Date: Thu, 12 Aug 2021 10:04:47 GMT
- Title: Memory-based Semantic Segmentation for Off-road Unstructured Natural
Environments
- Authors: Youngsaeng Jin, David K. Han and Hanseok Ko
- Abstract summary: We propose a built-in memory module for semantic segmentation.
The memory module stores significant representations of training images as memory items.
We conduct experiments on the Robot Unstructured Ground Driving dataset and RELLIS dataset.
- Score: 29.498304237783763
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: With the availability of many datasets tailored for autonomous driving in
real-world urban scenes, semantic segmentation for urban driving scenes
achieves significant progress. However, semantic segmentation for off-road,
unstructured environments is not widely studied. Directly applying existing
segmentation networks often results in performance degradation as they cannot
overcome intrinsic problems in such environments, such as illumination changes.
In this paper, a built-in memory module for semantic segmentation is proposed
to overcome these problems. The memory module stores significant
representations of training images as memory items. In addition to the encoder
embedding like items together, the proposed memory module is specifically
designed to cluster together instances of the same class even when there are
significant variances in embedded features. Therefore, it makes segmentation
networks better deal with unexpected illumination changes. A triplet loss is
used in training to minimize redundancy in storing discriminative
representations of the memory module. The proposed memory module is general so
that it can be adopted in a variety of networks. We conduct experiments on the
Robot Unstructured Ground Driving (RUGD) dataset and RELLIS dataset, which are
collected from off-road, unstructured natural environments. Experimental
results show that the proposed memory module improves the performance of
existing segmentation networks and contributes to capturing unclear objects
over various off-road, unstructured natural scenes with equivalent
computational cost and network parameters. As the proposed method can be
integrated into compact networks, it presents a viable approach for
resource-limited small autonomous platforms.
Related papers
- Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation [65.23793829741014]
Embodied-RAG is a framework that enhances the model of an embodied agent with a non-parametric memory system.
At its core, Embodied-RAG's memory is structured as a semantic forest, storing language descriptions at varying levels of detail.
We demonstrate that Embodied-RAG effectively bridges RAG to the robotics domain, successfully handling over 200 explanation and navigation queries.
arXiv Detail & Related papers (2024-09-26T21:44:11Z) - MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation [2.0293118701268154]
We propose a novel MultiResolution Feature Perturbation (MRFP) technique to randomize domain-specific fine-grained features and perturb style of coarse features.
MRFP helps state-of-the-art deep neural networks to learn robust domain invariant features for simulation-to-real semantic segmentation.
arXiv Detail & Related papers (2023-11-30T08:02:49Z) - Prompt-Matched Semantic Segmentation [96.99924127527002]
The objective of this work is to explore how to effectively adapt pre-trained foundation models to various downstream tasks of image semantic segmentation.
We propose a novel Inter-Stage Prompt-Matched Framework, which maintains the original structure of the foundation model while generating visual prompts adaptively for task-oriented tuning.
A lightweight module termed Semantic-aware Prompt Matcher is then introduced to hierarchically interpolate between two stages to learn reasonable prompts for each specific task.
arXiv Detail & Related papers (2022-08-22T09:12:53Z) - Unsupervised domain adaptation semantic segmentation of high-resolution
remote sensing imagery with invariant domain-level context memory [10.210120085157161]
This study proposes a novel unsupervised domain adaptation semantic segmentation network (MemoryAdaptNet) for the semantic segmentation of HRS imagery.
MemoryAdaptNet constructs an output space adversarial learning scheme to bridge the domain distribution discrepancy between source domain and target domain.
Experiments under three cross-domain tasks indicate that our proposed MemoryAdaptNet is remarkably superior to the state-of-the-art methods.
arXiv Detail & Related papers (2022-08-16T12:35:57Z) - Pin the Memory: Learning to Generalize Semantic Segmentation [68.367763672095]
We present a novel memory-guided domain generalization method for semantic segmentation based on meta-learning framework.
Our method abstracts the conceptual knowledge of semantic classes into categorical memory which is constant beyond the domains.
arXiv Detail & Related papers (2022-04-07T17:34:01Z) - Memory-Guided Semantic Learning Network for Temporal Sentence Grounding [55.31041933103645]
We propose a memory-augmented network that learns and memorizes the rarely appeared content in TSG tasks.
MGSL-Net consists of three main parts: a cross-modal inter-action module, a memory augmentation module, and a heterogeneous attention module.
arXiv Detail & Related papers (2022-01-03T02:32:06Z) - RICE: Refining Instance Masks in Cluttered Environments with Graph
Neural Networks [53.15260967235835]
We propose a novel framework that refines the output of such methods by utilizing a graph-based representation of instance masks.
We train deep networks capable of sampling smart perturbations to the segmentations, and a graph neural network, which can encode relations between objects, to evaluate the segmentations.
We demonstrate an application that uses uncertainty estimates generated by our method to guide a manipulator, leading to efficient understanding of cluttered scenes.
arXiv Detail & Related papers (2021-06-29T20:29:29Z) - End-to-End Egospheric Spatial Memory [32.42361470456194]
We propose a parameter-free module, Egospheric Spatial Memory (ESM), which encodes the memory in an ego-sphere around the agent.
ESM can be trained end-to-end via either imitation or reinforcement learning.
We show applications to semantic segmentation on the ScanNet dataset, where ESM naturally combines image-level and map-level inference modalities.
arXiv Detail & Related papers (2021-02-15T18:59:07Z) - Video Object Segmentation with Episodic Graph Memory Networks [198.74780033475724]
A graph memory network is developed to address the novel idea of "learning to update the segmentation model"
We exploit an episodic memory network, organized as a fully connected graph, to store frames as nodes and capture cross-frame correlations by edges.
The proposed graph memory network yields a neat yet principled framework, which can generalize well both one-shot and zero-shot video object segmentation tasks.
arXiv Detail & Related papers (2020-07-14T13:19:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.